FileMood

Download Reinforcement Learning Specialization

Name

Reinforcement Learning Specialization

Total Size

5.0 GB

Total Files

699

Last Seen

2024-07-23 00:00

Hash

E00A4FC3F94EF3FF923884F09A47FFF540D7EE60

/.../03_generalized-policy-iteration/

04_warren-powell-approximate-dynamic-programming-for-fleet-management-long.mp4

152.4 MB

04_warren-powell-approximate-dynamic-programming-for-fleet-management-long.en.srt

41.7 KB

04_warren-powell-approximate-dynamic-programming-for-fleet-management-long.en.txt

21.9 KB

03_warren-powell-approximate-dynamic-programming-for-fleet-management-short.en.srt

12.4 KB

02_efficiency-of-dynamic-programming.en.srt

7.9 KB

03_warren-powell-approximate-dynamic-programming-for-fleet-management-short.en.txt

7.7 KB

01_flexibility-of-the-policy-iteration-framework.en.srt

7.2 KB

02_efficiency-of-dynamic-programming.en.txt

5.0 KB

05_week-4-summary.en.txt

2.4 KB

06_chapter-summary_instructions.html

1.2 KB

05_week-4-summary.en.srt

4.6 KB

01_flexibility-of-the-policy-iteration-framework.en.txt

3.9 KB

06_chapter-summary_RLbook2018.pdf

89.4 MB

03_warren-powell-approximate-dynamic-programming-for-fleet-management-short.mp4

49.4 MB

02_efficiency-of-dynamic-programming.mp4

14.7 MB

01_flexibility-of-the-policy-iteration-framework.mp4

13.0 MB

05_week-4-summary.mp4

10.1 MB

/

TutsNode.net.txt

0.1 KB

[TGx]Downloaded from torrentgalaxy.to .txt

0.6 KB

/.../04_weekly-assessment/

01_sequential-decision-making_quiz.html

215.4 KB

02_bandits-and-exploration-exploitation_instructions.html

1.2 KB

/.../01_course-introduction/

01_course-4-introduction.en.txt

2.3 KB

03_reinforcement-learning-textbook_instructions.html

2.2 KB

04_pre-requisites-and-learning-objectives_A_Complete_Reinforcement_Learning_System_Capstone__Learning_Objectives.pdf

58.2 KB

02_meet-your-instructors.en.srt

13.8 KB

02_meet-your-instructors.en.txt

8.8 KB

01_course-4-introduction.en.srt

4.3 KB

04_pre-requisites-and-learning-objectives_instructions.html

3.7 KB

03_reinforcement-learning-textbook_RLbook2018.pdf

89.4 MB

02_meet-your-instructors.mp4

46.0 MB

01_course-4-introduction.mp4

23.2 MB

/.../04_weekly-assessment/

01_dynamic-programming_quiz.html

161.3 KB

02_optimal-policies-with-dynamic-programming_instructions.html

1.2 KB

/.../01_course-introduction/

04_read-me-pre-requisites-and-learning-objectives_Course_2__Sample_Based_Learning_Methods_Learning_Objectives.pdf

85.1 KB

02_meet-your-instructors.en.srt

13.8 KB

02_meet-your-instructors.en.txt

8.8 KB

01_course-introduction.en.srt

4.1 KB

04_read-me-pre-requisites-and-learning-objectives_instructions.html

3.0 KB

03_reinforcement-learning-textbook_instructions.html

2.2 KB

01_course-introduction.en.txt

2.2 KB

03_reinforcement-learning-textbook_RLbook2018.pdf

89.4 MB

02_meet-your-instructors.mp4

46.0 MB

01_course-introduction.mp4

11.8 MB

/.../01_course-introduction/

06_read-me-pre-requisites-and-learning-objectives_Fundamentals_of_Reinforcement_Learning__Learning_Objectives.pdf

66.2 KB

02_course-introduction.en.txt

5.8 KB

05_reinforcement-learning-textbook_RLbook2018.pdf

89.4 MB

03_meet-your-instructors.en.srt

16.3 KB

02_course-introduction.en.srt

10.7 KB

03_meet-your-instructors.en.txt

8.6 KB

01_specialization-introduction.en.txt

2.7 KB

05_reinforcement-learning-textbook_instructions.html

2.2 KB

06_read-me-pre-requisites-and-learning-objectives_instructions.html

2.7 KB

04_your-specialization-roadmap.en.srt

6.8 KB

01_specialization-introduction.en.srt

5.0 KB

04_your-specialization-roadmap.en.txt

3.5 KB

03_meet-your-instructors.mp4

46.0 MB

02_course-introduction.mp4

34.0 MB

01_specialization-introduction.mp4

19.1 MB

04_your-specialization-roadmap.mp4

15.6 MB

/.../01_course-introduction/

03_read-me-pre-requisites-and-learning-objectives_Prediction_and_Control_with_Function_Approximation_Learning_Objectives.pdf

61.4 KB

02_meet-your-instructors.en.srt

13.8 KB

01_course-3-introduction.en.srt

9.1 KB

02_meet-your-instructors.en.txt

8.8 KB

01_course-3-introduction.en.txt

4.8 KB

04_reinforcement-learning-textbook_instructions.html

2.2 KB

03_read-me-pre-requisites-and-learning-objectives_instructions.html

3.3 KB

04_reinforcement-learning-textbook_RLbook2018.pdf

89.4 MB

02_meet-your-instructors.mp4

46.0 MB

01_course-3-introduction.mp4

17.1 MB

/.../04_off-policy-learning-for-prediction/

04_emma-brunskill-batch-reinforcement-learning.en.srt

25.5 KB

04_emma-brunskill-batch-reinforcement-learning.en.txt

13.5 KB

03_off-policy-monte-carlo-prediction.en.srt

8.0 KB

02_importance-sampling.en.srt

6.7 KB

01_why-does-off-policy-learning-matter.en.srt

6.1 KB

05_week-1-summary.en.srt

5.7 KB

03_off-policy-monte-carlo-prediction.en.txt

4.2 KB

01_why-does-off-policy-learning-matter.en.txt

3.9 KB

02_importance-sampling.en.txt

3.6 KB

05_week-1-summary.en.txt

3.0 KB

06_chapter-summary_instructions.html

1.2 KB

06_chapter-summary_RLbook2018.pdf

89.4 MB

04_emma-brunskill-batch-reinforcement-learning.mp4

39.2 MB

01_why-does-off-policy-learning-matter.mp4

15.1 MB

03_off-policy-monte-carlo-prediction.mp4

13.1 MB

05_week-1-summary.mp4

10.1 MB

02_importance-sampling.mp4

7.8 MB

/.pad/

0 through 171 (172 padding files)

0.0 KB to 1.0 MB each

/.../04_weekly-assessment/

02_graded-value-functions-and-bellman-equations_exam.html

31.8 KB

01_practice-value-functions-and-bellman-equations_quiz.html

8.2 KB

/.../03_average-reward/

02_satinder-singh-on-intrinsic-rewards.en.srt

21.5 KB

01_average-reward-a-new-way-of-formulating-control-problems.en.srt

15.5 KB

02_satinder-singh-on-intrinsic-rewards.en.txt

11.2 KB

01_average-reward-a-new-way-of-formulating-control-problems.en.txt

9.7 KB

03_week-3-review.en.srt

4.8 KB

03_week-3-review.en.txt

2.5 KB

02_satinder-singh-on-intrinsic-rewards.mp4

28.2 MB

01_average-reward-a-new-way-of-formulating-control-problems.mp4

20.0 MB

03_week-3-review.mp4

9.3 MB

/.../02_goal-of-reinforcement-learning/

02_michael-littman-the-reward-hypothesis.en.srt

18.9 KB

02_michael-littman-the-reward-hypothesis.en.txt

11.9 KB

01_the-goal-of-reinforcement-learning.en.txt

2.7 KB

01_the-goal-of-reinforcement-learning.en.srt

5.0 KB

02_michael-littman-the-reward-hypothesis.mp4

88.1 MB

01_the-goal-of-reinforcement-learning.mp4

8.4 MB

/.../02_advantages-of-td/

03_andy-barto-and-rich-sutton-more-on-the-history-of-rl.en.srt

16.3 KB

01_the-advantages-of-temporal-difference-learning.en.srt

8.4 KB

02_comparing-td-and-monte-carlo.en.srt

8.3 KB

03_andy-barto-and-rich-sutton-more-on-the-history-of-rl.en.txt

8.3 KB

01_the-advantages-of-temporal-difference-learning.en.txt

4.4 KB

02_comparing-td-and-monte-carlo.en.txt

4.4 KB

04_week-2-summary.en.srt

3.2 KB

04_week-2-summary.en.txt

1.7 KB

03_andy-barto-and-rich-sutton-more-on-the-history-of-rl.mp4

84.1 MB

02_comparing-td-and-monte-carlo.mp4

10.3 MB

01_the-advantages-of-temporal-difference-learning.mp4

9.5 MB

04_week-2-summary.mp4

7.8 MB

/.../02_project-resources/

03_lets-review-average-reward-a-new-way-of-formulating-control-problems.en.srt

15.5 KB

01_lets-review-expected-sarsa.en.txt

2.9 KB

02_lets-review-what-is-q-learning.en.txt

2.7 KB

05_csaba-szepesvari-on-problem-landscape.en.srt

9.8 KB

03_lets-review-average-reward-a-new-way-of-formulating-control-problems.en.txt

9.7 KB

04_lets-review-actor-critic-algorithm.en.srt

9.4 KB

05_csaba-szepesvari-on-problem-landscape.en.txt

6.2 KB

06_andy-and-rich-advice-for-students.en.srt

6.0 KB

02_lets-review-what-is-q-learning.en.srt

5.1 KB

04_lets-review-actor-critic-algorithm.en.txt

5.0 KB

01_lets-review-expected-sarsa.en.srt

4.6 KB

06_andy-and-rich-advice-for-students.en.txt

3.6 KB

05_csaba-szepesvari-on-problem-landscape.mp4

40.7 MB

06_andy-and-rich-advice-for-students.mp4

35.0 MB

03_lets-review-average-reward-a-new-way-of-formulating-control-problems.mp4

20.0 MB

04_lets-review-actor-critic-algorithm.mp4

14.8 MB

02_lets-review-what-is-q-learning.mp4

8.2 MB

01_lets-review-expected-sarsa.mp4

6.6 MB

/.../02_project-resources/

02_lets-review-examples-of-episodic-and-continuing-tasks.en.txt

2.6 KB

01_lets-review-markov-decision-processes.en.srt

9.8 KB

01_lets-review-markov-decision-processes.en.txt

5.3 KB

02_lets-review-examples-of-episodic-and-continuing-tasks.en.srt

4.8 KB

01_lets-review-markov-decision-processes.mp4

13.0 MB

02_lets-review-examples-of-episodic-and-continuing-tasks.mp4

9.6 MB

/.../03_training-neural-networks/

03_david-silver-on-deep-learning-rl-ai.en.srt

15.1 KB

01_gradient-descent-for-training-neural-networks.en.srt

14.3 KB

03_david-silver-on-deep-learning-rl-ai.en.txt

9.7 KB

02_optimization-strategies-for-nns.en.srt

8.6 KB

01_gradient-descent-for-training-neural-networks.en.txt

7.6 KB

02_optimization-strategies-for-nns.en.txt

4.6 KB

04_week-2-review.en.srt

4.2 KB

04_week-2-review.en.txt

2.2 KB

03_david-silver-on-deep-learning-rl-ai.mp4

43.4 MB

01_gradient-descent-for-training-neural-networks.mp4

16.3 MB

02_optimization-strategies-for-nns.mp4

15.0 MB

04_week-2-review.mp4

8.9 MB

/.../01_weekly-learning-goals/

01_meeting-with-niko-choosing-the-learning-algorithm.en.txt

2.9 KB

01_meeting-with-niko-choosing-the-learning-algorithm.en.srt

4.7 KB

01_meeting-with-niko-choosing-the-learning-algorithm.mp4

8.3 MB

/.../03_exploration-vs-exploitation-tradeoff/

04_jonathan-langford-contextual-bandits-for-real-world-reinforcement-learning.en.srt

14.4 KB

01_what-is-the-trade-off.en.srt

12.5 KB

02_optimistic-initial-values.en.srt

8.7 KB

03_upper-confidence-bound-ucb-action-selection.en.srt

7.7 KB

04_jonathan-langford-contextual-bandits-for-real-world-reinforcement-learning.en.txt

7.5 KB

05_week-1-summary.en.txt

2.7 KB

06_chapter-summary_instructions.html

1.2 KB

01_what-is-the-trade-off.en.txt

6.7 KB

02_optimistic-initial-values.en.txt

5.5 KB

05_week-1-summary.en.srt

4.4 KB

03_upper-confidence-bound-ucb-action-selection.en.txt

4.1 KB

06_chapter-summary_RLbook2018.pdf

89.4 MB

01_what-is-the-trade-off.mp4

22.6 MB

02_optimistic-initial-values.mp4

13.8 MB

04_jonathan-langford-contextual-bandits-for-real-world-reinforcement-learning.mp4

12.5 MB

03_upper-confidence-bound-ucb-action-selection.mp4

12.3 MB

05_week-1-summary.mp4

9.9 MB

/.../01_policy-evaluation-prediction/

04_iterative-policy-evaluation.en.srt

14.0 KB

04_iterative-policy-evaluation.en.txt

7.3 KB

03_policy-evaluation-vs-control.en.srt

6.8 KB

01_module-4-learning-objectives_instructions.html

3.1 KB

02_weekly-reading_instructions.html

1.2 KB

03_policy-evaluation-vs-control.en.txt

4.3 KB

02_weekly-reading_RLbook2018.pdf

89.4 MB

04_iterative-policy-evaluation.mp4

19.7 MB

03_policy-evaluation-vs-control.mp4

14.0 MB

/.../02_project-resources/

02_joelle-pineau-about-rl-that-matters.en.srt

14.0 KB

02_joelle-pineau-about-rl-that-matters.en.txt

9.0 KB

01_lets-review-comparing-td-and-monte-carlo.en.srt

8.3 KB

01_lets-review-comparing-td-and-monte-carlo.en.txt

4.4 KB

02_joelle-pineau-about-rl-that-matters.mp4

30.9 MB

01_lets-review-comparing-td-and-monte-carlo.mp4

10.3 MB

/.../02_policy-iteration-control/

02_policy-iteration.en.srt

13.7 KB

02_policy-iteration.en.txt

7.3 KB

01_policy-improvement.en.srt

6.7 KB

01_policy-improvement.en.txt

3.6 KB

02_policy-iteration.mp4

18.7 MB

01_policy-improvement.mp4

10.5 MB

/.../04_policy-parameterizations/

03_gaussian-policies-for-continuous-actions.en.srt

13.1 KB

02_demonstration-with-actor-critic.en.srt

11.1 KB

04_week-4-summary.en.srt

7.2 KB

03_gaussian-policies-for-continuous-actions.en.txt

7.1 KB

01_actor-critic-with-softmax-policies.en.srt

6.1 KB

02_demonstration-with-actor-critic.en.txt

6.0 KB

01_actor-critic-with-softmax-policies.en.txt

3.8 KB

04_week-4-summary.en.txt

3.7 KB

02_demonstration-with-actor-critic.mp4

30.2 MB

03_gaussian-policies-for-continuous-actions.mp4

20.9 MB

01_actor-critic-with-softmax-policies.mp4

17.3 MB

04_week-4-summary.mp4

10.4 MB

/.../01_final-project-milestone-1/

02_andy-barto-on-what-are-eligibility-traces-and-why-are-they-so-named.en.srt

12.8 KB

01_initial-project-meeting-with-martha-formalizing-the-problem.en.srt

6.9 KB

02_andy-barto-on-what-are-eligibility-traces-and-why-are-they-so-named.en.txt

6.8 KB

01_initial-project-meeting-with-martha-formalizing-the-problem.en.txt

4.3 KB

02_andy-barto-on-what-are-eligibility-traces-and-why-are-they-so-named.mp4

40.4 MB

01_initial-project-meeting-with-martha-formalizing-the-problem.mp4

13.9 MB

/.../03_optimality-optimal-policies-value-functions/

01_optimal-policies.en.srt

12.5 KB

03_using-optimal-value-functions-to-get-optimal-policies.en.srt

11.1 KB

02_optimal-value-functions.en.srt

8.5 KB

03_using-optimal-value-functions-to-get-optimal-policies.en.txt

6.9 KB

01_optimal-policies.en.txt

6.6 KB

04_week-3-summary.en.srt

6.5 KB

05_chapter-summary_RLbook2018.pdf

89.4 MB

05_chapter-summary_instructions.html

1.2 KB

02_optimal-value-functions.en.txt

4.6 KB

04_week-3-summary.en.txt

3.4 KB

01_optimal-policies.mp4

19.4 MB

03_using-optimal-value-functions-to-get-optimal-policies.mp4

17.5 MB

04_week-3-summary.mp4

12.5 MB

02_optimal-value-functions.mp4

10.7 MB

/.../04_weekly-assesment/

01_mdps_quiz.html

12.1 KB

02_graded-assignment-describe-three-mdps_peer_assignment_instructions.html

2.4 KB

/.../03_the-objective-for-td/

03_doina-precup-building-knowledge-for-ai-agents-with-reinforcement-learning.en.srt

11.6 KB

02_comparing-td-and-monte-carlo-with-state-aggregation.en.srt

7.0 KB

03_doina-precup-building-knowledge-for-ai-agents-with-reinforcement-learning.en.txt

6.2 KB

01_semi-gradient-td-for-policy-evaluation.en.srt

4.7 KB

01_semi-gradient-td-for-policy-evaluation.en.txt

2.9 KB

02_comparing-td-and-monte-carlo-with-state-aggregation.en.txt

3.7 KB

03_doina-precup-building-knowledge-for-ai-agents-with-reinforcement-learning.mp4

58.0 MB

01_semi-gradient-td-for-policy-evaluation.mp4

16.1 MB

02_comparing-td-and-monte-carlo-with-state-aggregation.mp4

12.1 MB

/.../01_introduction-to-temporal-difference-learning/

04_rich-sutton-the-importance-of-td-learning.en.srt

11.5 KB

03_what-is-temporal-difference-td-learning.en.srt

8.0 KB

04_rich-sutton-the-importance-of-td-learning.en.txt

6.0 KB

03_what-is-temporal-difference-td-learning.en.txt

4.2 KB

01_module-2-learning-objectives_instructions.html

1.8 KB

02_weekly-reading_instructions.html

1.2 KB

02_weekly-reading_RLbook2018.pdf

89.4 MB

04_rich-sutton-the-importance-of-td-learning.mp4

37.4 MB

03_what-is-temporal-difference-td-learning.mp4

10.8 MB

/.../01_weekly-learning-goals/

01_agent-architecture-meeting-with-martha-overview-of-design-choices.en.srt

11.1 KB

01_agent-architecture-meeting-with-martha-overview-of-design-choices.en.txt

5.9 KB

01_agent-architecture-meeting-with-martha-overview-of-design-choices.mp4

16.4 MB

/.../01_introduction-to-monte-carlo-methods/

04_using-monte-carlo-for-prediction.en.srt

10.9 KB

03_what-is-monte-carlo.en.srt

10.8 KB

03_what-is-monte-carlo.en.txt

5.8 KB

04_using-monte-carlo-for-prediction.en.txt

5.7 KB

01_module-1-learning-objectives_instructions.html

3.1 KB

02_weekly-reading_instructions.html

1.2 KB

02_weekly-reading_RLbook2018.pdf

89.4 MB

04_using-monte-carlo-for-prediction.mp4

17.0 MB

03_what-is-monte-carlo.mp4

15.6 MB

/.../02_project-resources/

02_drew-bagnell-on-system-id-optimal-control.en.srt

10.8 KB

03_susan-murphy-on-rl-in-mobile-health.en.srt

10.6 KB

02_drew-bagnell-on-system-id-optimal-control.en.txt

6.9 KB

03_susan-murphy-on-rl-in-mobile-health.en.txt

6.5 KB

01_lets-review-non-linear-approximation-with-neural-networks.en.srt

6.2 KB

01_lets-review-non-linear-approximation-with-neural-networks.en.txt

3.9 KB

02_drew-bagnell-on-system-id-optimal-control.mp4

32.8 MB

03_susan-murphy-on-rl-in-mobile-health.mp4

29.0 MB

01_lets-review-non-linear-approximation-with-neural-networks.mp4

10.1 MB

/.../01_policies-and-value-functions/

05_rich-sutton-and-andy-barto-a-brief-history-of-rl.en.srt

10.7 KB

04_value-functions.en.srt

10.6 KB

02_weekly-reading_instructions.html

1.2 KB

03_specifying-policies.en.srt

7.7 KB

04_value-functions.en.txt

5.7 KB

05_rich-sutton-and-andy-barto-a-brief-history-of-rl.en.txt

5.5 KB

03_specifying-policies.en.txt

4.1 KB

01_module-3-learning-objectives_instructions.html

3.3 KB

02_weekly-reading_RLbook2018.pdf

89.4 MB

05_rich-sutton-and-andy-barto-a-brief-history-of-rl.mp4

51.1 MB

04_value-functions.mp4

22.1 MB

03_specifying-policies.mp4

15.7 MB

/.../01_estimating-values-functions-with-supervised-learning/

03_moving-to-parameterized-functions.en.srt

10.7 KB

04_generalization-and-discrimination.en.srt

8.9 KB

05_framing-value-estimation-as-supervised-learning.en.srt

6.4 KB

03_moving-to-parameterized-functions.en.txt

5.7 KB

04_generalization-and-discrimination.en.txt

4.7 KB

02_weekly-reading-on-policy-prediction-with-approximation_instructions.html

1.2 KB

01_module-1-learning-objectives_instructions.html

4.1 KB

05_framing-value-estimation-as-supervised-learning.en.txt

3.4 KB

02_weekly-reading-on-policy-prediction-with-approximation_RLbook2018.pdf

89.4 MB

03_moving-to-parameterized-functions.mp4

25.6 MB

04_generalization-and-discrimination.mp4

13.5 MB

05_framing-value-estimation-as-supervised-learning.mp4

11.2 MB

/.../04_dealing-with-inaccurate-models/

03_drew-bagnell-self-driving-robotics-and-model-based-rl.en.srt

10.6 KB

02_in-depth-with-changing-environments.en.srt

9.4 KB

01_what-if-the-model-is-inaccurate.en.srt

7.2 KB

03_drew-bagnell-self-driving-robotics-and-model-based-rl.en.txt

6.8 KB

02_in-depth-with-changing-environments.en.txt

5.0 KB

01_what-if-the-model-is-inaccurate.en.txt

3.9 KB

04_week-4-summary.en.srt

2.6 KB

04_week-4-summary.en.txt

1.4 KB

06_text-book-part-1-summary_instructions.html

1.2 KB

05_chapter-summary_instructions.html

1.2 KB

05_chapter-summary_RLbook2018.pdf

89.4 MB

06_text-book-part-1-summary_RLbook2018.pdf

89.4 MB

03_drew-bagnell-self-driving-robotics-and-model-based-rl.mp4

36.9 MB

02_in-depth-with-changing-environments.mp4

12.5 MB

01_what-if-the-model-is-inaccurate.mp4

8.1 MB

04_week-4-summary.mp4

4.5 MB

/.../02_the-objective-for-on-policy-prediction/

04_state-aggregation-with-monte-carlo.en.srt

10.5 KB

02_introducing-gradient-descent.en.srt

10.1 KB

03_gradient-monte-for-policy-evaluation.en.srt

9.5 KB

01_the-value-error-objective.en.srt

6.4 KB

04_state-aggregation-with-monte-carlo.en.txt

6.4 KB

02_introducing-gradient-descent.en.txt

6.3 KB

03_gradient-monte-for-policy-evaluation.en.txt

5.0 KB

01_the-value-error-objective.en.txt

3.5 KB

04_state-aggregation-with-monte-carlo.mp4

21.2 MB

03_gradient-monte-for-policy-evaluation.mp4

16.0 MB

02_introducing-gradient-descent.mp4

15.8 MB

01_the-value-error-objective.mp4

11.4 MB

/.../01_learning-parameterized-policies/

03_learning-policies-directly.en.srt

10.4 KB

04_advantages-of-policy-parameterization.en.srt

7.8 KB

03_learning-policies-directly.en.txt

5.6 KB

04_advantages-of-policy-parameterization.en.txt

4.8 KB

01_module-4-learning-objectives_instructions.html

2.9 KB

02_weekly-reading-policy-gradient-methods_instructions.html

1.2 KB

02_weekly-reading-policy-gradient-methods_RLbook2018.pdf

89.4 MB

04_advantages-of-policy-parameterization.mp4

27.3 MB

03_learning-policies-directly.mp4

17.9 MB

/.../02_bellman-equations/

01_bellman-equation-derivation.en.srt

9.9 KB

02_why-bellman-equations.en.srt

7.2 KB

01_bellman-equation-derivation.en.txt

5.3 KB

02_why-bellman-equations.en.txt

4.5 KB

01_bellman-equation-derivation.mp4

17.9 MB

02_why-bellman-equations.mp4

12.4 MB

/.../01_introduction-to-markov-decision-processes/

03_markov-decision-processes.en.srt

9.8 KB

01_module-2-learning-objectives_instructions.html

2.4 KB

02_weekly-reading_instructions.html

1.2 KB

04_examples-of-mdps.en.srt

7.0 KB

03_markov-decision-processes.en.txt

5.3 KB

04_examples-of-mdps.en.txt

3.8 KB

02_weekly-reading_RLbook2018.pdf

89.4 MB

03_markov-decision-processes.mp4

13.0 MB

04_examples-of-mdps.mp4

12.8 MB

/.../02_project-resources/

02_lets-review-expected-sarsa-with-function-approximation.en.txt

2.1 KB

01_lets-review-optimization-strategies-for-nns.en.srt

8.6 KB

05_martin-riedmiller-on-the-collect-and-infer-framework-for-data-efficient-rl.en.srt

8.4 KB

04_meeting-with-martha-in-depth-on-experience-replay.en.srt

7.5 KB

03_lets-review-dyna-q-learning-in-a-simple-maze.en.srt

7.1 KB

05_martin-riedmiller-on-the-collect-and-infer-framework-for-data-efficient-rl.en.txt

5.2 KB

04_meeting-with-martha-in-depth-on-experience-replay.en.txt

4.8 KB

01_lets-review-optimization-strategies-for-nns.en.txt

4.6 KB

03_lets-review-dyna-q-learning-in-a-simple-maze.en.txt

4.3 KB

02_lets-review-expected-sarsa-with-function-approximation.en.srt

4.0 KB

05_martin-riedmiller-on-the-collect-and-infer-framework-for-data-efficient-rl.mp4

24.7 MB

04_meeting-with-martha-in-depth-on-experience-replay.mp4

22.5 MB

01_lets-review-optimization-strategies-for-nns.mp4

15.0 MB

03_lets-review-dyna-q-learning-in-a-simple-maze.mp4

11.3 MB

02_lets-review-expected-sarsa-with-function-approximation.mp4

8.0 MB

/.../01_feature-construction-for-linear-methods/

04_generalization-properties-of-coarse-coding.en.srt

9.6 KB

06_using-tile-coding-in-td.en.srt

8.5 KB

05_tile-coding.en.srt

5.3 KB

04_generalization-properties-of-coarse-coding.en.txt

5.2 KB

03_coarse-coding.en.srt

5.0 KB

06_using-tile-coding-in-td.en.txt

4.4 KB

02_weekly-reading-on-policy-prediction-with-approximation-ii_instructions.html

1.3 KB

05_tile-coding.en.txt

2.9 KB

01_module-2-learning-objectives_instructions.html

3.2 KB

03_coarse-coding.en.txt

3.1 KB

02_weekly-reading-on-policy-prediction-with-approximation-ii_RLbook2018.pdf

89.4 MB

06_using-tile-coding-in-td.mp4

24.2 MB

04_generalization-properties-of-coarse-coding.mp4

18.8 MB

03_coarse-coding.mp4

10.1 MB

05_tile-coding.mp4

7.9 MB

/.../02_policy-gradient-for-continuing-tasks/

02_the-policy-gradient-theorem.en.srt

9.5 KB

01_the-objective-for-learning-policies.en.srt

9.1 KB

02_the-policy-gradient-theorem.en.txt

5.0 KB

01_the-objective-for-learning-policies.en.txt

4.9 KB

01_the-objective-for-learning-policies.mp4

14.0 MB

02_the-policy-gradient-theorem.mp4

9.8 MB

/.../03_actor-critic-for-continuing-tasks/

02_actor-critic-algorithm.en.srt

9.4 KB

01_estimating-the-policy-gradient.en.srt

7.7 KB

02_actor-critic-algorithm.en.txt

5.0 KB

01_estimating-the-policy-gradient.en.txt

4.7 KB

02_actor-critic-algorithm.mp4

14.8 MB

01_estimating-the-policy-gradient.mp4

14.3 MB

/.../01_the-k-armed-bandit-problem/

02_weekly-reading_RLbook2018.pdf

89.4 MB

03_sequential-decision-making-with-evaluative-feedback.en.srt

8.9 KB

01_module-1-learning-objectives_instructions.html

2.9 KB

02_weekly-reading_instructions.html

1.2 KB

03_sequential-decision-making-with-evaluative-feedback.en.txt

4.8 KB

03_sequential-decision-making-with-evaluative-feedback.mp4

17.1 MB

/.../01_episodic-sarsa-with-function-approximation/

04_episodic-sarsa-in-mountain-car.en.srt

8.9 KB

03_episodic-sarsa-with-function-approximation.en.srt

6.4 KB

04_episodic-sarsa-in-mountain-car.en.txt

4.8 KB

05_expected-sarsa-with-function-approximation.en.srt

4.0 KB

03_episodic-sarsa-with-function-approximation.en.txt

3.9 KB

01_module-3-learning-objectives_instructions.html

2.3 KB

02_weekly-reading-on-policy-control-with-approximation_instructions.html

1.3 KB

05_expected-sarsa-with-function-approximation.en.txt

2.1 KB

02_weekly-reading-on-policy-control-with-approximation_RLbook2018.pdf

89.4 MB

03_episodic-sarsa-with-function-approximation.mp4

18.9 MB

04_episodic-sarsa-in-mountain-car.mp4

16.2 MB

05_expected-sarsa-with-function-approximation.mp4

8.0 MB

/.../06_milestone-5-submit-your-parameter-study/03_congratulations/

01_meeting-with-martha-discussing-your-results.en.txt

2.5 KB

02_course-wrap-up.en.srt

3.0 KB

02_course-wrap-up.en.txt

1.9 KB

03_specialization-wrap-up.en.srt

5.5 KB

01_meeting-with-martha-discussing-your-results.en.srt

4.0 KB

03_specialization-wrap-up.en.txt

3.5 KB

03_specialization-wrap-up.mp4

19.5 MB

01_meeting-with-martha-discussing-your-results.mp4

11.5 MB

02_course-wrap-up.mp4

8.1 MB

/.../04_linear-td/

02_the-true-objective-for-td.en.srt

8.4 KB

03_week-1-summary.en.srt

6.8 KB

01_the-linear-td-update.en.srt

6.4 KB

02_the-true-objective-for-td.en.txt

4.4 KB

03_week-1-summary.en.txt

3.7 KB

01_the-linear-td-update.en.txt

3.4 KB

03_week-1-summary.mp4

17.1 MB

02_the-true-objective-for-td.mp4

14.3 MB

01_the-linear-td-update.mp4

10.4 MB

/.../01_weekly-learning-goals/

01_meeting-with-adam-parameter-studies-in-rl.en.srt

8.3 KB

01_meeting-with-adam-parameter-studies-in-rl.en.txt

5.2 KB

01_meeting-with-adam-parameter-studies-in-rl.mp4

12.0 MB

/.../02_what-to-learn-estimating-action-values/

02_estimating-action-values-incrementally.en.srt

8.2 KB

01_learning-action-values.en.srt

7.1 KB

02_estimating-action-values-incrementally.en.txt

4.4 KB

01_learning-action-values.en.txt

3.9 KB

02_estimating-action-values-incrementally.mp4

20.3 MB

01_learning-action-values.mp4

14.9 MB

/.../03_dyna-as-a-formalism-for-planning/

02_the-dyna-algorithm.en.srt

8.0 KB

01_the-dyna-architecture.en.srt

7.1 KB

03_dyna-q-learning-in-a-simple-maze.en.srt

7.1 KB

01_the-dyna-architecture.en.txt

4.4 KB

03_dyna-q-learning-in-a-simple-maze.en.txt

4.3 KB

02_the-dyna-algorithm.en.txt

4.3 KB

02_the-dyna-algorithm.mp4

11.8 MB

03_dyna-q-learning-in-a-simple-maze.mp4

11.3 MB

01_the-dyna-architecture.mp4

10.1 MB

/.../03_continuing-tasks/

01_continuing-tasks.en.srt

7.8 KB

02_examples-of-episodic-and-continuing-tasks.en.txt

2.6 KB

03_week-2-summary.en.srt

2.8 KB

03_week-2-summary.en.txt

1.5 KB

02_examples-of-episodic-and-continuing-tasks.en.srt

4.8 KB

01_continuing-tasks.en.txt

4.1 KB

01_continuing-tasks.mp4

13.3 MB

02_examples-of-episodic-and-continuing-tasks.mp4

9.6 MB

03_week-2-summary.mp4

5.7 MB

/.../03_exploration-methods-for-monte-carlo/

01_epsilon-soft-policies.en.srt

7.7 KB

01_epsilon-soft-policies.en.txt

4.9 KB

01_epsilon-soft-policies.mp4

13.3 MB

/.../01_what-is-a-model/

03_what-is-a-model.en.srt

7.7 KB

03_what-is-a-model.en.txt

4.1 KB

04_comparing-sample-and-distribution-models.en.srt

4.0 KB

01_module-4-learning-objectives_instructions.html

3.6 KB

04_comparing-sample-and-distribution-models.en.txt

2.1 KB

02_weekly-reading_instructions.html

1.2 KB

02_weekly-reading_RLbook2018.pdf

89.4 MB

03_what-is-a-model.mp4

11.9 MB

04_comparing-sample-and-distribution-models.mp4

7.0 MB

/.../02_off-policy-td-control-q-learning/

03_how-is-q-learning-off-policy.en.srt

7.4 KB

02_q-learning-in-the-windy-grid-world.en.srt

5.9 KB

01_what-is-q-learning.en.srt

5.1 KB

03_how-is-q-learning-off-policy.en.txt

4.1 KB

02_q-learning-in-the-windy-grid-world.en.txt

3.1 KB

01_what-is-q-learning.en.txt

2.7 KB

03_how-is-q-learning-off-policy.mp4

10.4 MB

01_what-is-q-learning.mp4

8.2 MB

02_q-learning-in-the-windy-grid-world.mp4

7.6 MB

/.../01_weekly-learning-goals/

01_meeting-with-adam-getting-the-agent-details-right.en.srt

7.3 KB

01_meeting-with-adam-getting-the-agent-details-right.en.txt

4.5 KB

01_meeting-with-adam-getting-the-agent-details-right.mp4

13.2 MB

/.../02_exploration-under-function-approximation/

01_exploration-under-function-approximation.en.srt

6.7 KB

01_exploration-under-function-approximation.en.txt

3.6 KB

01_exploration-under-function-approximation.mp4

11.6 MB

/.../02_monte-carlo-for-control/

03_solving-the-blackjack-example.en.srt

6.6 KB

01_using-monte-carlo-for-action-values.en.srt

4.8 KB

02_using-monte-carlo-methods-for-generalized-policy-iteration.en.srt

4.1 KB

03_solving-the-blackjack-example.en.txt

3.5 KB

01_using-monte-carlo-for-action-values.en.txt

2.6 KB

02_using-monte-carlo-methods-for-generalized-policy-iteration.en.txt

2.2 KB

03_solving-the-blackjack-example.mp4

14.6 MB

01_using-monte-carlo-for-action-values.mp4

6.8 MB

02_using-monte-carlo-methods-for-generalized-policy-iteration.mp4

5.4 MB

/.../05_course-wrap-up/

01_congratulations.en.srt

6.5 KB

01_congratulations.en.txt

3.5 KB

01_congratulations.mp4

11.7 MB

/.../01_td-for-control/

03_sarsa-gpi-with-td.en.srt

6.3 KB

04_sarsa-in-the-windy-grid-world.en.srt

4.0 KB

03_sarsa-gpi-with-td.en.txt

3.3 KB

01_module-3-learning-objectives_instructions.html

2.9 KB

04_sarsa-in-the-windy-grid-world.en.txt

2.4 KB

02_weekly-reading_instructions.html

1.2 KB

02_weekly-reading_RLbook2018.pdf

89.4 MB

03_sarsa-gpi-with-td.mp4

7.7 MB

04_sarsa-in-the-windy-grid-world.mp4

6.1 MB

/.../02_neural-networks/

02_non-linear-approximation-with-neural-networks.en.srt

6.2 KB

03_deep-neural-networks.en.srt

6.0 KB

01_what-is-a-neural-network.en.srt

5.6 KB

02_non-linear-approximation-with-neural-networks.en.txt

3.9 KB

01_what-is-a-neural-network.en.txt

3.0 KB

03_deep-neural-networks.en.txt

3.2 KB

03_deep-neural-networks.mp4

16.1 MB

02_non-linear-approximation-with-neural-networks.mp4

10.1 MB

01_what-is-a-neural-network.mp4

7.4 MB

/.../01_notebook-grading-faqs/

01__resources.html

5.6 KB

/.../05_planning-learning-acting/02_planning/

01_random-tabular-q-planning.en.srt

5.5 KB

01_random-tabular-q-planning.en.txt

3.0 KB

01_random-tabular-q-planning.mp4

8.2 MB

/.../03_expected-sarsa/

01_expected-sarsa.en.srt

4.6 KB

02_expected-sarsa-in-the-cliff-world.en.srt

3.8 KB

03_generality-of-expected-sarsa.en.srt

2.9 KB

01_expected-sarsa.en.txt

2.9 KB

04_week-3-summary.en.srt

2.7 KB

02_expected-sarsa-in-the-cliff-world.en.txt

2.4 KB

04_week-3-summary.en.txt

1.6 KB

03_generality-of-expected-sarsa.en.txt

1.6 KB

05_chapter-summary_instructions.html

1.3 KB

05_chapter-summary_RLbook2018.pdf

89.4 MB

01_expected-sarsa.mp4

6.6 MB

02_expected-sarsa-in-the-cliff-world.mp4

6.0 MB

03_generality-of-expected-sarsa.mp4

5.5 MB

04_week-3-summary.mp4

3.9 MB

/.../05_course-wrap-up/

01_congratulations-course-4-preview.en.srt

4.3 KB

01_congratulations-course-4-preview.en.txt

2.3 KB

01_congratulations-course-4-preview.mp4

23.2 MB

/.../05_course-wrap-up/

01_congratulations.en.srt

3.4 KB

01_congratulations.en.txt

2.1 KB

01_congratulations.mp4

4.6 MB

 
