Reinforcement Learning Specialization | E00A4FC3F94EF3FF923884F09A47FFF540D7EE60

Name	Reinforcement Learning Specialization
Total Size	5.0 GB
Total Files	699
Last Seen
Hash	E00A4FC3F94EF3FF923884F09A47FFF540D7EE60

/.../03_generalized-policy-iteration/
04_warren-powell-approximate-dynamic-programming-for-fleet-management-long.mp4	152.4 MB

TutsNode.net.txt	0.1 KB
[TGx]Downloaded from torrentgalaxy.to .txt	0.6 KB
/.../04_weekly-assessment/
01_sequential-decision-making_quiz.html	215.4 KB
02_bandits-and-exploration-exploitation_instructions.html	1.2 KB
/.../01_course-introduction/
01_course-4-introduction.en.txt	2.3 KB
03_reinforcement-learning-textbook_instructions.html	2.2 KB
04_pre-requisites-and-learning-objectives_A_Complete_Reinforcement_Learning_System_Capstone__Learning_Objectives.pdf	58.2 KB
02_meet-your-instructors.en.srt	13.8 KB
02_meet-your-instructors.en.txt	8.8 KB
01_course-4-introduction.en.srt	4.3 KB
04_pre-requisites-and-learning-objectives_instructions.html	3.7 KB
03_reinforcement-learning-textbook_RLbook2018.pdf	89.4 MB
02_meet-your-instructors.mp4	46.0 MB
01_course-4-introduction.mp4	23.2 MB
/.../04_weekly-assessment/
01_dynamic-programming_quiz.html	161.3 KB
02_optimal-policies-with-dynamic-programming_instructions.html	1.2 KB
/.../01_course-introduction/
04_read-me-pre-requisites-and-learning-objectives_Course_2__Sample_Based_Learning_Methods_Learning_Objectives.pdf	85.1 KB
02_meet-your-instructors.en.srt	13.8 KB
02_meet-your-instructors.en.txt	8.8 KB
01_course-introduction.en.srt	4.1 KB
04_read-me-pre-requisites-and-learning-objectives_instructions.html	3.0 KB
03_reinforcement-learning-textbook_instructions.html	2.2 KB
01_course-introduction.en.txt	2.2 KB
03_reinforcement-learning-textbook_RLbook2018.pdf	89.4 MB
02_meet-your-instructors.mp4	46.0 MB
01_course-introduction.mp4	11.8 MB
/.../01_course-introduction/
06_read-me-pre-requisites-and-learning-objectives_Fundamentals_of_Reinforcement_Learning__Learning_Objectives.pdf	66.2 KB
02_course-introduction.en.txt	5.8 KB
05_reinforcement-learning-textbook_RLbook2018.pdf	89.4 MB
03_meet-your-instructors.en.srt	16.3 KB
02_course-introduction.en.srt	10.7 KB
03_meet-your-instructors.en.txt	8.6 KB
01_specialization-introduction.en.txt	2.7 KB
05_reinforcement-learning-textbook_instructions.html	2.2 KB
06_read-me-pre-requisites-and-learning-objectives_instructions.html	2.7 KB
04_your-specialization-roadmap.en.srt	6.8 KB
01_specialization-introduction.en.srt	5.0 KB
04_your-specialization-roadmap.en.txt	3.5 KB
03_meet-your-instructors.mp4	46.0 MB
02_course-introduction.mp4	34.0 MB
01_specialization-introduction.mp4	19.1 MB
04_your-specialization-roadmap.mp4	15.6 MB
/.../01_course-introduction/
03_read-me-pre-requisites-and-learning-objectives_Prediction_and_Control_with_Function_Approximation_Learning_Objectives.pdf	61.4 KB
02_meet-your-instructors.en.srt	13.8 KB
01_course-3-introduction.en.srt	9.1 KB
02_meet-your-instructors.en.txt	8.8 KB
01_course-3-introduction.en.txt	4.8 KB
04_reinforcement-learning-textbook_instructions.html	2.2 KB
03_read-me-pre-requisites-and-learning-objectives_instructions.html	3.3 KB
04_reinforcement-learning-textbook_RLbook2018.pdf	89.4 MB
02_meet-your-instructors.mp4	46.0 MB
01_course-3-introduction.mp4	17.1 MB
/.../04_off-policy-learning-for-prediction/
04_emma-brunskill-batch-reinforcement-learning.en.srt	25.5 KB
04_emma-brunskill-batch-reinforcement-learning.en.txt	13.5 KB
03_off-policy-monte-carlo-prediction.en.srt	8.0 KB
02_importance-sampling.en.srt	6.7 KB
01_why-does-off-policy-learning-matter.en.srt	6.1 KB
05_week-1-summary.en.srt	5.7 KB
03_off-policy-monte-carlo-prediction.en.txt	4.2 KB
01_why-does-off-policy-learning-matter.en.txt	3.9 KB
02_importance-sampling.en.txt	3.6 KB
05_week-1-summary.en.txt	3.0 KB
06_chapter-summary_instructions.html	1.2 KB
06_chapter-summary_RLbook2018.pdf	89.4 MB
04_emma-brunskill-batch-reinforcement-learning.mp4	39.2 MB
01_why-does-off-policy-learning-matter.mp4	15.1 MB
03_off-policy-monte-carlo-prediction.mp4	13.1 MB
05_week-1-summary.mp4	10.1 MB
02_importance-sampling.mp4	7.8 MB
.pad/
0	0.0 KB
1	0.0 KB
2	0.3 KB
3	158.7 KB
4	751.9 KB
5	751.9 KB
6	751.9 KB
7	751.9 KB
8	751.9 KB
9	751.9 KB
10	751.9 KB
11	751.9 KB
12	751.9 KB
13	751.9 KB
14	751.9 KB
15	751.9 KB
16	751.9 KB
17	751.9 KB
18	751.9 KB
19	751.9 KB
20	751.9 KB
21	751.9 KB
22	751.9 KB
23	751.9 KB
24	1.0 MB
25	832.5 KB
26	747.4 KB
27	265.2 KB
28	917.3 KB
29	140.9 KB
30	140.9 KB
31	140.9 KB
32	140.9 KB
33	615.9 KB
34	202.8 KB
35	515.8 KB
36	645.1 KB
37	372.0 KB
38	824.1 KB
39	640.2 KB
40	637.0 KB
41	748.5 KB
42	527.4 KB
43	194.0 KB
44	385.3 KB
45	92.6 KB
46	989.8 KB
47	651.8 KB
48	483.1 KB
49	975.6 KB
50	930.5 KB
51	930.5 KB
52	436.1 KB
53	607.2 KB
54	947.2 KB
55	780.9 KB
56	51.4 KB
57	631.0 KB
58	968.2 KB
59	968.2 KB
60	222.2 KB
61	394.2 KB
62	561.6 KB
63	778.3 KB
64	992.4 KB
65	25.9 KB
66	141.9 KB
67	939.1 KB
68	1.0 MB
69	281.7 KB
70	491.8 KB
71	703.3 KB
72	718.5 KB
73	760.6 KB
74	865.3 KB
75	396.7 KB
76	488.5 KB
77	554.7 KB
78	682.0 KB
79	702.6 KB
80	800.2 KB
81	948.4 KB
82	12.7 KB
83	121.9 KB
84	126.2 KB
85	637.0 KB
86	751.9 KB
87	751.9 KB
88	822.2 KB
89	972.2 KB
90	972.2 KB
91	1.0 MB
92	96.9 KB
93	360.3 KB
94	384.8 KB
95	676.5 KB
96	715.0 KB
97	785.0 KB
98	915.9 KB
99	147.6 KB
100	326.8 KB
101	346.4 KB
102	420.2 KB
103	508.2 KB
104	583.6 KB
105	675.0 KB
106	675.0 KB
107	834.6 KB
108	56.6 KB
109	59.9 KB
110	67.4 KB
111	134.9 KB
112	239.8 KB
113	477.4 KB
114	537.7 KB
115	702.1 KB
116	769.6 KB
117	799.2 KB
118	862.4 KB
119	1.0 MB
120	49.1 KB
121	146.4 KB
122	254.9 KB
123	254.9 KB
124	323.5 KB
125	726.2 KB
126	845.1 KB
127	8.4 KB
128	43.0 KB
129	44.9 KB
130	104.7 KB
131	197.6 KB
132	197.6 KB
133	406.3 KB
134	425.2 KB
135	426.3 KB
136	432.9 KB
137	433.1 KB
138	433.1 KB
139	547.5 KB
140	720.7 KB
141	901.7 KB
142	901.7 KB
143	948.8 KB
144	129.0 KB
145	525.6 KB
146	1.0 MB
147	125.8 KB
148	170.6 KB
149	170.6 KB
150	182.0 KB
151	252.5 KB
152	323.4 KB
153	388.8 KB
154	388.8 KB
155	453.6 KB
156	617.2 KB
157	622.0 KB
158	648.9 KB
159	798.3 KB
160	1.0 MB
161	371.9 KB
162	557.5 KB
163	770.8 KB
164	770.8 KB
165	155.7 KB
166	326.2 KB
167	606.8 KB
168	824.9 KB
169	868.0 KB
170	674.0 KB
171	782.6 KB
/.../03_generalized-policy-iteration/
04_warren-powell-approximate-dynamic-programming-for-fleet-management-long.en.srt	41.7 KB
04_warren-powell-approximate-dynamic-programming-for-fleet-management-long.en.txt	21.9 KB
03_warren-powell-approximate-dynamic-programming-for-fleet-management-short.en.srt	12.4 KB
02_efficiency-of-dynamic-programming.en.srt	7.9 KB
03_warren-powell-approximate-dynamic-programming-for-fleet-management-short.en.txt	7.7 KB
01_flexibility-of-the-policy-iteration-framework.en.srt	7.2 KB
02_efficiency-of-dynamic-programming.en.txt	5.0 KB
05_week-4-summary.en.txt	2.4 KB
06_chapter-summary_instructions.html	1.2 KB
05_week-4-summary.en.srt	4.6 KB
01_flexibility-of-the-policy-iteration-framework.en.txt	3.9 KB
06_chapter-summary_RLbook2018.pdf	89.4 MB
03_warren-powell-approximate-dynamic-programming-for-fleet-management-short.mp4	49.4 MB
02_efficiency-of-dynamic-programming.mp4	14.7 MB
01_flexibility-of-the-policy-iteration-framework.mp4	13.0 MB
05_week-4-summary.mp4	10.1 MB
/.../04_weekly-assessment/
02_graded-value-functions-and-bellman-equations_exam.html	31.8 KB
01_practice-value-functions-and-bellman-equations_quiz.html	8.2 KB
/.../03_average-reward/
02_satinder-singh-on-intrinsic-rewards.en.srt	21.5 KB
01_average-reward-a-new-way-of-formulating-control-problems.en.srt	15.5 KB
02_satinder-singh-on-intrinsic-rewards.en.txt	11.2 KB
01_average-reward-a-new-way-of-formulating-control-problems.en.txt	9.7 KB
03_week-3-review.en.srt	4.8 KB
03_week-3-review.en.txt	2.5 KB
02_satinder-singh-on-intrinsic-rewards.mp4	28.2 MB
01_average-reward-a-new-way-of-formulating-control-problems.mp4	20.0 MB
03_week-3-review.mp4	9.3 MB
/.../02_goal-of-reinforcement-learning/
02_michael-littman-the-reward-hypothesis.en.srt	18.9 KB
02_michael-littman-the-reward-hypothesis.en.txt	11.9 KB
01_the-goal-of-reinforcement-learning.en.txt	2.7 KB
01_the-goal-of-reinforcement-learning.en.srt	5.0 KB
02_michael-littman-the-reward-hypothesis.mp4	88.1 MB
01_the-goal-of-reinforcement-learning.mp4	8.4 MB
/.../02_advantages-of-td/
03_andy-barto-and-rich-sutton-more-on-the-history-of-rl.en.srt	16.3 KB
01_the-advantages-of-temporal-difference-learning.en.srt	8.4 KB
02_comparing-td-and-monte-carlo.en.srt	8.3 KB
03_andy-barto-and-rich-sutton-more-on-the-history-of-rl.en.txt	8.3 KB
01_the-advantages-of-temporal-difference-learning.en.txt	4.4 KB
02_comparing-td-and-monte-carlo.en.txt	4.4 KB
04_week-2-summary.en.srt	3.2 KB
04_week-2-summary.en.txt	1.7 KB
03_andy-barto-and-rich-sutton-more-on-the-history-of-rl.mp4	84.1 MB
02_comparing-td-and-monte-carlo.mp4	10.3 MB
01_the-advantages-of-temporal-difference-learning.mp4	9.5 MB
04_week-2-summary.mp4	7.8 MB
/.../02_project-resources/
03_lets-review-average-reward-a-new-way-of-formulating-control-problems.en.srt	15.5 KB
01_lets-review-expected-sarsa.en.txt	2.9 KB
02_lets-review-what-is-q-learning.en.txt	2.7 KB
05_csaba-szepesvari-on-problem-landscape.en.srt	9.8 KB
03_lets-review-average-reward-a-new-way-of-formulating-control-problems.en.txt	9.7 KB
04_lets-review-actor-critic-algorithm.en.srt	9.4 KB
05_csaba-szepesvari-on-problem-landscape.en.txt	6.2 KB
06_andy-and-rich-advice-for-students.en.srt	6.0 KB
02_lets-review-what-is-q-learning.en.srt	5.1 KB
04_lets-review-actor-critic-algorithm.en.txt	5.0 KB
01_lets-review-expected-sarsa.en.srt	4.6 KB
06_andy-and-rich-advice-for-students.en.txt	3.6 KB
05_csaba-szepesvari-on-problem-landscape.mp4	40.7 MB
06_andy-and-rich-advice-for-students.mp4	35.0 MB
03_lets-review-average-reward-a-new-way-of-formulating-control-problems.mp4	20.0 MB
04_lets-review-actor-critic-algorithm.mp4	14.8 MB
02_lets-review-what-is-q-learning.mp4	8.2 MB
01_lets-review-expected-sarsa.mp4	6.6 MB
/.../02_project-resources/
02_lets-review-examples-of-episodic-and-continuing-tasks.en.txt	2.6 KB
01_lets-review-markov-decision-processes.en.srt	9.8 KB
01_lets-review-markov-decision-processes.en.txt	5.3 KB
02_lets-review-examples-of-episodic-and-continuing-tasks.en.srt	4.8 KB
01_lets-review-markov-decision-processes.mp4	13.0 MB
02_lets-review-examples-of-episodic-and-continuing-tasks.mp4	9.6 MB
/.../03_training-neural-networks/
03_david-silver-on-deep-learning-rl-ai.en.srt	15.1 KB
01_gradient-descent-for-training-neural-networks.en.srt	14.3 KB
03_david-silver-on-deep-learning-rl-ai.en.txt	9.7 KB
02_optimization-strategies-for-nns.en.srt	8.6 KB
01_gradient-descent-for-training-neural-networks.en.txt	7.6 KB
02_optimization-strategies-for-nns.en.txt	4.6 KB
04_week-2-review.en.srt	4.2 KB
04_week-2-review.en.txt	2.2 KB
03_david-silver-on-deep-learning-rl-ai.mp4	43.4 MB
01_gradient-descent-for-training-neural-networks.mp4	16.3 MB
02_optimization-strategies-for-nns.mp4	15.0 MB
04_week-2-review.mp4	8.9 MB
/.../01_weekly-learning-goals/
01_meeting-with-niko-choosing-the-learning-algorithm.en.txt	2.9 KB
01_meeting-with-niko-choosing-the-learning-algorithm.en.srt	4.7 KB
01_meeting-with-niko-choosing-the-learning-algorithm.mp4	8.3 MB
/.../03_exploration-vs-exploitation-tradeoff/
04_jonathan-langford-contextual-bandits-for-real-world-reinforcement-learning.en.srt	14.4 KB
01_what-is-the-trade-off.en.srt	12.5 KB
02_optimistic-initial-values.en.srt	8.7 KB
03_upper-confidence-bound-ucb-action-selection.en.srt	7.7 KB
04_jonathan-langford-contextual-bandits-for-real-world-reinforcement-learning.en.txt	7.5 KB
05_week-1-summary.en.txt	2.7 KB
06_chapter-summary_instructions.html	1.2 KB
01_what-is-the-trade-off.en.txt	6.7 KB
02_optimistic-initial-values.en.txt	5.5 KB
05_week-1-summary.en.srt	4.4 KB
03_upper-confidence-bound-ucb-action-selection.en.txt	4.1 KB
06_chapter-summary_RLbook2018.pdf	89.4 MB
01_what-is-the-trade-off.mp4	22.6 MB
02_optimistic-initial-values.mp4	13.8 MB
04_jonathan-langford-contextual-bandits-for-real-world-reinforcement-learning.mp4	12.5 MB
03_upper-confidence-bound-ucb-action-selection.mp4	12.3 MB
05_week-1-summary.mp4	9.9 MB
/.../01_policy-evaluation-prediction/
04_iterative-policy-evaluation.en.srt	14.0 KB
04_iterative-policy-evaluation.en.txt	7.3 KB
03_policy-evaluation-vs-control.en.srt	6.8 KB
01_module-4-learning-objectives_instructions.html	3.1 KB
02_weekly-reading_instructions.html	1.2 KB
03_policy-evaluation-vs-control.en.txt	4.3 KB
02_weekly-reading_RLbook2018.pdf	89.4 MB
04_iterative-policy-evaluation.mp4	19.7 MB
03_policy-evaluation-vs-control.mp4	14.0 MB
/.../02_project-resources/
02_joelle-pineau-about-rl-that-matters.en.srt	14.0 KB
02_joelle-pineau-about-rl-that-matters.en.txt	9.0 KB
01_lets-review-comparing-td-and-monte-carlo.en.srt	8.3 KB
01_lets-review-comparing-td-and-monte-carlo.en.txt	4.4 KB
02_joelle-pineau-about-rl-that-matters.mp4	30.9 MB
01_lets-review-comparing-td-and-monte-carlo.mp4	10.3 MB
/.../02_policy-iteration-control/
02_policy-iteration.en.srt	13.7 KB
02_policy-iteration.en.txt	7.3 KB
01_policy-improvement.en.srt	6.7 KB
01_policy-improvement.en.txt	3.6 KB
02_policy-iteration.mp4	18.7 MB
01_policy-improvement.mp4	10.5 MB
/.../04_policy-parameterizations/
03_gaussian-policies-for-continuous-actions.en.srt	13.1 KB
02_demonstration-with-actor-critic.en.srt	11.1 KB
04_week-4-summary.en.srt	7.2 KB
03_gaussian-policies-for-continuous-actions.en.txt	7.1 KB
01_actor-critic-with-softmax-policies.en.srt	6.1 KB
02_demonstration-with-actor-critic.en.txt	6.0 KB
01_actor-critic-with-softmax-policies.en.txt	3.8 KB
04_week-4-summary.en.txt	3.7 KB
02_demonstration-with-actor-critic.mp4	30.2 MB
03_gaussian-policies-for-continuous-actions.mp4	20.9 MB
01_actor-critic-with-softmax-policies.mp4	17.3 MB
04_week-4-summary.mp4	10.4 MB
/.../01_final-project-milestone-1/
02_andy-barto-on-what-are-eligibility-traces-and-why-are-they-so-named.en.srt	12.8 KB
01_initial-project-meeting-with-martha-formalizing-the-problem.en.srt	6.9 KB
02_andy-barto-on-what-are-eligibility-traces-and-why-are-they-so-named.en.txt	6.8 KB
01_initial-project-meeting-with-martha-formalizing-the-problem.en.txt	4.3 KB
02_andy-barto-on-what-are-eligibility-traces-and-why-are-they-so-named.mp4	40.4 MB
01_initial-project-meeting-with-martha-formalizing-the-problem.mp4	13.9 MB
/.../03_optimality-optimal-policies-value-functions/
01_optimal-policies.en.srt	12.5 KB
03_using-optimal-value-functions-to-get-optimal-policies.en.srt	11.1 KB
02_optimal-value-functions.en.srt	8.5 KB
03_using-optimal-value-functions-to-get-optimal-policies.en.txt	6.9 KB
01_optimal-policies.en.txt	6.6 KB
04_week-3-summary.en.srt	6.5 KB
05_chapter-summary_RLbook2018.pdf	89.4 MB
05_chapter-summary_instructions.html	1.2 KB
02_optimal-value-functions.en.txt	4.6 KB
04_week-3-summary.en.txt	3.4 KB
01_optimal-policies.mp4	19.4 MB
03_using-optimal-value-functions-to-get-optimal-policies.mp4	17.5 MB
04_week-3-summary.mp4	12.5 MB
02_optimal-value-functions.mp4	10.7 MB
/.../04_weekly-assesment/
01_mdps_quiz.html	12.1 KB
02_graded-assignment-describe-three-mdps_peer_assignment_instructions.html	2.4 KB
/.../03_the-objective-for-td/
03_doina-precup-building-knowledge-for-ai-agents-with-reinforcement-learning.en.srt	11.6 KB
02_comparing-td-and-monte-carlo-with-state-aggregation.en.srt	7.0 KB
03_doina-precup-building-knowledge-for-ai-agents-with-reinforcement-learning.en.txt	6.2 KB
01_semi-gradient-td-for-policy-evaluation.en.srt	4.7 KB
01_semi-gradient-td-for-policy-evaluation.en.txt	2.9 KB
02_comparing-td-and-monte-carlo-with-state-aggregation.en.txt	3.7 KB
03_doina-precup-building-knowledge-for-ai-agents-with-reinforcement-learning.mp4	58.0 MB
01_semi-gradient-td-for-policy-evaluation.mp4	16.1 MB
02_comparing-td-and-monte-carlo-with-state-aggregation.mp4	12.1 MB
/.../01_introduction-to-temporal-difference-learning/
04_rich-sutton-the-importance-of-td-learning.en.srt	11.5 KB
03_what-is-temporal-difference-td-learning.en.srt	8.0 KB
04_rich-sutton-the-importance-of-td-learning.en.txt	6.0 KB
03_what-is-temporal-difference-td-learning.en.txt	4.2 KB
01_module-2-learning-objectives_instructions.html	1.8 KB
02_weekly-reading_instructions.html	1.2 KB
02_weekly-reading_RLbook2018.pdf	89.4 MB
04_rich-sutton-the-importance-of-td-learning.mp4	37.4 MB
03_what-is-temporal-difference-td-learning.mp4	10.8 MB
/.../01_weekly-learning-goals/
01_agent-architecture-meeting-with-martha-overview-of-design-choices.en.srt	11.1 KB
01_agent-architecture-meeting-with-martha-overview-of-design-choices.en.txt	5.9 KB
01_agent-architecture-meeting-with-martha-overview-of-design-choices.mp4	16.4 MB
/.../01_introduction-to-monte-carlo-methods/
04_using-monte-carlo-for-prediction.en.srt	10.9 KB
03_what-is-monte-carlo.en.srt	10.8 KB
03_what-is-monte-carlo.en.txt	5.8 KB
04_using-monte-carlo-for-prediction.en.txt	5.7 KB
01_module-1-learning-objectives_instructions.html	3.1 KB
02_weekly-reading_instructions.html	1.2 KB
02_weekly-reading_RLbook2018.pdf	89.4 MB
04_using-monte-carlo-for-prediction.mp4	17.0 MB
03_what-is-monte-carlo.mp4	15.6 MB
/.../02_project-resources/
02_drew-bagnell-on-system-id-optimal-control.en.srt	10.8 KB
03_susan-murphy-on-rl-in-mobile-health.en.srt	10.6 KB
02_drew-bagnell-on-system-id-optimal-control.en.txt	6.9 KB
03_susan-murphy-on-rl-in-mobile-health.en.txt	6.5 KB
01_lets-review-non-linear-approximation-with-neural-networks.en.srt	6.2 KB
01_lets-review-non-linear-approximation-with-neural-networks.en.txt	3.9 KB
02_drew-bagnell-on-system-id-optimal-control.mp4	32.8 MB
03_susan-murphy-on-rl-in-mobile-health.mp4	29.0 MB
01_lets-review-non-linear-approximation-with-neural-networks.mp4	10.1 MB
/.../01_policies-and-value-functions/
05_rich-sutton-and-andy-barto-a-brief-history-of-rl.en.srt	10.7 KB
04_value-functions.en.srt	10.6 KB
02_weekly-reading_instructions.html	1.2 KB
03_specifying-policies.en.srt	7.7 KB
04_value-functions.en.txt	5.7 KB
05_rich-sutton-and-andy-barto-a-brief-history-of-rl.en.txt	5.5 KB
03_specifying-policies.en.txt	4.1 KB
01_module-3-learning-objectives_instructions.html	3.3 KB
02_weekly-reading_RLbook2018.pdf	89.4 MB
05_rich-sutton-and-andy-barto-a-brief-history-of-rl.mp4	51.1 MB
04_value-functions.mp4	22.1 MB
03_specifying-policies.mp4	15.7 MB
/.../01_estimating-values-functions-with-supervised-learning/
03_moving-to-parameterized-functions.en.srt	10.7 KB
04_generalization-and-discrimination.en.srt	8.9 KB
05_framing-value-estimation-as-supervised-learning.en.srt	6.4 KB
03_moving-to-parameterized-functions.en.txt	5.7 KB
04_generalization-and-discrimination.en.txt	4.7 KB
02_weekly-reading-on-policy-prediction-with-approximation_instructions.html	1.2 KB
01_module-1-learning-objectives_instructions.html	4.1 KB
05_framing-value-estimation-as-supervised-learning.en.txt	3.4 KB
02_weekly-reading-on-policy-prediction-with-approximation_RLbook2018.pdf	89.4 MB
03_moving-to-parameterized-functions.mp4	25.6 MB
04_generalization-and-discrimination.mp4	13.5 MB
05_framing-value-estimation-as-supervised-learning.mp4	11.2 MB
/.../04_dealing-with-inaccurate-models/
03_drew-bagnell-self-driving-robotics-and-model-based-rl.en.srt	10.6 KB
02_in-depth-with-changing-environments.en.srt	9.4 KB
01_what-if-the-model-is-inaccurate.en.srt	7.2 KB
03_drew-bagnell-self-driving-robotics-and-model-based-rl.en.txt	6.8 KB
02_in-depth-with-changing-environments.en.txt	5.0 KB
01_what-if-the-model-is-inaccurate.en.txt	3.9 KB
04_week-4-summary.en.srt	2.6 KB
04_week-4-summary.en.txt	1.4 KB
06_text-book-part-1-summary_instructions.html	1.2 KB
05_chapter-summary_instructions.html	1.2 KB
05_chapter-summary_RLbook2018.pdf	89.4 MB
06_text-book-part-1-summary_RLbook2018.pdf	89.4 MB
03_drew-bagnell-self-driving-robotics-and-model-based-rl.mp4	36.9 MB
02_in-depth-with-changing-environments.mp4	12.5 MB
01_what-if-the-model-is-inaccurate.mp4	8.1 MB
04_week-4-summary.mp4	4.5 MB
/.../02_the-objective-for-on-policy-prediction/
04_state-aggregation-with-monte-carlo.en.srt	10.5 KB
02_introducing-gradient-descent.en.srt	10.1 KB
03_gradient-monte-for-policy-evaluation.en.srt	9.5 KB
01_the-value-error-objective.en.srt	6.4 KB
04_state-aggregation-with-monte-carlo.en.txt	6.4 KB
02_introducing-gradient-descent.en.txt	6.3 KB
03_gradient-monte-for-policy-evaluation.en.txt	5.0 KB
01_the-value-error-objective.en.txt	3.5 KB
04_state-aggregation-with-monte-carlo.mp4	21.2 MB
03_gradient-monte-for-policy-evaluation.mp4	16.0 MB
02_introducing-gradient-descent.mp4	15.8 MB
01_the-value-error-objective.mp4	11.4 MB
/.../01_learning-parameterized-policies/
03_learning-policies-directly.en.srt	10.4 KB
04_advantages-of-policy-parameterization.en.srt	7.8 KB
03_learning-policies-directly.en.txt	5.6 KB
04_advantages-of-policy-parameterization.en.txt	4.8 KB
01_module-4-learning-objectives_instructions.html	2.9 KB
02_weekly-reading-policy-gradient-methods_instructions.html	1.2 KB
02_weekly-reading-policy-gradient-methods_RLbook2018.pdf	89.4 MB
04_advantages-of-policy-parameterization.mp4	27.3 MB
03_learning-policies-directly.mp4	17.9 MB
/.../02_bellman-equations/
01_bellman-equation-derivation.en.srt	9.9 KB
02_why-bellman-equations.en.srt	7.2 KB
01_bellman-equation-derivation.en.txt	5.3 KB
02_why-bellman-equations.en.txt	4.5 KB
01_bellman-equation-derivation.mp4	17.9 MB
02_why-bellman-equations.mp4	12.4 MB
/.../01_introduction-to-markov-decision-processes/
03_markov-decision-processes.en.srt	9.8 KB
01_module-2-learning-objectives_instructions.html	2.4 KB
02_weekly-reading_instructions.html	1.2 KB
04_examples-of-mdps.en.srt	7.0 KB
03_markov-decision-processes.en.txt	5.3 KB
04_examples-of-mdps.en.txt	3.8 KB
02_weekly-reading_RLbook2018.pdf	89.4 MB
03_markov-decision-processes.mp4	13.0 MB
04_examples-of-mdps.mp4	12.8 MB
/.../02_project-resources/
02_lets-review-expected-sarsa-with-function-approximation.en.txt	2.1 KB
01_lets-review-optimization-strategies-for-nns.en.srt	8.6 KB
05_martin-riedmiller-on-the-collect-and-infer-framework-for-data-efficient-rl.en.srt	8.4 KB
04_meeting-with-martha-in-depth-on-experience-replay.en.srt	7.5 KB
03_lets-review-dyna-q-learning-in-a-simple-maze.en.srt	7.1 KB
05_martin-riedmiller-on-the-collect-and-infer-framework-for-data-efficient-rl.en.txt	5.2 KB
04_meeting-with-martha-in-depth-on-experience-replay.en.txt	4.8 KB
01_lets-review-optimization-strategies-for-nns.en.txt	4.6 KB
03_lets-review-dyna-q-learning-in-a-simple-maze.en.txt	4.3 KB
02_lets-review-expected-sarsa-with-function-approximation.en.srt	4.0 KB
05_martin-riedmiller-on-the-collect-and-infer-framework-for-data-efficient-rl.mp4	24.7 MB
04_meeting-with-martha-in-depth-on-experience-replay.mp4	22.5 MB
01_lets-review-optimization-strategies-for-nns.mp4	15.0 MB
03_lets-review-dyna-q-learning-in-a-simple-maze.mp4	11.3 MB
02_lets-review-expected-sarsa-with-function-approximation.mp4	8.0 MB
/.../01_feature-construction-for-linear-methods/
04_generalization-properties-of-coarse-coding.en.srt	9.6 KB
06_using-tile-coding-in-td.en.srt	8.5 KB
05_tile-coding.en.srt	5.3 KB
04_generalization-properties-of-coarse-coding.en.txt	5.2 KB
03_coarse-coding.en.srt	5.0 KB
06_using-tile-coding-in-td.en.txt	4.4 KB
02_weekly-reading-on-policy-prediction-with-approximation-ii_instructions.html	1.3 KB
05_tile-coding.en.txt	2.9 KB
01_module-2-learning-objectives_instructions.html	3.2 KB
03_coarse-coding.en.txt	3.1 KB
02_weekly-reading-on-policy-prediction-with-approximation-ii_RLbook2018.pdf	89.4 MB
06_using-tile-coding-in-td.mp4	24.2 MB
04_generalization-properties-of-coarse-coding.mp4	18.8 MB
03_coarse-coding.mp4	10.1 MB
05_tile-coding.mp4	7.9 MB
/.../02_policy-gradient-for-continuing-tasks/
02_the-policy-gradient-theorem.en.srt	9.5 KB
01_the-objective-for-learning-policies.en.srt	9.1 KB
02_the-policy-gradient-theorem.en.txt	5.0 KB
01_the-objective-for-learning-policies.en.txt	4.9 KB
01_the-objective-for-learning-policies.mp4	14.0 MB
02_the-policy-gradient-theorem.mp4	9.8 MB
/.../03_actor-critic-for-continuing-tasks/
02_actor-critic-algorithm.en.srt	9.4 KB
01_estimating-the-policy-gradient.en.srt	7.7 KB
02_actor-critic-algorithm.en.txt	5.0 KB
01_estimating-the-policy-gradient.en.txt	4.7 KB
02_actor-critic-algorithm.mp4	14.8 MB
01_estimating-the-policy-gradient.mp4	14.3 MB
/.../01_the-k-armed-bandit-problem/
02_weekly-reading_RLbook2018.pdf	89.4 MB
03_sequential-decision-making-with-evaluative-feedback.en.srt	8.9 KB
01_module-1-learning-objectives_instructions.html	2.9 KB
02_weekly-reading_instructions.html	1.2 KB
03_sequential-decision-making-with-evaluative-feedback.en.txt	4.8 KB
03_sequential-decision-making-with-evaluative-feedback.mp4	17.1 MB
/.../01_episodic-sarsa-with-function-approximation/
04_episodic-sarsa-in-mountain-car.en.srt	8.9 KB
03_episodic-sarsa-with-function-approximation.en.srt	6.4 KB
04_episodic-sarsa-in-mountain-car.en.txt	4.8 KB
05_expected-sarsa-with-function-approximation.en.srt	4.0 KB
03_episodic-sarsa-with-function-approximation.en.txt	3.9 KB
01_module-3-learning-objectives_instructions.html	2.3 KB
02_weekly-reading-on-policy-control-with-approximation_instructions.html	1.3 KB
05_expected-sarsa-with-function-approximation.en.txt	2.1 KB
02_weekly-reading-on-policy-control-with-approximation_RLbook2018.pdf	89.4 MB
03_episodic-sarsa-with-function-approximation.mp4	18.9 MB
04_episodic-sarsa-in-mountain-car.mp4	16.2 MB
05_expected-sarsa-with-function-approximation.mp4	8.0 MB
/.../06_milestone-5-submit-your-parameter-study/03_congratulations/
01_meeting-with-martha-discussing-your-results.en.txt	2.5 KB
02_course-wrap-up.en.srt	3.0 KB
02_course-wrap-up.en.txt	1.9 KB
03_specialization-wrap-up.en.srt	5.5 KB
01_meeting-with-martha-discussing-your-results.en.srt	4.0 KB
03_specialization-wrap-up.en.txt	3.5 KB
03_specialization-wrap-up.mp4	19.5 MB
01_meeting-with-martha-discussing-your-results.mp4	11.5 MB
02_course-wrap-up.mp4	8.1 MB
/.../04_linear-td/
02_the-true-objective-for-td.en.srt	8.4 KB
03_week-1-summary.en.srt	6.8 KB
01_the-linear-td-update.en.srt	6.4 KB
02_the-true-objective-for-td.en.txt	4.4 KB
03_week-1-summary.en.txt	3.7 KB
01_the-linear-td-update.en.txt	3.4 KB
03_week-1-summary.mp4	17.1 MB
02_the-true-objective-for-td.mp4	14.3 MB
01_the-linear-td-update.mp4	10.4 MB
/.../01_weekly-learning-goals/
01_meeting-with-adam-parameter-studies-in-rl.en.srt	8.3 KB
01_meeting-with-adam-parameter-studies-in-rl.en.txt	5.2 KB
01_meeting-with-adam-parameter-studies-in-rl.mp4	12.0 MB
/.../02_what-to-learn-estimating-action-values/
02_estimating-action-values-incrementally.en.srt	8.2 KB
01_learning-action-values.en.srt	7.1 KB
02_estimating-action-values-incrementally.en.txt	4.4 KB
01_learning-action-values.en.txt	3.9 KB
02_estimating-action-values-incrementally.mp4	20.3 MB
01_learning-action-values.mp4	14.9 MB
/.../03_dyna-as-a-formalism-for-planning/
02_the-dyna-algorithm.en.srt	8.0 KB
01_the-dyna-architecture.en.srt	7.1 KB
03_dyna-q-learning-in-a-simple-maze.en.srt	7.1 KB
01_the-dyna-architecture.en.txt	4.4 KB
03_dyna-q-learning-in-a-simple-maze.en.txt	4.3 KB
02_the-dyna-algorithm.en.txt	4.3 KB
02_the-dyna-algorithm.mp4	11.8 MB
03_dyna-q-learning-in-a-simple-maze.mp4	11.3 MB
01_the-dyna-architecture.mp4	10.1 MB
/.../03_continuing-tasks/
01_continuing-tasks.en.srt	7.8 KB
02_examples-of-episodic-and-continuing-tasks.en.txt	2.6 KB
03_week-2-summary.en.srt	2.8 KB
03_week-2-summary.en.txt	1.5 KB
02_examples-of-episodic-and-continuing-tasks.en.srt	4.8 KB
01_continuing-tasks.en.txt	4.1 KB
01_continuing-tasks.mp4	13.3 MB
02_examples-of-episodic-and-continuing-tasks.mp4	9.6 MB
03_week-2-summary.mp4	5.7 MB
/.../03_exploration-methods-for-monte-carlo/
01_epsilon-soft-policies.en.srt	7.7 KB
01_epsilon-soft-policies.en.txt	4.9 KB
01_epsilon-soft-policies.mp4	13.3 MB
/.../01_what-is-a-model/
03_what-is-a-model.en.srt	7.7 KB
03_what-is-a-model.en.txt	4.1 KB
04_comparing-sample-and-distribution-models.en.srt	4.0 KB
01_module-4-learning-objectives_instructions.html	3.6 KB
04_comparing-sample-and-distribution-models.en.txt	2.1 KB
02_weekly-reading_instructions.html	1.2 KB
02_weekly-reading_RLbook2018.pdf	89.4 MB
03_what-is-a-model.mp4	11.9 MB
04_comparing-sample-and-distribution-models.mp4	7.0 MB
/.../02_off-policy-td-control-q-learning/
03_how-is-q-learning-off-policy.en.srt	7.4 KB
02_q-learning-in-the-windy-grid-world.en.srt	5.9 KB
01_what-is-q-learning.en.srt	5.1 KB
03_how-is-q-learning-off-policy.en.txt	4.1 KB
02_q-learning-in-the-windy-grid-world.en.txt	3.1 KB
01_what-is-q-learning.en.txt	2.7 KB
03_how-is-q-learning-off-policy.mp4	10.4 MB
01_what-is-q-learning.mp4	8.2 MB
02_q-learning-in-the-windy-grid-world.mp4	7.6 MB
/.../01_weekly-learning-goals/
01_meeting-with-adam-getting-the-agent-details-right.en.srt	7.3 KB
01_meeting-with-adam-getting-the-agent-details-right.en.txt	4.5 KB
01_meeting-with-adam-getting-the-agent-details-right.mp4	13.2 MB
/.../02_exploration-under-function-approximation/
01_exploration-under-function-approximation.en.srt	6.7 KB
01_exploration-under-function-approximation.en.txt	3.6 KB
01_exploration-under-function-approximation.mp4	11.6 MB
/.../02_monte-carlo-for-control/
03_solving-the-blackjack-example.en.srt	6.6 KB
01_using-monte-carlo-for-action-values.en.srt	4.8 KB
02_using-monte-carlo-methods-for-generalized-policy-iteration.en.srt	4.1 KB
03_solving-the-blackjack-example.en.txt	3.5 KB
01_using-monte-carlo-for-action-values.en.txt	2.6 KB
02_using-monte-carlo-methods-for-generalized-policy-iteration.en.txt	2.2 KB
03_solving-the-blackjack-example.mp4	14.6 MB
01_using-monte-carlo-for-action-values.mp4	6.8 MB
02_using-monte-carlo-methods-for-generalized-policy-iteration.mp4	5.4 MB
/.../05_course-wrap-up/
01_congratulations.en.srt	6.5 KB
01_congratulations.en.txt	3.5 KB
01_congratulations.mp4	11.7 MB
/.../01_td-for-control/
03_sarsa-gpi-with-td.en.srt	6.3 KB
04_sarsa-in-the-windy-grid-world.en.srt	4.0 KB
03_sarsa-gpi-with-td.en.txt	3.3 KB
01_module-3-learning-objectives_instructions.html	2.9 KB
04_sarsa-in-the-windy-grid-world.en.txt	2.4 KB
02_weekly-reading_instructions.html	1.2 KB
02_weekly-reading_RLbook2018.pdf	89.4 MB
03_sarsa-gpi-with-td.mp4	7.7 MB
04_sarsa-in-the-windy-grid-world.mp4	6.1 MB
/.../02_neural-networks/
02_non-linear-approximation-with-neural-networks.en.srt	6.2 KB
03_deep-neural-networks.en.srt	6.0 KB
01_what-is-a-neural-network.en.srt	5.6 KB
02_non-linear-approximation-with-neural-networks.en.txt	3.9 KB
01_what-is-a-neural-network.en.txt	3.0 KB
03_deep-neural-networks.en.txt	3.2 KB
03_deep-neural-networks.mp4	16.1 MB
02_non-linear-approximation-with-neural-networks.mp4	10.1 MB
01_what-is-a-neural-network.mp4	7.4 MB
/.../01_notebook-grading-faqs/
01__resources.html	5.6 KB
/.../05_planning-learning-acting/02_planning/
01_random-tabular-q-planning.en.srt	5.5 KB
01_random-tabular-q-planning.en.txt	3.0 KB
01_random-tabular-q-planning.mp4	8.2 MB
/.../03_expected-sarsa/
01_expected-sarsa.en.srt	4.6 KB
02_expected-sarsa-in-the-cliff-world.en.srt	3.8 KB
03_generality-of-expected-sarsa.en.srt	2.9 KB
01_expected-sarsa.en.txt	2.9 KB
04_week-3-summary.en.srt	2.7 KB
02_expected-sarsa-in-the-cliff-world.en.txt	2.4 KB
04_week-3-summary.en.txt	1.6 KB
03_generality-of-expected-sarsa.en.txt	1.6 KB
05_chapter-summary_instructions.html	1.3 KB
05_chapter-summary_RLbook2018.pdf	89.4 MB
01_expected-sarsa.mp4	6.6 MB
02_expected-sarsa-in-the-cliff-world.mp4	6.0 MB
03_generality-of-expected-sarsa.mp4	5.5 MB
04_week-3-summary.mp4	3.9 MB
/.../05_course-wrap-up/
01_congratulations-course-4-preview.en.srt	4.3 KB
01_congratulations-course-4-preview.en.txt	2.3 KB
01_congratulations-course-4-preview.mp4	23.2 MB
/.../05_course-wrap-up/
01_congratulations.en.srt	3.4 KB
01_congratulations.en.txt	2.1 KB
01_congratulations.mp4	4.6 MB