FileMood

Download [ FreeCourseWeb.com ] Udemy - Advanced Reinforcement Learning - policy gradient methods


Name

[ FreeCourseWeb.com ] Udemy - Advanced Reinforcement Learning - policy gradient methods


Total Size

768.7 MB

Total Files

88

Last Seen

2024-11-19 00:29

Hash

5743EEB7D00724621857FF1C05B45E1C32453D4A

/

Get Bonus Downloads Here.url

0.2 KB

/01 - Introduction/

001 Introduction.html

0.1 KB

002 Reinforcement Learning series.html

0.7 KB

003 Google Colab.mp4

6.1 MB

003 Google Colab_en.vtt

1.8 KB

004 Where to begin.html

0.1 KB

/.../02 - Refresher The Markov Decision Process (MDP)/

001 Elements common to all control tasks.mp4

40.6 MB

001 Elements common to all control tasks_en.vtt

6.1 KB

002 The Markov decision process (MDP).mp4

26.3 MB

002 The Markov decision process (MDP)_en.vtt

5.8 KB

003 Types of Markov decision process.mp4

9.1 MB

003 Types of Markov decision process_en.vtt

2.2 KB

004 Trajectory vs episode.mp4

5.2 MB

004 Trajectory vs episode_en.vtt

1.1 KB

005 Reward vs Return.mp4

5.5 MB

005 Reward vs Return_en.vtt

1.6 KB

006 Discount factor.mp4

15.5 MB

006 Discount factor_en.vtt

4.2 KB

007 Policy.mp4

7.8 MB

007 Policy_en.vtt

2.1 KB

008 State values v(s) and action values q(s,a).mp4

4.5 MB

008 State values v(s) and action values q(s,a)_en.vtt

1.2 KB

009 Bellman equations.mp4

13.0 MB

009 Bellman equations_en.vtt

3.1 KB

010 Solving a Markov decision process.mp4

14.8 MB

010 Solving a Markov decision process_en.vtt

3.3 KB

/.../03 - Refresher Monte Carlo methods/

001 Monte Carlo methods.mp4

14.4 MB

001 Monte Carlo methods_en.vtt

3.4 KB

002 Solving control tasks with Monte Carlo methods.mp4

24.9 MB

002 Solving control tasks with Monte Carlo methods_en.vtt

7.2 KB

003 On-policy Monte Carlo control.mp4

21.4 MB

003 On-policy Monte Carlo control_en.vtt

4.7 KB

/.../04 - Refresher Temporal difference methods/

001 Temporal difference methods.mp4

13.2 MB

001 Temporal difference methods_en.vtt

3.7 KB

002 Solving control tasks with temporal difference methods.mp4

15.2 MB

002 Solving control tasks with temporal difference methods_en.vtt

3.7 KB

003 Monte Carlo vs temporal difference methods.mp4

9.3 MB

003 Monte Carlo vs temporal difference methods_en.vtt

1.6 KB

004 SARSA.mp4

18.6 MB

004 SARSA_en.vtt

4.0 KB

005 Q-Learning.mp4

11.6 MB

005 Q-Learning_en.vtt

2.6 KB

006 Advantages of temporal difference methods.mp4

3.9 MB

006 Advantages of temporal difference methods_en.vtt

1.2 KB

/.../05 - Refresher N-step bootstrapping/

001 N-step temporal difference methods.mp4

13.1 MB

001 N-step temporal difference methods_en.vtt

3.4 KB

002 Where do n-step methods fit.mp4

11.7 MB

002 Where do n-step methods fit_en.vtt

2.7 KB

003 Effect of changing n.mp4

29.4 MB

003 Effect of changing n_en.vtt

4.8 KB

/.../06 - Refresher Brief introduction to Neural Networks/

001 Function approximators.mp4

38.1 MB

001 Function approximators_en.vtt

8.8 KB

002 Artificial Neural Networks.mp4

25.5 MB

002 Artificial Neural Networks_en.vtt

4.0 KB

003 Artificial Neurons.mp4

26.9 MB

003 Artificial Neurons_en.vtt

6.0 KB

004 How to represent a Neural Network.mp4

40.0 MB

004 How to represent a Neural Network_en.vtt

7.4 KB

005 Stochastic Gradient Descent.mp4

52.3 MB

005 Stochastic Gradient Descent_en.vtt

6.6 KB

006 Neural Network optimization.mp4

24.5 MB

006 Neural Network optimization_en.vtt

4.5 KB

/.../07 - Refresher REINFORCE/

001 Policy gradient methods.mp4

22.7 MB

001 Policy gradient methods_en.vtt

4.9 KB

002 Representing policies using neural networks.mp4

29.1 MB

002 Representing policies using neural networks_en.vtt

5.3 KB

003 Policy performance.mp4

8.9 MB

003 Policy performance_en.vtt

2.6 KB

004 The policy gradient theorem.mp4

16.7 MB

004 The policy gradient theorem_en.vtt

3.9 KB

005 REINFORCE.mp4

13.9 MB

005 REINFORCE_en.vtt

4.2 KB

006 Parallel learning.mp4

12.9 MB

006 Parallel learning_en.vtt

3.7 KB

007 Entropy regularization.mp4

24.3 MB

007 Entropy regularization_en.vtt

6.8 KB

008 REINFORCE 2.mp4

11.4 MB

008 REINFORCE 2_en.vtt

2.4 KB

/.../08 - PyTorch Lightning/

001 PyTorch Lightning.mp4

33.6 MB

001 PyTorch Lightning_en.vtt

9.5 KB

002 Link to the code notebook.html

0.1 KB

/.../09 - REINFORCE for continuous control tasks/

001 REINFORCE for continuous action spaces.html

0.1 KB

/.../10 - Advantage Actor Critic (A2C)/

001 A2C.mp4

52.5 MB

001 A2C_en.vtt

10.8 KB

/.../11 - Generalized Advantage Estimation (GAE)/

001 Generalized Advantage Estimation.html

0.1 KB

/.../12 - Proximal Policy Optimization (PPO)/

001 Proximal Policy Optimization.html

0.1 KB

/.../13 - Phasic PPO/

001 Phasic PPO.html

0.1 KB

/~Get Your Files Here !/

Bonus Resources.txt

0.4 KB

 


