| Name | Last modified | Size | Description |
|
| Parent Directory | | - | |
| Lecture10_1.mp4 | 2021-01-25 11:15 | 1.6G | |
| moocrobot-tud2021-2-m10-1_Todays_Objective.mp4 | 2021-05-09 11:38 | 33M | |
| moocrobot-tud2021-2-m10-2_Introduction.mp4 | 2021-05-09 11:39 | 143M | |
| moocrobot-tud2021-2-m10-3_Action_Selection.mp4 | 2021-05-09 11:39 | 95M | |
| moocrobot-tud2021-2-m10-4_Model_Free_Policy_Search.mp4 | 2021-05-09 11:39 | 26M | |
| moocrobot-tud2021-2-m10-5_Episole-Based_Evaluation_Strategy.mp4 | 2021-05-09 11:39 | 31M | |
| moocrobot-tud2021-2-m10-6_Step-Based_Evaluation_Strategy.mp4 | 2021-05-09 11:39 | 21M | |
| moocrobot-tud2021-2-m10-7_Summary-Episole-Based_vs_Step-Based.mp4 | 2021-05-09 11:40 | 55M | |
| moocrobot-tud2021-2-m10-8_Episole-Based_Policy_Search-Gradient_Based_Policy_Updates.mp4 | 2021-05-09 11:40 | 235M | |
| moocrobot-tud2021-2-m10-9_Episole-Based_Policy_Search-Finite_Differences.mp4 | 2021-05-09 11:40 | 52M | |
| moocrobot-tud2021-2-m10-10_Episole-Based_Policy_Search-Likelihood_Policy_Gradients.mp4 | 2021-05-09 11:41 | 57M | |
| moocrobot-tud2021-2-m10-11_Episole-Based_Policy_Search-Baselines.mp4 | 2021-05-09 11:41 | 48M | |
| moocrobot-tud2021-2-m10-12_Step-Based_Policy_Gradients-Derivations.mp4 | 2021-05-09 11:41 | 315M | |
| moocrobot-tud2021-2-m10-13_Step-Based_Policy_Gradients-Matrix_in_Standard_Gradients.mp4 | 2021-05-09 11:42 | 285M | |
| moocrobot-tud2021-2-m10-14_KL_Divergences.mp4 | 2021-05-09 11:43 | 139M | |
| moocrobot-tud2021-2-m10-15_KL_Divergences_and_The_Fisher_Information_Matrix.mp4 | 2021-05-09 11:44 | 148M | |
| moocrobot-tud2021-2-m10-16_Natural_Gradients.mp4 | 2021-05-09 11:44 | 249M | |
| moocrobot-tud2021-2-m10-17_Wrap-up.mp4 | 2021-05-09 11:45 | 26M | |
|