2021 DeepMind x UCL RL Lecture Series - Multi-step & Off Policy [11/13]
Research Scientist Hado van Hasselt discusses multi-step and off policy algorithms, including various techniques for variance reduction.
Slides: https://dpmd.ai/offpolicy
Full video lecture series: https://dpmd.ai/DeepMindxUCL21
Slides: https://dpmd.ai/offpolicy
Full video lecture series: https://dpmd.ai/DeepMindxUCL21
DeepMind
Artificial intelligence could be one of humanity's most useful inventions. DeepMind aims to build advanced AI to expand our knowledge and find new answers. By solving this one thing, we believe we could help people solve thousands of problems.
We’re a te...