2021 DeepMind x UCL RL Lecture Series - Model-free Control [6/13]
Research Scientist Hado van Hasselt covers how prediction algorithms can be used for policy improvement, leading to control algorithms that learn good behaviour policies from sampled experience.
Slides: https://dpmd.ai/modelfreecontrol
Full video lecture series: https://dpmd.ai/DeepMindxUCL21
DeepMind