
Stanford CS224R Deep Reinforcement Learning | Spring 2025 | Lecture 9: RL for LLMs
View course details: https://online.stanford.edu/courses/xcs224r-deep-reinforcement-learning
April 30, 2025
This guest lecture covers RL for LLMs: preference optimization.
To learn more about enrolling in the graduate course, visit: https://online.stanford.edu/courses/cs224r-deep-reinforcement-learning
To follow along with the course schedule and syllabus, visit:
https://cs224r.stanford.edu/
Archit Sharma
Researcher on Gemini team, Lead author of DPO
April 30, 2025
This guest lecture covers RL for LLMs: preference optimization.
To learn more about enrolling in the graduate course, visit: https://online.stanford.edu/courses/cs224r-deep-reinforcement-learning
To follow along with the course schedule and syllabus, visit:
https://cs224r.stanford.edu/
Archit Sharma
Researcher on Gemini team, Lead author of DPO
Stanford Online
You can gain access to a world of education through Stanford Online, the Stanford School of Engineering’s portal for academic and professional education offered by schools and units throughout Stanford University. https://online.stanford.edu/
Our robust ...