Stanford CS25: Transformers United V6 | Serving Transformers: Lessons from the Trenches
For more information about Stanford’s graduate programs, visit: https://online.stanford.edu/graduate-education
May 28, 2026
Serving Transformers: Lessons from the Trenches of Production Inference
This seminar covers insights, lessons, and gnarly scars from serving transformer model inference at the scale of thousands of GPUs.
Follow along with the seminar schedule. Visit: https://web.stanford.edu/class/cs25/
Guest Speaker: Charles Frye (Modal)
Instructors:
• Steven Feng, Stanford Computer Science PhD student and NSERC PGS-D scholar
• Karan P. Singh, Electrical Engineering PhD student and NSF Graduate Research Fellow in the Stanford Translational AI Lab
• Michael C. Frank, Benjamin Scott Crocker Professor of Human Biology; Director, Symbolic Systems Program
• Christopher Manning, Thomas M. Siebel Professor in Machine Learning, Professor of Linguistics and of Computer Science, Co-Founder and Senior Fellow of the Stanford Institute for Human-Centered Artificial Intelligence (HAI)