Robot Foundation Models
Keynote: Robot Foundation Models - Sergey Levine, Associate Professor, Department of Electrical Engineering and Computer Sciences, UC Berkeley General-purpose foundation models have transformed how we approach machine learning: instead of training domain-specific classifiers or generative models, we now use general-purpose models trained on broad web-scale datasets across natural language processing, computer vision, multimedia generation, and speech. What would it take to enable such general models to interact with the physical world, enabling them to control robotic systems? In this talk, I’ll provide a brief summary of the history of vision-language-action (VLA) models, describe recent developments, and present results of state-of-the-art VLAs.