Rethinking Dexterous Manipulation: From Algorithms to Hardware

Monday, April 26, 2021 - 11:30am to 12:30pm

Event Calendar Category

CSAIL

Speaker Name

Vikash Kumar

Affiliation

Google Brain

Zoom meeting id

895135

Join Zoom meeting

https://mit.zoom.us/j/91419947439

Abstract

During the last decade, learning-based techniques have been quite successful in generating motor skills in simulations. However, these techniques in their current form are less effective on real robots, especially in contact-rich dexterous manipulation settings. In this talk, I'll revisit the problem of learning dexterous manipulation from the first principles. I'd argue that in addition to algorithmic developments, learning paradigms in the real world, as well as hardware infrastructure, needs significant attention to imparting human-level dexterity to our robots in a scalable way. 

On the algorithmic front, I'll discuss a game-theoretic formulation for model-based reinforcement learning (MBRL) that not only unifies, and generalizes many previous MBRL algorithms but also provides guidelines for designing more stable algorithms capable of learning general manipulation behaviors that can be retargeted to new unseen tasks. I’ll also share (less discussed) experiences and lessons learned while maturing our algorithmic paradigms to acquire high-dimensional, contact-rich, dexterous manipulation behaviors in the real world in a scalable way. And how we went from half a million-dollar hardware that broke every 30 mins to a scalable and modular 5000$ setup that can learn behaviors unattended for weeks without any human intervention.

Biography

Vikash Kumar is a research scientist in Facebook AI Research (FAIR). He finished his Ph.D. from the University of Washington with Prof. Emo Todorov and Prof. Sergey Levine, where his research focused on imparting human-level dexterity to anthropomorphic robotic hands. He continued his research as a post-doctoral fellow with Prof. Sergey Levine at Univ. of California Berkeley where he further developed his methods to work on low-cost scalable systems. He also spent time as a Research Scientist at OpenAI and Google-Brain where he diversified his research on low-cost scalable systems to the domain of multi-agent locomotion. He has also been involved with the development of the MuJoCo physics engine, now widely used in the fields of Robotics and Machine Learning. His works have been recognized with the best Master's thesis award, best manipulation paper at ICRA’16, CIFAR AI chair '20 (declined), and have been widely covered with a wide variety of media outlets such as NewYorkTimes, Reuters, ACM, WIRED, MIT Tech reviews, IEEE Spectrum, etc.