Fall 2022

October 18, 2022

Lessons from AlphaZero for Control System Design and Discrete Optimization

Speaker: Dimitri Bertsekas (MIT & ASU)

We focus on a new conceptual framework for approximate Dynamic Programming (DP) and Reinforcement Learning (RL). It revolves around two algorithms that operate in synergy through the powerful mechanism of Newton's method, applied to Bellman'...