Provably Robust Algorithms for Prediction and Control

Thursday, April 7, 2022 - 10:00am to 11:00am

Event Calendar Category

EECS

Speaker Name

Karan Singh

Affiliation

Microsoft Research Redmond

Building and Room Number

Grier A

Abstract

Feedback-driven decision-making systems are at the emerging frontier of machine learning applications. Upcoming applications of societal consequence, such as self-driving vehicles and smartwatch-based health interventions, have to contend with the challenge of operating in reactive stateful environments. In this talk, I will describe my work on designing principled robust algorithms for feedback-driven learning, with provable guarantees on computational and statistical efficiency. First, I will introduce an efficient instance-optimal algorithm for control in the presence of adversarial disturbances. Beyond the realm of both stochastic and robust control, such a data-driven notion of optimality combines worst-case guarantees with a promise of exceptional performance on benign instances. Moving on to prediction, I will present a computationally and statistically efficient forecasting strategy for latent-state dynamical systems exhibiting long term dependencies, mitigating the statistical challenge of learning with correlated samples, and the computational difficulties associated with a non-convex maximum likelihood objective. To conclude, I will discuss some practically relevant fundamental questions at the intersection of machine learning, optimization, and control that have the potential to unlock real progress in downstream applications.

Biography

Karan Singh is a postdoctoral researcher at Microsoft Research Redmond. In November 2021, he completed his PhD in Computer Science at Princeton University, where he was awarded the Porter Ogden Jacobus Fellowship, Princeton University's highest graduate student honor. Karan's research addresses statistical and computational challenges in feedback-driven interactive learning, spanning both prediction and control. His results draw from the algorithmic toolkits of optimization and online learning, together with techniques from dynamical systems and control theory.