Tuesday, October 29, 2024 - 4:00pm
Event Calendar Category
Other LIDS Events
Speaker Name
Peter G. Chang
Affiliation
EECS / LIDS
Building and Room Number
32-D650 (LIDS Lounge)
"Evaluating the World Models Used by Pretrained Learners"
A common approach for assessing whether large pretrained models develop world models is to study the behavior of fixed models. However, many of the benefits of having a world model arise when transferring a model to new tasks (e.g., few-shot learning). In this paper, we ask: what does it mean to test if a learner has a world model embodied in it? We consider a simple definition of a true world model: a mapping from inputs to states. We introduce a procedure that assesses a learner’s world model by measuring its inductive bias when transferring to new tasks. This inductive bias can be measured in two distinct dimensions: does a learner extrapolate to new data by building functions of state, and to what degree do these functions capture the full state? We use this procedure to study the degree to which pretrained models extrapolate to new tasks based on state. We find that models that perform very well on next-token prediction can extrapolate to new tasks with very little inductive bias toward state. We conclude by assessing the possibility that these models learn bundles of heuristics that enable them to perform well on next-token prediction despite preserving little of state.
Peter G. Chang is a first-year Ph.D. student in EECS, advised by Prof. Sendhil Mullainathan. He received his A.B. in Physics and Mathematics and S.M. in Computer Science, both from Harvard University. He is primarily interested in augmenting scientists with tools to understand the world.
****************************************************
About Autonomy Tea Talks:
Tea talks are informal 20-minute talks for sharing ideas and making others aware of topics that may be of interest to the LIDS community.
The session is followed by light refreshments.
Email lids_autonomy_teas[at]mit[dot]edu for more information.
Sign up to present at LIDS Autonomy Tea Talks
LIDS Autonomy Tea Talks Committee
Ahmed Alahmed, Soumya Sudhakar, Jack Zhang