Social Intelligence Virtual Gathering
Social Gathering at ICLR 2021 - Thursday May 6th, 2-4pm PDT
Google Research, UC Berkeley
Salesforce Research
Massachusetts Institute of Technology
Massachusetts Institute of Technology
Pacific Time (PDT) | ||
---|---|---|
02:00- 02:05pm | Introductory Remarks | |
02:05-02:25pm | Natasha Jaques Social Reinforcement Learning Show Abstract
Social learning helps humans and animals rapidly adapt to new circumstances, and drives the emergence of complex learned behaviors. This talk focuses on how Reinforcement Learning (RL) agents can benefit from social learning in naturalistic multi-agent environments. In this setting, which is analogous to that of autonomous driving, there are other agents that may have relevant knowledge, but they are not explicitly interested in teaching the RL agent. We first show that traditional model-free RL algorithms do not benefit from social learning in such contexts, and cannot discover the optimal policy even when nearby agents are visibly following it. However, by learning an unsupervised model that predicts the next state, agents implicitly model the behavior of other agents and can leverage social learning to improve their performance. Further, agents that engage in social learning can generalize better to new environments, by following a strategy of using social learning to obtain information about how to perform well on the new task. We then introduce an improved method for social learning, PsiPhi-learning, which leverages the power of successor features to improve RL through modeling other agents, and improve modeling other agents through collecting individual experience via RL. PsiPhi-learning improves over both traditional RL techniques and recent imitation learning techniques, flexibly benefitting from learning from other agents when it is relevant to the task at hand.
|
|
02:25-02:45pm | Alexander Trott The AI Economist: Improving Equality and Productivity with AI-Driven Tax Policies Show Abstract |
|
02:45-03:00pm | Discussion Session | |
03:00-03:20pm | Shari Liu Social intelligence: Origins in human infancy, and implications for engineering Show Abstract |
|
03:20-03:40pm | Dylan Hadfield-Menell The Incompleteness Problem for Social AI Systems |
|
03:40-03:55pm | Discussion Session | |
03:45-04:00pm | Closing Remarks |