AI Safety Unconference @ conf

The purpose of the AI Safety Unconference is to foster connection and shared understanding between researchers interested in AI safety. It is a series of events, hosted alongside main AI/ML conferences. It welcomes both newly interested and established researchers. It features talks, moderated discussions, one-on-ones, free-form interactions, and participant-driven activities.

AISU @ NeurIPS 2022

~85 participants, independents and from various affiliations: OpenAI, DeepMind, Cambridge, MIRI, Mila, Cornell, Anthropic, MIT, Columbia, Stanford, U Toronto, Waterloo, Cooperative AI, ...

Lightning talks

Haydn Belfield: What standard-setting in EU + US might mean for AI safety
Esben Kran: Hackathons in AI safety research
Franziska Boenisch: Privacy attacks against federated learning
Aaron Tucker: Bandits with Costly Reward Observations
Lewis Hammond: Cooperative AI
Adam Dziedzic: Stealing and defending self-supervised models
David Lindner: Active Learning for Reward Modelling
Lauro Langosco di Langosco: An empirical demonstration of deceptive alignment
Zhijing Jin: Causally aligning language models

Facilitated discussions (1h each)

Haydn Belfield: AI governance
Adam Dziedzic: Is this model mine? On stealing and defending machine learning models
Lewis Hammond: Cooperative AI
Lauro Langosco di Langosco: Deceptive alignment

Testimonials

"This was a fascinating event that was helpful for keeping up with the cutting edge of the field, and for launching collaborations."
— Haydn Belfield

"The AI safety unconference was very useful to meet and talk with the AI safety researchers at NeurIPS."
— Esben Kran

"It was very reassuring to hear that diverse perspectives on AI risk are being studied seriously, including criticism of the AI safety community."
— Arvind Raghavan

Archived website

AISU @ NeurIPS 2019

~50 participants, independents and from various affiliations: OpenAI, DeepMind, Cambridge, MIRI, Mila, ...

Participant-driven discussions, multiple lighting talks, ...

Archived website

AISU @ NeurIPS 2018

~50 participants from various affiliations: UC Berkeley, Vector Institute, Mila, OpenAI, DeepMind, Oxford, CHAI, Mcgill, NYU, Partnership on AI, etc

Talks from:

Adam Gleave, Jan Leike, David Krueger, Dan Hendrycks, Aaron Tucker, Victoria Krakovna

Testimonials

"A great way to meet the best people in the area and propel daring ideas forward."
— Stuart Armstrong

"The event was a great place to meet others with shared research interests. I particularly enjoyed the small discussion groups that exposed me to new perspectives."
— Adam Gleave

Archived website