My favorite AI governance research this year so far

Zach Stein-Perlman

My favorite AI governance research this year so far

Zach Stein-Perlman

9 min readJul 23, 2023

Comments 3

Sorted by

New & upvoted

Abby Babby

Really useful, thanks so much for sharing! "Towards best practices in AGI safety and governance: A survey of expert opinion" was also my favorite AI Governance research this year so far. Happy to see it featured here :)

Also just want to highlight the only course on AI Governance that seems to exist right now: https://course.aisafetyfundamentals.com/governance

Zach Stein-Perlman

My favorite AI governance research since this post (putting less thought into this list):

Responsible Scaling Policies (METR 2023)
Deployment corrections (IAPS: O'Brien et al. 2023)
Open-Sourcing Highly Capable Foundation Models (GovAI: Seger et al. 2023)
Do companies’ AI Safety Policies meet government best practice? (CFI: Ó hÉigeartaigh et al. 2023)
AI capabilities can be significantly improved without expensive retraining (Davidson et al. 2023)

I mostly haven't really read recent research on compute governance (e.g. 1, 2) or international governance (e.g. 1, 2, 3). Probably some of that would be on this list if I did.

I'm looking forward to the final version of the RAND report on securing model weights.

Feel free to mention your favorite recent AI governance research here.

JP Addison🔸

This is a great list — I'm curating.

This space is changing fast, and curation and distillation seem like important work. Thanks for doing it!

Comments

More from the author

220

FLI open letter: Pause giant AI experiments

Zach Stein-Perlman·3y ago·3m read

134

Maybe Anthropic's Long-Term Benefit Trust is powerless

Zach Stein-Perlman·2y ago·3m read

128

Introducing AI Lab Watch

Zach Stein-Perlman·2y ago·2m read

Curated and popular this week

What would an animal-aligned AI be aligned to?

Aidan Kankyoku, Anima International·1w ago·Curated 5d ago·15m read

This is a crosspost from the new Animal Welfare Alignment Newsletter by Anima International. You can subscribe on Substack if you are interested in following these efforts. Audio reading also available on Substack. The goals of this post are to: 1. Raise a question I see as crucially important to the goal of aligning AI to animal welfare...

201

The first video from Giving What We Can's new channel is out now!

JustinPortela·6d ago·1m read

Hello! I'm Justin Portela. I got hired by GWWC to make YouTube videos after AI in Context did such a kickass job. My channel is using that same cinematic, high-production value beauty to talk about everything in the EA universe that isn't AI. ...

122

Let's taboo the V-word

lincolnq·1d ago·8m read

“How long have you been v*g*n?” This is one of the most common icebreakers at animal protection events. It’s a baseline assumption, and it mostly holds true: if you’re out advocating for animals not to be tortured or abused, realistically these days you are v**n, or close. And it makes for good conversation. It seems fairly safe to assume when you meet strangers. But this assumption is hurting the movement in a way which we don’t always notice: someone new comes into the sp...

Recent opportunities to take action

EA Organisation Updates thread: July 2026

Dane Valerie·1d ago·1m read

EA Netherlands is recruiting up to two board members — open internationally (one potentially as chair)

James Herbert·1m ago·2m read

The Humane League UK is hiring a Senior Fundraising Administrator

Molly Archer-Zeff, Gavin Chappell-Bates·2h ago·1m read

^{^}

Pieces 1, 2, 3, and 4 are aimed directly at extremely important questions; 6 and 7 are aimed directly at very important questions.

^{^}

For pieces 1, 2, 3, 4, and 6 I would have been very enthusiastic about the proposal. For 5 and 7 I would have been cautiously excited or excited if the project was executed by someone who's a good fit. Note that the phenomenon of my favorite research mostly being research I expect to like is presumably partially due to selection bias in what I read. Moreover, it is partially due to the fact that I haven't deeply engaged with 6 or the technical component of 3 and only engaged with some parts of 7– so saying they're favorites is partially because they sound good before I know all of the details.

My favorite AI governance research this year so far

My favorite AI governance research this year so far

1. Model evaluation for extreme risks (DeepMind, Shevlane et al., May)

2. Towards best practices in AGI safety and governance: A survey of expert opinion (GovAI, Schuett et al., May)

3. What does it take to catch a Chinchilla? Verifying Rules on Large-Scale Neural Network Training via Compute Monitoring (Shavit, March)

4. Survey on intermediate goals in AI governance (Rethink Priorities, Räuker and Aird, March)

5. Literature Review of Transformative AI Governance (LPP, Maas, forthcoming) [edit: published, Nov 2023]

6. “AI Risk Discussions” website: Exploring interviews from 97 AI Researchers (Gates et al., February)

7. What a compute-centric framework says about AI takeoff speeds - draft report (OpenPhil, Davidson, January)