How to become an AI safety researcher

peterbarnett

How to become an AI safety researcher

peterbarnett

17 min readApr 12, 2022

114

Comments 15

Sorted by

New & upvoted

Andy Jones

To provide a contrasting view, I surveyed the background of Anthropic's technical staff a while ago.

12 out of 24 had PhDs.
Of those 12, 9 were in physics, two were in philosophy, and one was in biology.
Of the 12 without PhDs, the plurality were CS graduates, with the rest being a mix of physics, maths, engineering and biology.
Also one GED.

In particular, we had no ML PhDs as of when the survey was done (though we've hired two since!). I think Anthropic is an unusual organisation and our demographics won't generalise well to the broader community, but I do think it's representative of the ongoing shift to more empirical work.

Rohin Shah

9 + 2 + 2 ≠ 12? Did someone have some kind of double PhD?

Andy Jones

nah i just accidentally a word. fixed!

Mau

Thanks for this!

It is sometimes joked that the qualification needed for doing AI safety work is dropping out of a PhD program, which three people here have done (not that we would exactly recommend doing this!). Aside from those three, almost everyone else is doing or has completed a PhD.

Huh, I wonder if the sample was unrepresentatively high in its ~100% rate of PhD backgrounds? Here are the fractions of research staff at a few AI safety orgs that have started (and potentially completed) PhDs, based mostly on the organizations' websites and listed staff's LinkedIns:

1/8 of research staff at Redwood Research (although this doesn't reflect recent hires)
5/11 of research staff at MIRI
1/2 of research staff at ARC

(These orgs might be unrepresentatively low in PhD backgrounds, but the above numbers at least show that PhD backgrounds are far from universal among AI safety researchers.)

Relatedly, I've heard mixed opinions by safety researchers about whether PhDs are worth it. Some seem to worry that, compared to going directly into safety organizations (when that's an option), PhD programs often (a) have very high time costs, (b) offer very little feedback about what is actually helpful for alignment, and (c) incentivize less relevant/useful work.

So overall I think people shouldn't have the takeaway that there's consensus on the value of PhD programs for this work.

Rohin Shah

This process of formalization is one of the skills that studying mathematics can help build.

Huh, really? My experience of studying math is that you are given the formalizations and must derive conclusions from them, which doesn't seem like it would help much for the skill of coming up with good formalizations.

peterbarnett

That is definitely part of studying math. The thing I was trying to point to is the process of going from an idea or intuition to something that you can write in math. For example, in linear algebra you might have a feeling about some property of a matrix but then you actually have to show it with math. Or more relevantly, in Optimal Policies Tend to Seek Power it seems like the definition of 'power' came from formalizing what properties we would want this thing called 'power' to have.

But I'm curious to hear your thoughts on this, and if you think there are other useful ways to develop this 'formalization' skill.

Rohin Shah

For example, in linear algebra you might have a feeling about some property of a matrix but then you actually have to show it with math.

I would distinguish between "I have an informal proof sketch, or idea for why a theorem should be true, and now I must convert it to a formal proof" and "I am looking at some piece of reality, and have to create mathematical definitions that capture that aspect of reality". These might be sufficiently similar that practicing the former helps the latter, but I suspect they aren't.

Or more relevantly, in Optimal Policies Tend to Seek Power it seems like the definition of 'power' came from formalizing what properties we would want this thing called 'power' to have.

I agree this is a good example of formalization, but it's not an example of "studying math"?

if you think there are other useful ways to develop this 'formalization' skill.

I don't really know. Maybe some kinds of economic modeling? Though I haven't done this myself.

[anonymous]

Maybe someone should compile a bunch of exercises that train the muscle of formalizing intuitions

Iyngkarran Kumar

Strongly second this^

Yonatan Cale

What degrees did people get?

I didn't understand, are you recommending getting a similar degree?

peterbarnett

Not exactly, but it seems useful to know what other people have done if you want to do similar work to them.

Obviously with all the standard hedges that we don't want everyone doing exactly the same thing and thinking the same way.

Yonatan Cale

I suggest that this may send people down the wrong path by mistake:

Your article's title is "How to become an AI Safety researcher", the first title is "paths into AI safety", I would expect the things you write there are.. paths to become an AI safety researcher.. (?)

And regarding

know what other people have done if you want to do similar work to them

I think that in my field (software), this would be pretty wrong. Lots of people have done CS degrees and got to impressive places, but wouldn't suggest to others who want to reach the same places to do a CS degree too.

Lots of people ask me how I became a CTO as a service. The truth is I did a ton of embedded programming. Would I recommend others do that too? Not at all, and I don't plan to ever go back there myself

ben.smith

Interesting post Peter, really appreciate this and got a lot of useful ideas. While trying to assign the appropriate weight to the perspectives here it was useful for me to see where I've been consistent with these success stories and where I might have area to make up.

I wonder if it's worth following up this very useful qualitative work with a quantitative survey?

peterbarnett

Yeah, a more quantitative survey sounds like a useful thing to have, although I don't have concrete plans to do this currently.

I'm slight wary of causing 'survey fatigue' by emailing AI safety people constantly with surveys, but this seems like something that wouldn't be too fatiguing

Jay Bailey🔸

When you refer to "Technical AI safety research", the type of practical AI safety research that involves a lot of writing ML code, do you consider people with titles such as "ML Engineer" and "Research Engineer" at AI safety organisations to be performing this work? Would your advice in those sections apply to someone aiming for these positions?

Comments

More from the author

AI Governance to Avoid Extinction: The Strategic Landscape and Actionable Research Questions [MIRI TGT Research Agenda]

peterbarnett, Aaron_Scher·1y ago·10m read

Consider Preordering If Anyone Builds It, Everyone Dies

peterbarnett·11mo ago·2m read

153

Announcing What The Future Owes Us

peterbarnett·4y ago·1m read

Curated and popular this week

Counting animals: Stable population size is not equivalent to priority level

abrahamrowe, mal_graham🔸·5d ago·Curated 2d ago·16m read

AI Use Note: Main body text entirely human written. Claude (Opus 4.8) helped develop models of animal life histories in the appendix. Cross-posted from Good Structures. Executive Summary * Animal advocates sometimes make claims like “there are X of this animal...

151

Let's taboo the V-word

lincolnq·6d ago·8m read

“How long have you been v*g*n?” This is one of the most common icebreakers at animal protection events. It’s a baseline assumption, and it mostly holds true: if you’re out advocating for animals not to be tortured or abused, realistically these days you are v**n, or close. And it makes for good conversation. It seems fairly safe to assume when you meet strangers. But this assumption is hurting the movement in a way which we don’t always notice: someone new comes into the sp...

105

Spiro: an update 2.5 years on and a fundraising ask for expansion

Habiba Banu·3d ago·6m read

Summary Back in November 2023 I posted here to launch Spiro and raise our first $198k. Two and a half years later this is an update and a fundraiser for the next step. The short version: we've now reached over-5,900 people with TB preventive medicine, including over 3,000 children under five years old. Our early results have held up well an...