Neel Nanda on Mechanistic Interpretability: Progress, Limits, and Paths to Safer AI (part 2)

80000_Hours

20 min read

Comments 1

JKM

9mo

His secret? “It’s mostly luck,” he says, but “another part is what I think of as maximising my luck surface area.”

It's worth noting that Neel has two gold and one bronze medal from the International Mathematical Olympiad. In other words, he's a genius. That's got to help a lot in succeeding in this field.
