How the AI safety technical landscape has changed in the last year, according to some practitioners

tlevin

How the AI safety technical landscape has changed in the last year, according to some practitioners

tlevin

2 min readJul 26, 2024

Comments 1

Sorted by

New & upvoted

Chris Leong

I don’t know the exact dates, but: a)proof-based methods seem to be receiving a lot of attention b) def/acc is becoming more of a thing c) more focus on concentration of power risk (tbh, while there are real risks here, I suspect most work here is net-negative)

Comments

More from the author

Timelines to what? A proposal

tlevin·3w ago·4m read

What SB 53, California’s new AI law, does

tlevin·9mo ago·5m read

118

A case for donating to AI risk reduction (including if you work in AI)

tlevin·1y ago·4m read

Curated and popular this week

Cultivating hope: calibrating the expectations for cultivated meat to end factory farming

PAMC 🔸·1w ago·Curated 3d ago·22m read

Maybe do the thing you wish CEA would do

alejoacelas 🔸·2d ago·2m read

I used AI to fix transcription errors, rerrarange the ideas, and suggest tweaks to the title and some sentences. Three of the most exciting projects to come out of EA in recent years are, in a vague sense, CEA spinouts: * Kairos is directly a spinout of CEA and now handles most support for university AI safety groups. Basically everyone I've found who knows them is really excited about what they do * NEST is an opinionated ideas-fi...

GWWC's 2025 impact evaluation (executive summary)

Aidan Whitfield🔸, Giving What We Can🔸·5d ago·2m read

This post presents the executive summary from Giving What We Can’s impact evaluation for 2025. At the end of this post we share links to more information, including the full report and...

Recent opportunities to take action

RP is looking for project founders in neglected animal areas

Rethink Priorities·3d ago·7m read

158

Possible mistake EAs are making and shout out to Pause AI UK

Michelle_Hutchinson·1w ago·4m read

Announcing the Safe Pareto Improvements (SPI) Fundamentals Program

Center on Long-Term Risk, Anthony DiGiovanni 🔸, Santeri T 🔹·2d ago·3m read