Effective Altruism Forum

Comments 1

Sorted by

New & upvoted

Executive summary: OpenAI's preparedness framework for AI safety makes valuable contributions, especially around communication, clarity, openness to feedback, and emergency response planning. But it could be strengthened by more focus on general intelligence, clearer safeguard requirements, adjusting autonomy thresholds, granting veto power to safety roles, and enhancing security.

Key points:

OpenAI communicated the framework well, with a concern-raising name and clarity that signals risks to policymakers.
Concrete eval examples, risk spectrums, and emergency plans are strengths.
More focus is needed on general intelligence safety levels, not just narrow capabilities.
Safeguard requirements for high-risk models should be specified.
Autonomy thresholds may be too high given deployment plans.
Grant veto power on models to the Safety Advisory Chair and Preparedness head.
Commit to security practices that protect models from theft.
Increase frequency of emergency response drills.

This comment was auto-generated by the EA Forum Team. Feel free to point out issues with this summary by replying to the comment, and contact us if you have feedback.

Comments

Curated and popular this week

Counting animals: Stable population size is not equivalent to priority level

abrahamrowe, mal_graham🔸·1w ago·Curated 5d ago·16m read

AI Use Note: Main body text entirely human written. Claude (Opus 4.8) helped develop models of animal life histories in the appendix. Cross-posted from Good Structures. Executive Summary * Animal advocates sometimes make claims like “there are X of this animal...

114

Spiro: an update 2.5 years on and a fundraising ask for expansion

Habiba Banu·6d ago·6m read

Summary Back in November 2023 I posted here to launch Spiro and raise our first $198k. Two and a half years later this is an update and a fundraiser for the next step. The short version: we've now reached over-5,900 people with TB preventive medicine, including over 3,000 children under five years old. Our early results have held up well an...

How (not) to fundraise from Anthropic staff

Jack Lewars·5d ago·7m read

Adapted from my Substack, Funding Anthropalypse. Short version: if you want a share of the coming Anthropic and OpenAI windfall - the $37bn+ that could be in play next year - the way in is to become 'legibly excellent', so the evaluators and donors that frontier lab staff already trust point them to yo...

Recent opportunities to take action

Starting an EA group @ SUNY Binghamton

micahzarin·8h ago·1m read

Marginal Victories: career advising and opportunities for U.S. democracy preservation & political work

Annika Burman 🔸·1d ago·2m read

I'm stepping down as Hive's Executive Director, and we're hiring my successor

SofiaBalderson, Hive·1d ago·3m read