What AI companies can do today to help with the most important century

Holden Karnofsky

What AI companies can do today to help with the most important century

Comments

More from the author

135

Responsible Scaling Policy v3

Holden Karnofsky·4mo ago·43m read

644

Some comments on recent FTX-related events

Holden Karnofsky·3y ago·5m read

523

EA is about maximization, and maximization is perilous

Holden Karnofsky·3y ago·8m read

Curated and popular this week

What would an animal-aligned AI be aligned to?

Aidan Kankyoku, Anima International·1w ago·Curated 12h ago·15m read

This is a crosspost from the new Animal Welfare Alignment Newsletter by Anima International. You can subscribe on Substack if you are interested in following these efforts. Audio reading also available on Substack. The goals of this post are to: 1. Raise a question I see as crucially important to the goal of aligning AI to animal welfare...

153

Maybe do the thing you wish CEA would do

alejoacelas 🔸·6d ago·2m read

I used AI to fix transcription errors, rerrarange the ideas, and suggest tweaks to the title and some sentences. Three of the most exciting projects to come out of EA in recent years are, in a vague sense, CEA spinouts: * Kairos is directly a spinout of CEA and now handles most support for university AI safety groups. Basically everyone I've found who knows them is really excited about what they do * NEST is an opinionated ideas-fi...

151

The first video from Giving What We Can's new channel is out now!

JustinPortela·2d ago·1m read

Hello! I'm Justin Portela. I got hired by GWWC to make YouTube videos after AI in Context did such a kickass job. My channel is using that same cinematic, high-production value beauty to talk about everything in the EA universe that isn't AI. ...

Recent opportunities to take action

Find funding, fast

Austin·1d ago·3m read

New round of digital minds funding opportunities at Longview

zdgroff, Longview Philanthropy·3d ago·2m read

173

Possible mistake EAs are making and shout out to Pause AI UK

Michelle_Hutchinson·2w ago·4m read

Geoffrey Miller

Holden - thanks for this thoughtful and constructive piece.

However, I think a crucial strategy is missing here.

If we're serious that AI imposes existential risks on humanity, then the best thing that AI companies can do to help us survive this pivotal century is simple: Shut down their AI research. Do something else. Act like they care about the fate of their kids and grandkids.

AI research doesn't need to be shut down forever. Maybe just for the next few centuries, until we better understand the risks and how to manage them.

I simply don't understand why so many EAs are encouraging AI development as if it's too cool to question, too inevitable to challenge, and too incentivized to deter. Almost all of us agree that AI will impose potentially catastrophic risks. We all agree that AI alignment is far from solved, and many of us believe it probably won't be solved in time to save us from recklessly fast AI development.

We probably can't shut down AI research through government regulation or gentle coaxing, given the coordination problems, governance problems, arms races, and corporate incentives. But we could probably do it through promoting new social & ethical norms that impose a heavy moral stigma against AI research, AI researchers, and AI companies. Historically, intense moral stigmatization has been successful at handicapping, delaying, pausing, defunding, marginalizing, and/or shutting down many research fields. And moral stigmatization in the modern social media world can operate even more quickly, powerfully, globally, and effectively. (I'm working on a longer piece about this moral stigmatization strategy for reducing AI X-risk.)

In short: maybe it's time for EA to stop playing nice with the AI industry -- given that the AI industry is not playing safely with humanity's future.

And maybe it's time to call a spade a spade: if AI companies are pursuing AI capabilities at a rate that could end our species, without any credible safeguards that could protect our species, then they're evil. Maybe we should say they're evil, treat them as evil, and encourage others to do the same, until they stop doing evil.

Disclosure: my wife works at one such company (Anthropic) and used to work at another (OpenAI), and has equity in both. ↩
Though I won’t, because I decided I don’t want to get into a thing about whom I did and didn’t link to. Feel free to give real-world examples in the comments! ↩
Now, AI companies could sometimes be doing “responsible” or “safety-oriented” things in order to get good PRs, recruit employees, make existing employees happy, etc. In this sense, the actions could be ultimately profit-motivated. But that would still mean there are enough people who care about reducing AI risk that actions like these have PR benefits, recruiting benefits, etc. That’s a big deal! And it suggests that if concern about AI risks (and understanding of how to reduce them) were more widespread, AI companies might do more good things and fewer dangerous things. ↩
You could argue that it would be better for the world to develop extremely powerful AI systems sooner, for reasons including:
- You might be pretty happy with the global balance of power between countries today, and be worried that it’ll get worse in the future. The latter could lead to a situation where the “wrong” government leads the way on transformative AI.
- You might think that the later we develop transformative AI, the more quickly everything will play out, because there will be more computing resources available in the world. E.g., if we develop extremely powerful systems tomorrow, there would only be so many copies we could run at once, whereas if we develop equally powerful systems in 50 years, it might be a lot easier for lots of people to run lots of copies. (More: Hardware Overhang)
A key reason I believe it’s best to avoid acceleration at this time is because it seems plausible (at least 10% likely) that transformative AI will be developed extremely soon - as in within 10 years of today. My impression is that many people at major AI companies tend to agree with this. I think this is a very scary possibility, and if this is the case, the arguments I give in the main text seem particularly important (e.g., many key interventions seem to be in a pretty embryonic state, and awareness of key risks seems low).
A related case one could make for acceleration is “It’s worth accelerating things on the whole to increase the probability that the particular company in question succeeds” (more here: the “competition” frame). I think this is a valid consideration, which is why I talk about tricky tradeoffs in the main text. ↩
Note that my wife is a former employee of OpenAI, the company I link to there, and she owns equity in the company. ↩

What AI companies can do today to help with the most important century

What AI companies can do today to help with the most important century

Some basics: alignment research, strong security, safety standards

Avoiding hype and acceleration

Preparing for difficult decisions ahead

Succeeding

Some things I’m less excited about

Footnotes