Mediocre AI safety as existential risk

technicalities

4 min read · Mar 16, 2022

Comments 12

UwU

https://reducing-suffering.org/near-miss/

Just gonna boost this excellent piece by Tomasik. I think partial alignment/near-misses causing s-risk is potentially an enormous concern. This is more true the shorter timelines are, since people are then more likely to try risky "hail mary" alignment techniques. It's also more true for less principled/Agent Foundations-type alignment directions.

[anonymous]

Can someone provide a more realistic example of partial alignment causing s-risk than SignFlip or MisconfiguredMinds? I don't see either of these as something you'd be reasonably likely to get by, say, only doing 95% of the necessary alignment research rather than 110%.

Question Mark

Brian Tomasik wrote something similar about the risks of slightly misaligned artificial intelligence, although it is focused on suffering risks specifically rather than on existential risks in general.

technicalities

I want a word which covers {x-risk, s-risk}, "Existential or worse".

Greg_Colbourn ⏸️

Doom?

Linch

"x-risk" covers "x-risk or worse" right?

Stefan_Schubert

Yes, I'd say so.

I guess that might raise the question of whether there is a term specifically for x-risks that aren't s-risks. My sense is that people often use the term "x-risk" for that concept as well, but in some contexts one might want another term to distinguish the two concepts.

MaxRa

I always thought s-risks are a subset of x-risks, e.g. that's how CLR framed it here:

https://longtermrisk.org/s-risks-talk-eag-boston-2017/

Basic argument seems to be: Permanent astronomical hell is also curtailment of humanity's potential, one that is very high in the dimensions of scope (astronomical) and intensity (involves hellish levels of suffering).

technicalities

Good framing, but I'm surprised they went for it since it partially obscures S behind its larger more popular brother X.

MaxRa

One explanation might be that historically there seems to have been somewhat of a divide between people worrying about s-risks and people worrying about x-risks (who were ~suffering-focused and ~classical utilitarians, respectively), and this framing might've helped get more cooperation started.

quinn

"At least existential"

technicalities

Gotta be one word or bust
