When "yang" goes wrong

For AI risk stories centered on this dynamic, see Hendrycks (2023) and Critch (2021). ↩︎
See, for example, the discourse about the "strategy-stealing assumption," and about the comparative costs of different sorts of expansion-into-space. ↩︎
Yudkowsky, for example, seems to generally expect sufficiently rational agents to avoid multi-polar traps. ↩︎
I think this is the question at stake with the more reasonable forms of accelerationism. ↩︎
See Bostrom (2004), Section 11, for extremely similar rhetoric. ↩︎
Indeed, my sense is that debates about "top down vs. bottom up" often occur at the level of mood affiliation and priors, when in fact, the devil is in the details, and in those pesky empirics. For what it's worth, though: on AI, my current view is that it should be illegal to build bioweapons in your basement, and that it's fine to regulate nukes, and that if the logic driving those conclusions generalizes to AI, we should follow the implication. ↩︎
Bostrom's "semi-anarchic default condition" is characterized by limited capacity for preventative policing (e.g., not enough to ensure extremely reliable adherence to the law), limited capacity for global governance to solve coordination problems, and sufficiently diverse motivations that many actors are substantially selfish, and some small number are omnicidal. ↩︎
Or at least, someone with your values. H/t Crawford: "If you wish to make an apple pie, you must first become dictator of the universe." ↩︎
See e.g. here for discussion from Yudkowsky himself. Though note that agents with goals that are bounded in time, or in resource-hungry-ness, can still create successor-agents without these properties. ↩︎
Though see Alexander on technocracy for some useful nuance. ↩︎
At least if they read Scott Alexander. Which many do. ↩︎
Unless it can't solve the alignment problem, either. ↩︎
Unless, of course, your heart says otherwise. ↩︎
Other endings include: everyone dies, or someone wins and doesn't eat the galaxies. Or maybe balance of power between hearts stays in perpetual flux, without ever crystallizing. ↩︎
Or something. At least on our current picture. Plus, you know, everything happening in the rest of the multiverse. ↩︎
Yudkowsky is clear that in principle, you can build agents without these properties, the same way you can build a machine that thinks 222+222=555. But the maximizer-ish properties are extremely "natural," especially if you're capable enough to burn the GPUs. ↩︎
I do think most EAs are sincerely trying to do good. Indeed, I think the EA community is notably high on sincerity in general. But sincerity can be scary, too. ↩︎
And even earlier, our moral psychology was plausibly shaped and "domesticated," in central part, by an evolutionary history in which power-seeking bullies got, um, murdered and removed from the gene pool. Thanks to Carl Shulman for discussion. ↩︎
See e.g. here for a longer list of views/vibes that accelerationism can encompass. ↩︎
See e.g. here, being excited about AIs wiping out humans; and here, siding with Moloch against Elua. (Though, I also think that Land, in the latter post, can be read more directly as a full-scale nihilist; and I don't claim any deep engagement with Land's corpus as a whole.) ↩︎
In particular, my sense is that casual proponents of Land-ian vibes aren't often distinguishing clearly between the empirical claim that Strength will lead to something-else-judged-Good, and the normative claim that Strength is Good whatever-it-leads-to -- such that e.g. the response to "what if the Nazis are Strong?" isn't "then Strength would be bad in that case" but rather "they won't be Strong." And in fairness, per some of my comments about Alexander above, I do think Strength favors Goodness in various ways (more on this in a future essay). But the conceptual distinction (and importance of continuing to draw it) persists hard. ↩︎
e/acc founder Beff Jezos: "The fundamental basis for the movement is this sort of realization that life is a sort of fire that seeks out free energy in the universe and seeks to grow...we’re far more efficient at producing heat than let’s say just a rock with a similar mass as ourselves. We acquire free energy, we acquire food, and we’re using all this electricity for our operation. And so the universe wants to produce more entropy and by having life go on and grow, it’s actually more optimal at producing entropy because it will seek out pockets of free energy and burn it for its sustenance and further growth." We could potentially reconstruct Jezos's position as a purely empirical claim about how the universe will tend to evolve over time -- one that we should incorporate into our planning and prediction. But it seems fairly clear, at least in this piece, that he wants to take some kind of more normative guidance from the direction in question. See also this quote, which I think is from Land's "The Thirst for Annihilation" (this is what Liu here suggests) though I'd need to get the book to be sure: "All energy must ultimately be spent pointlessly and unreservedly, the only questions being where, when, and in whose name... Bataille interprets all natural and cultural development upon the earth to be side-effects of the evolution of death, because it is only in death that life becomes an echo of the sun, realizing its inevitable destiny, which is pure loss." I find it interesting that for Land/Bataille, here, the ultimate goal seems to be death, loss, nothingness. And on this reading, it's really quite a negative and pessimistic ethic (cf "virulent nihilism," "thirst for annihilation," etc). But the accelerationists seem to think that their thing is optimism? ↩︎
