
March 17 - 23 will be Existential Choices Debate Week on the EA Forum. We’ll be discussing the debate statement "On the margin[1], it is better to work on reducing the chance of our[2] extinction than increasing the value of futures where we survive[3]" 


Like the last two debate weeks (1,2), during the event you’ll find a banner on the front page with an axis going from “strong disagree” to “strong agree” where forum users can place their avatar, and attach a comment explaining their position.

We’ll also be hosting a “symposium” with Will MacAskill — a chance to join in a live debate with experts on the issue. Provisionally, this will happen on the Monday of debate week, in the comments of a symposium post[4].

If you’d like to take part, you can start thinking about writing a post to be published during the week. Posts considered a part of the event don’t have to answer the question directly — the rule of thumb is that if a post could plausibly help someone decide how to vote, it’s relevant for the week.

As always, message me if you have any questions, or would benefit from writing support.

Why this topic?

In recent years, we’ve somewhat moved away from the longtermism label (I at least see it pop up far less often on the Forum). Partially this is for reasons well phrased by Scott Alexander in his post "Long-Termism" [sic][5] vs. "Existential Risk".

A movement whose descriptive priorities can be explained as extinction risk reduction may as well just say that, rather than appealing to a philosophy which concluded that extinction risk reduction is important. However, if we don’t discuss longtermism, we also won’t discuss a range of potential projects not captured by common sense morality or extinction reduction. These are projects that aim for trajectory change, or a better future.

Now, terms like “post-AGI-governance” are starting to pop up… attempts to seed projects which hope to improve the long term future in ways other than merely (haha) ensuring that it exists.

This seems like a point when it is important to ask the question: should we be doing this now? Are there promising projects to be sought out, researched, or directly worked on, which are more important than extinction reduction? Is this area of research and action a tangent, or a necessity?


Extinction is far narrower than “existential risk”:  The extinction of “earth originating intelligent life” means a future devoid of any value which could have come from those lives.  But “existential risk” is a broader term which includes irreversibly bad futures — full of suffering, led by an immortal dictator, trapped in pre-industrial scarcity forever. The future’s value could be below 0. In this debate, the ‘value of the future’ side, not the ‘extinction’ side, incorporates these risks which don’t route through extinction.

Edit: There has been some discussion of the haziness of the extinction definition in the comments. The solution to this is difficult - "extinction" seems like it should be easier to define than "existential risk", but it has its own issues. One odd scenario is a future where morally valuable, conscious, AI systems are living good lives, but at some point in the past, they killed off all the humans. Under the definition we are using in this debate this would not count as extinction (even though humans are gone, the value of the future is still secured by the descendant AI systems). For the purpose of our week long debate, I'll treat this as a feature not a bug. If you think this would be a bad and likely outcome, then that might be a reason to vote disagree on the statement. 

We are treating extinction reduction and increasing the value of the future as mutually exclusive: In order to make this a debate of (at least) two sides, we are separating interventions which reduce the risk of extinction, and interventions that increase the value of the future, via means other than extinction reduction. Otherwise, a substantive position in the argument, that extinction risk reduction is the best way to increase the value of the future, would be recorded as a neutral vote, rather than a strong agree.

Tractability is a part of this conversation. Including “on the margin” in the debate statement means that we can’t avoid thinking about tractability, i.e., where extra effort would actually do the most good today. This makes the debate harder, but more action-relevant. Remember, though: although the core debate question relies on claims about tractability, you can still write about anything that could meaningfully influence a vote.

Please do ask for clarification publicly below - I'll add to this section if multiple people are confused about something. 

How to take part:

Vote and comment

The simplest way to contribute to the debate week (though you can make it as complex as you like) is voting on the debate week banner, and commenting on your vote, describing your position. This comment will be attached to your icon on the banner, but it’ll also be visible on the debate week discussion thread, which will look like this.


Everyone is invited to write posts for debate week! All you need to do is tag them with the “Existential Choices Debate Week” tag. Posts don’t have to end with a clear position on the debate statement to be useful. They can also:

  • Summarise other arguments and classify them.
  • Bring up considerations which might influence someone’s position on the statement.
  • Crosspost content from elsewhere that contributes to the debate.

Message me if you have questions or would like feedback (anything from “is this post suitable?” to “does this post make my point clearly enough?”)

Turn up for the Symposium

We’ll hold the “Symposium” on Monday of debate week. It’ll (probably)[7] be a post, like an AMA post, where a conversation will happen in the comments, between Will MacAskill and other experts.  If you log onto the Forum at that time, you can directly take part in the debate, as a commenter.

We reserve the right to announce different moderation rules for this conversation. For example, we’ll consider hiding comments that aren’t on topic to make sure the discussion stays valuable.

Further reading:

A helpful term, “Maxipok”, comes from this paper by Nick Bostrom. In it, he writes:

  • “It may be useful to adopt the following rule of thumb for moral action; we can call it Maxipok: Maximize the probability of an okay outcome [bolding mine], where an “okay outcome” is any outcome that avoids existential disaster. At best, this is a rule of thumb, a prima facie suggestion, rather than a principle of absolute validity, since there clearly are other moral objectives than preventing terminal global disaster.”


If there are other posts you think more people should read, please comment them below. I might highlight them during the debate week, or before. 

  1. ^

     ‘on the margin’ = think about where we would get the most value out of directing the next indifferent talented person, or indifferent funder.

  2. ^

     ‘our’ = earth-originating intelligent life (i.e. we aren’t just talking about humans because most of the value in expected futures is probably in worlds where digital minds matter morally and are flourishing)

  3. ^

     Through means other than extinction risk reduction.  

  4. ^

     This may change based on the preferences of the participants. More details will come soon.

  5. ^

     Sorry, I’m reading a book right now (The Power Broker) with some really snarkily placed [sic]s and I couldn’t help it. Longtermism is the philosophy; long-termism is the vibe of Long Now, The Long View, the Welsh Future Generations Commission, etc.

  6. ^

     Or very close to zero when compared to other future trajectories. For example, worlds where only a small population of intelligent life exists on earth for a relatively short time are often treated as extinction scenarios when compared to worlds where humans or their descendants occupy the galaxy.

  7. ^

     This depends on the preferences of the participants, which are TBC.



I want to make salient these propositions, which I consider very likely:

  1. In expectation, almost all of the resources our successors will use/affect come via von Neumann probes (or maybe acausal trade or affecting the simulators).
  2. If 1, the key question for evaluating a possible future from scope-sensitive perspectives is will the von Neumann probes be launched, and what is it that they will tile the universe with? (modulo acausal trade and simulation stuff)
  3. [controversial] The best possible thing to tile the universe with (maybe call it "optimonium") is wildly better than what you get if you're not really optimizing for goodness,[1] so given 2, the key question is will the von Neumann probes tile the universe with ~the best possible thing (or ~the worst possible thing) or something else?

Considerations about just our solar system or value realized this century miss the point, by my lights. (Even if you reject 3.)

  1. ^


    Call computronium optimized to produce maximum pleasure per unit of energy "hedonium," and that optimized to produce maximum pain per unit of energy "dolorium," as in "hedonistic" and "dolorous." Civilizations that colonized the galaxy and expended a nontrivial portion of their resources on the production of hedonium or dolorium would have immense impact on the hedonistic utilitarian calculus. Human and other animal life on Earth (or any terraformed planets) would be negligible in the calculation of the total. Even computronium optimized for other tasks would seem to be orders of magnitude less important.

    So hedonistic utilitarians could approximate the net pleasure generated in our galaxy by colonization as the expected production of hedonium, multiplied by the "hedons per joule" or "hedons per computation" of hedonium (call this H), minus the expected production of dolorium, multiplied by "dolors per joule" or "dolors per computation" (call this D).
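
    The approximation in the quoted passage can be written compactly. This is only a sketch of the quoted wording: $H$ and $D$ are the quote's own symbols, while $V$ (net pleasure) and $Q_h$, $Q_d$ (the quantities of hedonium and dolorium produced, in joules or computations) are labels introduced here for illustration.

    ```latex
    V \;\approx\; \mathbb{E}[Q_h]\cdot H \;-\; \mathbb{E}[Q_d]\cdot D
    ```

    On this reading, everything that is neither hedonium nor dolorium is treated as contributing roughly zero to $V$, which is the sense in which the quote calls it "negligible".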

Given 3, a key question is what can we do to increase P(optimonium | ¬ AI doom)?

For example:

  • Averting AI-enabled human-power-grabs might increase P(optimonium | ¬ AI doom)
  • Averting premature lock-in and ensuring the von Neumann probes are launched deliberately would increase P(optimonium | ¬ AI doom), but what can we do about that?
  • Some people seem to think that having norms of being nice to LLMs is valuable for increasing P(optimonium | ¬ AI doom), but I'm skeptical and I haven't seen this written up.

(More precisely we should talk about expected fraction of resources that are optimonium rather than probability of optimonium but probability might be a fine approximation.)
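
To illustrate the distinction between "probability of optimonium" and "expected fraction of resources that are optimonium", here is a minimal Python sketch. Every number and name in it is a made-up assumption for illustration, not an estimate from this thread. It shows that the two quantities coincide when outcomes are all-or-nothing and drift apart once mixed outcomes carry some probability.

```python
# Hypothetical scenarios, conditional on no AI doom.
# Each entry is (probability, fraction of resources that end up as optimonium).
# All numbers are illustrative assumptions, not estimates.
scenarios = [
    (0.10, 1.00),  # nearly everything is optimonium
    (0.05, 0.50),  # mixed outcome: half the resources are optimonium
    (0.85, 0.00),  # something else entirely
]


def probability_of_optimonium(scenarios, threshold=0.99):
    """Probability mass on scenarios where almost all resources are optimonium."""
    return sum(p for p, frac in scenarios if frac >= threshold)


def expected_fraction(scenarios):
    """Probability-weighted average fraction of resources that are optimonium."""
    return sum(p * frac for p, frac in scenarios)


p_opt = probability_of_optimonium(scenarios)
e_frac = expected_fraction(scenarios)
print(f"P(optimonium | no doom)          = {p_opt:.3f}")
print(f"E[fraction optimonium | no doom] = {e_frac:.3f}")
```

With these made-up inputs the gap between the two numbers comes entirely from the mixed scenario, so probability is a reasonable stand-in only to the extent that mixed outcomes are unlikely.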

I think reducing the risk of misaligned AI takeover looks like a pretty good usage of people on the margin. My guess is that misaligned AI takeover typically doesn't result in extinction in the normal definition of the term (killing basically all humans within 100 years). (Maybe I think the chance of extinction-defined-normally given AI takeover is 1/3.)

Thus, for me, the bottom line of the debate statement comes down to whether misaligned AI takeover which doesn't result in extinction-defined-normally actually counts as extinction in the definition used in the post.

I don't feel like I understand how the definition you give of "a future with 0 value" handles cases like:

"Misaligned AIs take over and have preferences that on their own have ~0 value from our perspective. However, these AIs keep most humans alive out of a small amount of kindness and due to acausal trade. Additionally, lots of stuff happens in our lightcone which is good due to acausal trade (but this was paid for by some entity that shared our preferences). Despite this, misaligned AI takeover is actually somewhat worse (from a pure longtermist perspective) than life on earth being wiped out prior to this point, because aliens were about 50% likely to be able to colonize most of our lightcone (or misaligned AIs they create would do this colonization) and they share our preferences substantially more than the AIs do."

More generally, my current overall guess at a preference ordering is something like: control by a relatively enlightened human society that shares my moral perspectives (and has relatively distributed power) > human control where power is roughly as democratic as now > human dictator > humans are driven extinct but primates aren't (so probably other primates develop an intelligent civilization in like 10-100 million years) > earth is wiped out totally (no AIs and no chance for intelligent civilization to re-evolve) > misaligned AI takeover > earth is wiped out and there aren't aliens so nothing ever happens with resources in our lightcone > various s-risk scenarios.

What line here counts as "extinction"? Does moving from misaligned AI takeover to "human control where power is roughly as democratic as now" count as an anti extinction scenario?

I'm not sure whether to count AI takeover as extinction or just as a worse future - maybe I should define extinction as actually just literal extinction, and leave scenarios with very small populations out of the definition. Any thoughts on the best way to define it here? I agree it needs some refining. 

How about 'On the margin, work on reducing the chance of our extinction is the work that most increases the value of the future'?

As I see it, the main issue with the framing in this post is that the work to reduce the chances of extinction might be the exact same work as the work to increase EV conditional on survival. In particular, preventing AI takeover might be the most valuable work for both. In which case the question would be asking to compare the overall marginal value of those takeover-prevention actions with the overall marginal value of those same actions.

(At first glance it's an interesting coincidence for the same actions to help the most with both, but on reflection it's not that unusual for these to align. Being in a serious car crash is really bad, both because you might die and because it could make your life much worse if you survive. Similarly with serious illness. Or, for nations/cities/tribes throughout history, losing a war where you're conquered could lead to the conquerors killing you or doing other bad things to you. Avoiding something bad that might be fatal can be very valuable both for avoiding death and for the value conditional on survival.)

That's a really interesting solution - I'm a bit swamped today but I'll seriously consider this tomorrow - it might be a nice way to clarify things without changing the meaning of the statement for people who have already written posts. Cheers!

I think I'll stick with this current statement - partly because it's now been announced for a while so people may be relying on its specific implications for their essays, but also because this new formulation (to me) doesn't seem to avoid the problem you raise, that it isn't clear what your vote would be if you think the same type of work is recommended for both. Perhaps the solution to that issue is in footnote 3 on the current banner - if you think that the value from working on AI takeover is mostly from avoiding extinction, then you should vote agree. If you think it is from increasing the value of the future by another means (such as more democratic control of the future by humans), then you should vote disagree. 

Great news! 

> If there are other posts you think more people should read, please comment them below. I might highlight them during the debate week, or before. 

I am in the process of publishing a series of posts ("Evaluating the Existence Neutrality Hypothesis") related to the theme of the debate ("Extinction risks" VS "Alignment risks / Future value"). The series is about evaluating how to update on those questions given our best knowledge about potential space-faring civilizations in the universe. 

I will aim to publish several of the remaining posts during the debate week.

One key question for the debate is: what can we do / what are the best ways to "increas[e] the value of futures where we survive"?

My guess is it's better to spend most effort on identifying possible best ways to "increas[e] the value of futures where we survive" and arguing about how valuable they are, rather than arguing about "reducing the chance of our extinction [vs] increasing the value of futures where we survive" in the abstract.

I agree- this is what I mean by my clarification of the tractability point above. One of the biggest considerations for me personally in this debate is whether there are any interventions in the 'increasing the value of the future' field which are as robust in their value as extinction risk reduction. 

I really like that you've chosen this topic and think it's an important one! I wrote my MA Philosophy thesis on this (in 2019, now outdated).

On the margin[1], it is better to work on reducing the chance of our[2] extinction than increasing the value of futures where we survive

I want to flag that I disagree with this framing, as it's very anthropocentric. There are futures in which we go extinct but that are nevertheless highly valuable (happy sentient AI spreading via VNM probes). Perhaps more empirically relevant, I expect almost all effects to go via making the transition to superintelligence go well, and the most distinct action is focusing on digital sentience (which has little effect on extinction risk and much effect on the value of the future). 

Does footnote #2 on the debate statement cover this? "Our" and "We" are supposed to refer to "earth-originating intelligent life", so "happy sentient AI spreading via VNM probes" would be included. 

[edit- I'm very open to rephrasing it if not]

Ah yeah that seems fine then! "Life" is an imprecise term and I'd prefer "sentience" or "sentient beings" but maybe I'm overdoing it

PS- would it still be worth sharing the thesis, or some thoughts from it? You could claim late draft amnesty if you'd like to post it without editing it :)

I'm going to struggle to cast a meaningful vote on this, since I find 'existential risk' terminology as used in the OP more confusing than helpful, since e.g. it includes nonexistential considerations and in practice excludes non-extinction catastrophes from a discussion they should very much be in, in favour of work on the heuristical-but-insufficient grounds of focusing on events that have maximal extinction probability (i.e. AI). 

I've argued here that non-extinction catastrophes could be as or more valuable to work on than immediate extinction events, even if all we care about is the probability of very long-term survival. For this reason I actually find Scott's linked post extremely misleading, since it frames his priorities as 'existential' risk, then pushes people entirely towards working on extinction risk - and gives reasons that would apply as well to non-extinction GCRs. I gave some alternate terminology here, and while I don't want to insist on my own clunky suggestions, I wish serious discussions would be more precise.

Thanks! Any suggestions for making the clarification of extinction in this post more precise (while still being explainable)?

It's difficult if the format requires a 1D sliding scale. I think reasonable positions can be opposed on AI vs other GCRs vs infrastructure vs evidenced interventions, and future (if it exists) is default bad vs future is default good, and perhaps 'future generations should be morally discounted' vs not.

Ah yes I agree that the 1 dimensional slider doesn't represent anyone's entire opinion. But I also think it shouldn't - this is why debate week is also a week for writing posts, and we integrate comments with the banner. There are many considerations that could affect your vote, and that's great - that's (hopefully) why the week will be generative. 

According to my understanding the last two posts from the "Further reading" section rather represent disagreement with the proposed debate statement given their emphasis on s-risks.

Thanks for the close reading- wish I could say that had been a test. I'll edit it now :)

The debate sets aside a founding principle of EA. 

In the canonical story of the PlayPump, an inventor imagined the future benefit of his dual-use water-well playground. He failed. We learned that we waste money when we rely on how well we imagine our ideas will work. We learned that we need to build and measure the results.

Those of us who speculate on the efficacy of curbing extinction risk and those of us who speculate on improving future well-being are all in the same boat as the inventor of the PlayPump.

I don't think this is the greatest analogy, but I do agree that there are huge risks in imagination. I would say EA's failures (in my mind) in funding OpenAI and betting the house on alignment research could almost be considered "PlayPump-esque" examples: good ideas coming from the best intentions, but without past evidence as a rock to stand on.
