Two tools for rethinking existential risk

Arepo

Comments 14

Sorted by

New & upvoted

I think this is a very valuable project.

But this is still a combination of two questions, the latter of which longtermists have never, to my knowledge, considered probabilistically:^[3]
What is the probability that the event kills all living humans?
What effect does the event otherwise have on the probability that we eventually reach an interstellar/existentially secure state, ^[4] given the possibility of multiple civilisational collapses and ‘reboots’? (where the first reboot is the second civilisation)
3^{^}
The closest thing I know to such an attempt is Luisa Rodriguez’s post What is the likelihood that civilizational collapse would cause technological stagnation? (outdated research), in which she gives some specific probabilities of the chance of a preagricultural civilisation recovering industry based on a grid of extinction rates and scenarios which, after researching the subject, she found reasonably plausible. But this relates only to a single instance of trying to do this (on my reading, specifically the first time, since she imagines the North Antelope Rochelle Coal Mine still having reserves), and only progresses us approximately as far as early 19th century England. Also, per the title’s addendum, she now considers the conclusion too optimistic, but doesn’t feel comfortable giving a quantified update.

I also have not seen analyses of multiple reboots. But in terms of recovery from one loss of civilization, What We Owe the Future touches on it some. Also, my original cost-effectiveness analysis for the long-term future for nuclear war explicitly modeled recovery from collapse. However, then I realized that there were other mechanisms to long-term future impact, such as making global totalitarianism more likely or resulting in worse values in AGI, so I moved to reduction in long-term future value associated with nuclear war or other catastrophes. I like that you are breaking this up into more terms and more reboots, because I think that will result in more accurate modeling.

Arepo

Thanks for the kind words, David. And apologies - I'd forgotten you'd published those explicit estimates. I'll edit them in to the OP.

My memory of WWOtF is that Will talks about the process, but other than giving a quick estimate of '90% chance we recover without coal, 95% chance with' he doesn't do as much quantifying as you and Luisa.

Also Lewis Dartnell talked about the process extensively in The Knowledge, but I don't think he gives any estimate at all about probabilities (the closest I could find was in an essay for Aeon where he opined that 'an industrial revolution without coal would be, at a minimum, very difficult').

CB🔸

Thanks for the calculator.

I was wondering about the welfare part of the equation, and it's not obvious how people get their welfare estimates in the calculator, from what I see in the post.

Are we talking about the welfare of just humans ? Animals ? (Farmed or wild animals)? Artificial sentience ? How do we reconcile all of these when we're not sure today whether global welfare is net positive ?

Of course, this depends on very important questions that are hard to assess. What are the consequences of bringing wild animals suffering to our planet? Is factory farming going to continue for a long time, especially as that long-termists are very optimistic about technology replacing all forms of animal farming, where it's not so obvious? Are artificial sentience going to have lives worth living ? How are we going to impact animals on other planets ?

So overall, what should we include in the 'welfare' part of the calculator?

Arepo

Hey Corentin,

The calculators are intentionally silent on the welfare side, on the thought that in practice it's much easier to treat as a mostly independent question. That's not to say it actually is independent, and ideally I would like the output to include more information about what the pathways to either extinction or an interstellar state, so that people can do some further function on the output. I do think it's reasonable, even on a totalising view, to prioritise improving future welfare conditional on it existing and largely ignoring the question of whether it will - but that's not a question the calculators can help with except inasmuch as you condition on the pathway.

Even if they gave pathways, they would be agnostic on whose welfare qualified. Personally I'm interested in maximising total valence (I have an old essay still waiting for its conclusion on the subject), so every sentient being's mental state 'counts', but you could use these with a different perspective in mind. Primarily empirical questions about e.g. the duration of factory farming, and animal suffering in terraformed systems seem like they'd need their own research projects.

Ruben Dieleman 🔸

Woah, this goes way over my head, so I'm gonna keep on re-reading it until I understand it (hopefully). Thanks for this post!

Arepo

I'm happy to talk you through using it if you're finding it confusing.

If you (or anyone else) reading this wants to catch me for some support, I'm on the EA Gather Town as much as possible (albeit currently in New Zealand time), so you can log in there and ping me :)

Deborah W.A. Foulkes

Can't find the EA Gather Town via this link or on the Gather app. Can you give its exact handle/label? Thanks.

Arepo

Hm, the link works ok for me. What happens when you open it? It can be a bit shonky on mobile phones - maybe try using it on a laptop/desktop if you haven't.

It's called 'EA coworking and lounge', if that helps.

Vasco Grilo🔸

Thanks for all your work on this series, and sharing a draft of the post in advance! I commented around 1.5 years ago that I thought it was a pretty valuable series, but I am now much less optimistic:

I think the annual risk of human extinction not involving transformative AI (TAI) is astronomically low. I estimated 5.93*10^-12 for nuclear wars, 2.20*10^-14 for asteroids and comets, 3.38*10^-14 for supervolcanoes, a prior of 6.36*10^-14 for wars, and a prior of 4.35*10^-15 for terrorist attacks.
I believe there is a high chance of full recovery given human extinction not involving TAI (relatedly). For example, I think the chance of not fully recovering would only be 0.0513 % (= e^(-10^9/(132*10^6))) for a repetition of the last mass extinction 66 M years ago, the Cretaceous–Paleogene extinction event. I got my estimate assuming:
- An exponential distribution with a mean of 132 M years (= 66*10^6*2) represents the time between i) human extinction in such catastrophe and ii) the evolution of an intelligent sentient species after such a catastrophe. I supposed this on the basis that:
  - An exponential distribution with a mean of 66 M years describes the time between:
    - 2 consecutive such catastrophes.
    - i) and ii) if there are no such catastrophes.
  - Given the above, i) and ii) are equally likely. So the probability of an intelligent sentient species evolving after human extinction in such a catastrophe is 50 % (= 1/2).
  - Consequently, one should expect the time between i) and ii) to be 2 times (= 1/0.50) as long as that if there were no such catastrophes.
- An intelligent sentient species has 1 billion years to evolve before the Earth becomes habitable.
I would say one cannot assume the value of the future becomes negligible following human extinction involving TAI (relatedly).

You assume the value of the future given extinction is negligible. In order for this to make sense to me, I have to interpret the extinction as not involving TAI, and involving not only the loss of all humans, but also a significant part of our past evolutionary path (to ensure a low chance of full recovery). I suppose the annual probability of such extinction is lower than 10^-8, and therefore do not see tail risk mitigation as a specially promising path to avoid it. I guess indirect ways of decreasing extinction risk like boosting economic growth (relatedly) become more attractive for lower levels of risk.

Arepo

Hey Vasco, thanks for the in-depth reply, and thanks again for trawling over this behemoth :)

Let me take these points in order:

I think the annual risk of human extinction not involving transformative AI (TAI) is astronomically low.

I'm highly sceptical of point probability estimates for events for which we have virtually no information - that's exactly why I made these tools. Per Dan Schwarz's recent post, it seems much more important to me to give an interactive model into which people can put their own credences, so that we can then debate the input rather than the output.

I'm now reading through your nuclear war article, and have some pushback, but I don't want to get them sidetracked into it here (I'll try and post them as a comment there, which is probably more helpful anyway), and I don't think they'd increase my credence enough to materially affect your point.

More importantly, much of the point of the calculators is that one can still have very low credence of direct extinction from any of the sources you mentioned and still believe that such events substantially reduce the chance of us becoming interstellar by two basic mechanisms:

Reverting us to states where we might have to spend millennia or longer at standard-or-slightly-increased background risk of natural disasters, diseases, being outcompeted by other species, etc.
Preferentially consuming resources to the point where future civilisations might have to spend centuries or millennia in a time of perils because e.g. all fossil fuels and fissile materials are exhausted, all rock phosphorus has been tossed into the ocean, virtually all rare earth elements, platinum group metals and perhaps even most copper has corroded into unusable forms. Think having destructive biotechnology potential for thousands of years before we can even travel to the moon again, because it takes a national economy years to get enough money just to pay for the insulation of the internal wiring. Even if it remains theoretically possible to develop multiplanetary/interstellar technology from this state, this might mean that we have to go through such extended periods of high technological risk to get there that in practice the odds of successfully navigating such an extended time of perils - even if we had hundreds of tries - rapidly approach 0.

I believe there is a high chance of full recovery given human extinction not involving TAI

If by 'recovery' you mean 'reaching modern technology' this is compatible with what I've just written above - it might turn out to be relatively trivial to rereach modern technology, but increasingly implausible that we can ever progress beyond it.

If I understand you right, you're getting most of your confidence in recovery from situations where other intelligent species evolve? If so then this scenario seems like something longtermists shouldn't view too positively, though we could still use these calculators plus some other tools to model it: -

as with shorter term recoveries, if it happened to us, it could happen again to the new species, so one would need a similar cyclic model to look at the overall effect on probability of success. You could probably just repurpose this model, mentally replacing 'preindustrial' with 'extinct but evolving new intelligent species'.
on the timescale that could take, we have to think seriously about the amount of loss of future value due to expansion of the universe. I think Anders Sandberg is working on modelling this.
If one is concerned that our current civilisation's values are contingent and counterfactually positive, it seems still less likely that a new species would share them.
intelligence might just not re-evolve to the degree needed to develop advanced technology. There are a number of theories about why we developed it to such a degree - despite abstract reasoning having virtually no survival value to hunter gatherer or even agricultural societies - and if something like e.g. the sexual selection theory is correct, it might have been a one-off evolutionary accident, never to be repeated.
even after tens of millions of years, most of the resources I described above would still be gone if they depleted them. Fossil fuels accumulated throughout the period of complex life on this planet which we're about halfway through - so we could expect at best to have approximately the same number once more, if no civilisation ever used any in the meantime. Fissile materials will never recover, and it's hard to see how eg. a thin layer of rock phosphorus scattered across the bottom of the ocean would ever return to an economically viable state, even on geological timelines. So the new species might evolve only to find themselves trapped on the surface.

An intelligent sentient species has 1 billion years to evolve before the Earth becomes habitable.

Based on the dramatic changes to the climate that precede the oceans evaporating I suspect Earth will become unhabitable to intelligent life in more like 100million to 500million years. If we're relying on reevolution of intelligent life, that might meaningfully cut down the number of chances we get.

You assume the value of the future given extinction is negligible. In order for this to make sense to me, I have to interpret the extinction as not involving TAI

I explicitly aimed to capture this concern in the OP description 'human descendants (or whatever class of life we think has value)'. If you think TAI replacing humans would be as good or better, you can treat scenarios where it does so as transitioning directly from whatever state we'd be in at the time to an interstellar/existentially secure state.

Fwiw in the Matthew Barnett post you linked to, I replied that I strongly support that position philosophically - I think my take was even more pro-conscious-AI than his.

Vasco Grilo🔸

Thanks for the follow up! I strongly upvoted it.

I'm highly sceptical of point probability estimates for events for which we have virtually no information - that's exactly why I made these tools.

I mentioned point/mean probability estimates, but my upper bounds (e.g. 90th percentile) are quite close, as they are strongly limited by the means. For example, if one's mean probability is 10^-10, the 90th percentile probability cannot be higher than 10^-9, otherwise the mean probability would be higher than 10^-10 (= (1 - 0.90)*10^-9), which is the mean. So my point remains as long as you think my point/mean estimates are reasonable.

Per Dan Schwarz's recent post, it seems much more important to me to give an interactive model into which people can put their own credences, so that we can then debate the input rather than the output.

Makes sense. I liked that post. I think my comment was probably overly crictical, and not related specifically to your series. I was not clear, but I meant to point to the greater value of using standard cost-effectiveness analyses (relative to using a model like yours) given my current empirical beliefs (astronomically low non-TAI extinction risk).

More importantly, much of the point of the calculators is that one can still have very low credence of direct extinction from any of the sources you mentioned and still believe that such events substantially reduce the chance of us becoming interstellar by two basic mechanisms

Fair! I suspect the number of lives saved, maybe weighted by the reciprocal of the population size, would still be a good proxy for the benefits of interventions affecting civilisational collapse. When I tried to look into this quantitatively in the context of nuclear war, improving worst case outcomes did not appear to be the driver of the overall expected value. So I am guessing using standard cost-effectiveness analyses, based on a metric like lives saved per dollar, would continue to be a fair way of assessing interventions.

In any case, I assume your model would still be useful for people with different views!

If by 'recovery' you mean 'reaching modern technology' this is compatible with what I've just written above - it might turn out to be relatively trivial to rereach modern technology, but increasingly implausible that we can ever progress beyond it.

I meant full recovery in the sense of reaching the same state we are in now, with roughly the same chances of becoming a benevolent interstellar civilisation going forward.

If I understand you right, you're getting most of your confidence in recovery from situations where other intelligent species evolve?

For my estimate of a probability of 0.0513 % of not fully recovering, yes, because I was assuming human extinction in my calculation. If the disaster is less severe, my probability of not fully recovering would be even lower.

as with shorter term recoveries, if it happened to us, it could happen again to the new species

If one thinks the probability of extinction or permanent collapse without TAI is astronomically low (as I do), the probability of a double catastrophe is astronomically low to the power of 2, i.e. it presents negligible risk. So I believe a single catastrophe has to be somewhat plausible for the possibility of further catastrophes to matter.

on the timescale that could take, we have to think seriously about the amount of loss of future value due to expansion of the universe. I think Anders Sandberg is working on modelling this.

I think considerations related the astronomical waste argument are applicable not only to saving lives in catastrophes, but also in normal time. To bring everything into the same framework in a simple way, I would run a standard cost-effectiveness analysis, but weighting saving lives by a function of the population size (e.g. 1/"population size" as I suggested above).

If one is concerned that our current civilisation's values are contingent and counterfactually positive, it seems still less likely that a new species would share them.

On priors, I would say one should expect the values of a new similarly capable species to be as good as those of humans.

intelligence might just not re-evolve to the degree needed to develop advanced technology. There are a number of theories about why we developed it to such a degree - despite abstract reasoning having virtually no survival value to hunter gatherer or even agricultural societies - and if something like e.g. the sexual selection theory is correct, it might have been a one-off evolutionary accident, never to be repeated.

I imagine different theories make significantly different predictions about the difficulty of going from e.g. monkeys to humans, but I have the impression there is often little data to validate them, and therefore think significant weight should be given to a prior simply informed by how long a given transition took. This is why I got my estimate for the probability of not fully recovering relying on the time since the last mass extinction.

even after tens of millions of years, most of the resources I described above would still be gone if they depleted them. Fossil fuels accumulated throughout the period of complex life on this planet which we're about halfway through - so we could expect at best to have approximately the same number once more, if no civilisation ever used any in the meantime. Fissile materials will never recover, and it's hard to see how eg. a thin layer of rock phosphorus scattered across the bottom of the ocean would ever return to an economically viable state, even on geological timelines. So the new species might evolve only to find themselves trapped on the surface.

Even assuming we could only recover with pretty high likelihood once, we would need 2 events with astronomically low chances to go from where we are to extinction or permanent collapse. So the overall risk of these would be negligible if one agrees with my estimates. At the same time, I think you are raising great points. I do not think they have much strenght, but this is just because of my view that catastrophes plausibly leading to extinction or global collapse are very unlikely.

I explicitly aimed to capture this concern in the OP description 'human descendants (or whatever class of life we think has value)'. If you think TAI replacing humans would be as good or better, you can treat scenarios where it does so as transitioning directly from whatever state we'd be in at the time to an interstellar/existentially secure state.
Fwiw in the Matthew Barnett post you linked to, I replied that I strongly support that position philosophically - I think my take was even more pro-conscious-AI than his.

Makes sense. I think human extinction caused by TAI would be bad if it happened in the next few years, as I suppose there would not be enough time to plan a good transition in this case. Nevertheless, in a future dominated by sentient benevolent advanced AIs, human extinction would not be obviously bad.

Arepo

Yeah, it sounds like this might not be appropriate for someone with your credences, though I'm confused by what you say here:

I mentioned point/mean probability estimates, but my upper bounds (e.g. 90th percentile) are quite close, as they are strongly limited by the means. For example, if one's mean probability is 10^-10, the 90th percentile probability cannot be higher than 10^-9, otherwise the mean probability would be higher than 10^-10 (= (1 - 0.90)*10^-9), which is the mean. So my point remains as long as you think my point/mean estimates are reasonable.

I'm not sure what you mean by this. What are you taking the mean of, and which type of mean, and why? It sounds like maybe you're talking about the arithmetic mean? If so that isn't how I would think about unknown probabilities fwiw. IMO it seems more appropriate to use a geometric mean to express this kind of uncertainty, or explicitly model the distribution of possible probabilities. I don't think either approach should limit your high-end credences.

Makes sense. I liked that post. I think my comment was probably overly crictical, and not related specifically to your series. I was not clear, but I meant to point to the greater value of using standard cost-effectiveness analyses (relative to using a model like yours) given my current empirical beliefs (astronomically low non-TAI extinction risk).

Yeah, fair enough :)

If one thinks the probability of extinction or permanent collapse without TAI is astronomically low (as I do)

Have you written somewhere about why you think permanent collapse is so unlikely? The more I think about it, the higher my credence seems to get :\

I have the impression there is often little data to validate them, and therefore think significant weight should be given to a prior simply informed by how long a given transition took.

I'm not saying the sexual selection theory is strongly likely to be correct. But it seems to be taken seriously by evolutionary psychologists, and if you're finding that other theories of human intelligence give ultra-high credence of a new species evolving, it seems like that credence should be substantially lowered by even a modest belief in the plausibility of such theories.

Vasco Grilo🔸

What are you taking the mean of, and which type of mean, and why? It sounds like maybe you're talking about the arithmetic mean?

Yes, I was referring to the arithmetic mean of a probability distribution. To illustrate, if I thought the probability of a given event was uniformly distributed between 0 and 1, the mean (best guess) probability would be 50 % (= (0 + 1)/2).

IMO it seems more appropriate to use a geometric mean to express this kind of uncertainty, or explicitly model the distribution of possible probabilities.

I agree the median, geometric mean, or geometric mean of odds are usually better than the mean to aggregate forecasts^[1]. However, if we aggregated multiple probability distributions from various models/forecasters, we would end up with a final probability distribution, and I am saying our final point estimate corresponds to the mean of this distribution. Jaime Sevilla illustrated this here.

I don't think either approach should limit your high-end credences.

Maybe it helps to think about this in the context of a distribution which is not over a probability. If we have a distribution over possible profits, and our expected profit is 100 $, it cannot be the case that the 90th percentile profit is 1 M$, because in this case the expected profit would be at least 100 k$ (= (1 - 0.90)*1*10^6), which is much larger than 100 $.

You may want to check Joe Carlsmith's thoughts on this topic in the context of AI risk.

Have you written somewhere about why you think permanent collapse is so unlikely? The more I think about it, the higher my credence seems to get :\

No, at least not in any depth. I think permanent collapse would require very large population and infrastructure losses, but I see these as very unlikely, at least in the absence of TAI. I estimated a probability of 3.29*10^-6 of the climatic effects of nuclear war before 2050 killing 50 % of the global population (based on the distribution I defined here for the famine death rate). Pandemics would not directly cause infrastructure loss. Indirectly, there could be infrastructure loss due people stopping maintenance activities out of fear of being infected, but I guess this requires a level of lethality which makes the pandemic very unlikely.

Besides more specific considerations like the above, I have consistently ended up arriving to tail risk estimates much lower than canonical ones from the effective altruism community. So, instead of regarding these as a prior as I used to do, now I immediately start from a lower prior, as I should not expect by risk estimates to go down/up^[2]. For context on me arriving to lower tail risk estimates, you can check the posts I linked at the start of my 1st comment. Here are 2 concrete examples I discussed elsewhere:

Luisa Rodriguez's analyses, which are arguably somewhat canonical in the effective altruism community, imply 630 M expected deaths before 2050 from nuclear wars between the United States and Russia, whereas I estimated just 2 % of that.
Denkenberger 2022 implies the value of the future would decrease by 12.0 % given a 10 % agricultural shortfall, which corresponds to an injection of soot into the stratosphere of around 5 Tg^[3], whereas I think the longterm impact of this would be negligible. Even given human extinction, I guess the value of the future would only decrease by 0.0513 % (see rough calculation in my 1st comment).

Relatedly, my extinction risk estimates are much lower than Toby Ord's existential risk estimates given in The Precipice.

^{^}
I aggregated probabilities using the median to estimate my prior extinction risk for wars and terrorist attacks, and using the geometric mean to obtain my nuclear war extinction risk.
^{^}
If I expected my best guess to go up/down, I should just update my best guess now to the value I expect it will converge to.
^{^}
Xia 2022 predicts a shortfall of 7.0 % for 5 Tg without adaptation (see last row of Table S2).

SummaryBot

Executive summary: Two new calculators allow longtermists to explicitly model and compare the value of reducing existential risk versus the value of mitigating lesser catastrophes, based on the user's assumptions about civilizational trajectories and prospects for recovery.

Key points:

The simple calculator uses a Markov chain model of civilizational states to estimate the probability of humanity eventually becoming interstellar based on user-specified transition probabilities between states.
The full calculator allows more granular modeling of transitions within and between pre-industrial, industrial, and multi-planetary civilizations, using customizable functions and parameters.
Users can compare the expected value loss from extinction to the value loss from lesser catastrophes, and model the impact of specific interventions or events.
The author provides example outputs showing how results can vary significantly based on optimistic vs. pessimistic assumptions about recovery prospects.
Limitations include the lack of explicit modeling of AI risk, long runtimes for the full calculator, and the need for better tooling for counterfactual exploration. The author invites feedback and contributions.
Longtermists are encouraged to use the calculators to scrutinize common assumptions and share their results to aggregate different perspectives.

This comment was auto-generated by the EA Forum Team. Feel free to point out issues with this summary by replying to the comment, and contact us if you have feedback.

Comments

More from the author

239

Revisiting EA's media policy

Arepo·3y ago·8m read

The 'community' tag is problematic

Arepo·10mo ago·2m read

Fruit-picking as an existential risk

Arepo·9mo ago·13m read

Curated and popular this week

Counting animals: Stable population size is not equivalent to priority level

abrahamrowe, mal_graham🔸·4d ago·Curated 1d ago·16m read

AI Use Note: Main body text entirely human written. Claude (Opus 4.8) helped develop models of animal life histories in the appendix. Cross-posted from Good Structures. Executive Summary * Animal advocates sometimes make claims like “there are X of this animal...

149

Let's taboo the V-word

lincolnq·4d ago·8m read

“How long have you been v*g*n?” This is one of the most common icebreakers at animal protection events. It’s a baseline assumption, and it mostly holds true: if you’re out advocating for animals not to be tortured or abused, realistically these days you are v**n, or close. And it makes for good conversation. It seems fairly safe to assume when you meet strangers. But this assumption is hurting the movement in a way which we don’t always notice: someone new comes into the sp...

Spiro: an update 2.5 years on and a fundraising ask for expansion

Habiba Banu·2d ago·6m read

Summary Back in November 2023 I posted here to launch Spiro and raise our first $198k. Two and a half years later this is an update and a fundraiser for the next step. The short version: we've now reached over-5,900 people with TB preventive medicine, including over 3,000 children under five years old. Our early results have held up well an...

Recent opportunities to take action

EA Organisation Updates thread: July 2026

Dane Valerie·3d ago·1m read

Help us launch AI safety university groups by referring potential founders

Jason Chin🔸·15h ago·4m read

Save the date: Swiss AI Safety Days 2026 (7-8 November, ETH Zurich)

Andre Santos 🔸, patrickwidmann, mariuswenk·17h ago·1m read

Vasco Grilo🔸

Thanks for the follow up! I strongly upvoted it.

I'm highly sceptical of point probability estimates for events for which we have virtually no information - that's exactly why I made these tools.

Per Dan Schwarz's recent post, it seems much more important to me to give an interactive model into which people can put their own credences, so that we can then debate the input rather than the output.

More importantly, much of the point of the calculators is that one can still have very low credence of direct extinction from any of the sources you mentioned and still believe that such events substantially reduce the chance of us becoming interstellar by two basic mechanisms

In any case, I assume your model would still be useful for people with different views!

If by 'recovery' you mean 'reaching modern technology' this is compatible with what I've just written above - it might turn out to be relatively trivial to rereach modern technology, but increasingly implausible that we can ever progress beyond it.

I meant full recovery in the sense of reaching the same state we are in now, with roughly the same chances of becoming a benevolent interstellar civilisation going forward.

If I understand you right, you're getting most of your confidence in recovery from situations where other intelligent species evolve?

as with shorter term recoveries, if it happened to us, it could happen again to the new species

on the timescale that could take, we have to think seriously about the amount of loss of future value due to expansion of the universe. I think Anders Sandberg is working on modelling this.

If one is concerned that our current civilisation's values are contingent and counterfactually positive, it seems still less likely that a new species would share them.

On priors, I would say one should expect the values of a new similarly capable species to be as good as those of humans.

intelligence might just not re-evolve to the degree needed to develop advanced technology. There are a number of theories about why we developed it to such a degree - despite abstract reasoning having virtually no survival value to hunter gatherer or even agricultural societies - and if something like e.g. the sexual selection theory is correct, it might have been a one-off evolutionary accident, never to be repeated.

even after tens of millions of years, most of the resources I described above would still be gone if they depleted them. Fossil fuels accumulated throughout the period of complex life on this planet which we're about halfway through - so we could expect at best to have approximately the same number once more, if no civilisation ever used any in the meantime. Fissile materials will never recover, and it's hard to see how eg. a thin layer of rock phosphorus scattered across the bottom of the ocean would ever return to an economically viable state, even on geological timelines. So the new species might evolve only to find themselves trapped on the surface.

I explicitly aimed to capture this concern in the OP description 'human descendants (or whatever class of life we think has value)'. If you think TAI replacing humans would be as good or better, you can treat scenarios where it does so as transitioning directly from whatever state we'd be in at the time to an interstellar/existentially secure state.
Fwiw in the Matthew Barnett post you linked to, I replied that I strongly support that position philosophically - I think my take was even more pro-conscious-AI than his.

^{^}

I aggregated probabilities using the median to estimate my prior extinction risk for wars and terrorist attacks, and using the geometric mean to obtain my nuclear war extinction risk.

^{^}

If I expected my best guess to go up/down, I should just update my best guess now to the value I expect it will converge to.

^{^}

Xia 2022 predicts a shortfall of 7.0 % for 5 Tg without adaptation (see last row of Table S2).

^{^}

While I don’t want to sidetrack the main discussion, it might be worth a tangent into the two distinct moral reasons for pursuing such an intergalactic future. If you already lean towards this view, you can skip this extended footnote.

The first is that if we assume a totalising population axiology (the normal basis for longtermism), for whatever it is we value, more is more. That is, no matter how good we could make things on Earth - even if we could eliminate suffering and almost perfectly convert resources into whatever form we consider most valuable - we can presumably make them comparably good for life elsewhere. Then we can get vastly more of that goodness by expanding into space (back of the envelope: access to ~ $10^{30}$ times more rocky matter over the course of ~ $10^{14}$ years rather than ~ $10^{9}$ years would give us ~ $10^{35}$ times more of whatever we value; optimistically, we might reach numbers that would dwarf even that.).

The second reason, which doesn’t require a totalising axiology, is existential security. I think a naive but reasonable calculation is to treat the destruction of life in each settlement as at least somewhat independent, more so the further apart they are. That would make extinction risk in such a state some kind of exponential decay function of number of self-sustaining settlements, such that the probability of extinction might be $a (1 - b)^{(p - 1)}$ , where a is some constant or function representing the risk of a single-planet civilisation going extinct, b is some decay rate such that $0 <= b <= 1 / 2$ (where 1/2 implies the probabilities of each settlement going extinct are completely independent) and p is the number of self-sustaining settlements in your multiplanetary civilisation.

Three counterpoints to the latter argument are

* Aligned AGI might fix all our problems and make us existentially secure without needing the security of an interstellar state
* An unaligned AGI would always be able to kill an arbitrarily large civilisation
* Some other universe-destroying event would always be possible to trigger

If you think the first of these is the only way to existential security, then in models that follow you can assign 0 probability of reaching a ‘multiplanetary’ state, and suppose that we will either transition directly from a time of perils to an existentially secure state or we won’t become existentially secure.

If you think either the second or third counterpoint is true, then this project and longtermism are both irrelevant - eventually the threat in question will kill everyone, and we should perhaps focus on the short term.

But if AGI doesn’t perenially remain an existential threat and no universe-destroying events are possible (see e.g. soft/no take-off scenarios linked in this thread), the value of this risk function would quickly approach 0.

For more on the existential security argument, see Christopher Lankhof’s Security Among the Stars.

This all presumes the future isn’t net negative in expectation. If you believe that it is, then this project is probably not relevant to you, unless the different pathways of how we might get there seem useful to explore. For example if you think our values might get worse (or better) following civilisational collapse, you might be able to plug this in to some model of that process.

^{^}

Strictly speaking the ‘probabilities’ discussed in this post are more like extrapolated credences, but since - in common with typical longtermist methodology - I apply these credences to probabilistic models, I refer to them as probabilities when it seems more intuitive to do so.

^{^}

The closest thing I know to such an attempt is Luisa Rodriguez’s post What is the likelihood that civilizational collapse would cause technological stagnation? (outdated research), in which she gives some specific probabilities of the chance of a preagricultural civilisation recovering industry based on a grid of extinction rates and scenarios which, after researching the subject, she found reasonably plausible. But this relates only to a single instance of trying to do this (on my reading, specifically the first time, since she imagines the North Antelope Rochelle Coal Mine still having reserves), and only progresses us approximately as far as early 19th century England. Also, per the title’s addendum, she now considers the conclusion too optimistic, but doesn’t feel comfortable giving a quantified update.

[Edit: David Denkenberger also published some relevant risk estimates of the probability of collapse and recovery in Should we be spending no less on alternate foods than AI now?]

^{^}

My inclination is to consider ‘interstellar’ to be close enough to ‘existentially secure’ as to be functionally equivalent; and since ‘interstellar’ is more specific it’s the term I’ve used in the code and elsewhere. But if you think existential security could be reached without becoming interstellar you can mentally replace ‘interstellar’ with ‘existential security’ throughout and set your parameters accordingly.

If you’re concerned that we might not be existentially secure even after we become interstellar, the calculators doesn’t explicitly address that concern - you could either represent it through either

* A high MAX_PLANETS constant in the code along with low probability of becoming ‘interstellar’ from relatively small numbers of planets; or

* Simply plugging in the output of the calculator to some further estimate of p(existential security | humanity becoming interstellar)

^{^}

This isn’t because I think welfare questions are unimportant; they’re just outside the current scope of this project (though a future version could incorporate such questions - see the limitations/development roadmap section, lower down).

^{^}

While I disagree with this view, I mean ‘fanaticism’ in the descriptive sense as used in a different context here , rather than as a pejorative. In this sense it means something like 'tendency to favour risk-neutral maximisation of some function': in this case the function being (1 - <probability of near-term extinction>).

^{^}

In practice longtermist grantmakers often split their grants across extinction-related and smaller catastrophes - e.g. OP and Founders Pledge both have ‘global catastrophic risk’ buckets to cover both. But it’s unclear to what extent they do this on longtermist grounds and to what extent it's justified by putting ‘smaller global catastrophes’ in a different bucket and do in fact only prioritise in terms of extinction risk.

If mainly the latter (different bucket), then the grantmaker is still effectively expressing Parfitian fanaticism. If mainly the former (giving to smaller catastrophes on longtermist grounds), then the grantmaker is tacitly expressing the sort of credences which these calculators explicitly deal with - and therefore hopefully make more accurate.

^{^}

On it oversimplifying number of future people: strictly speaking, as Bostrom observes, the expansion of the universe means we lose a huge amount of value for any substantive delay to our spreading our cosmological wings, but that huge loss looks negligible even over millions of years, compared to even relatively minor changes in the probability of eventually achieving V. This disparity is why longtermists generally focus on safety rather than speed.

On it oversimplifying average value per person: Unlike technological progress, it seems to me unlike technology, there’s no predictable patterns that let us imagine how values would evolve across multiple civilisations. This might simplify things in practice: you could imagine we have some level of moral development $M_{current}$ , and the average for other civilisations is some other level $M_{postapocalyptic}$ . Then you could convert ‘moral development’ into some per-person coefficient. Finally, we can let $P (V_{current})$ be our probability of achieving V without any regressions, and $P (V_{postapocalyptic})$ be our probability of achieving V after at least one regression. This would allow you to compare

$M_{current} \cdot V \cdot P (V_{current})$

$M_{postapocalyptic} \cdot V \cdot P (V_{postapocalyptic})$

^{^}

I somehow only discovered Arvo Morán’s How bad would human extinction be? while writing this post, and it relates closely to the question of how much V would change over time. I’m still digesting the overlap between our work, but I think that a future version of the full calculator could incorporate something like the branching process he describes in this section if treating V as a constant seems to be a simplification too far.

^{^}

To emphasise, this is assuming an abstract longtermist view. In practice we might lean towards averting whichever event which caused most expected short term suffering for many other reasons. This runs contrary to the ‘holy shit, x-risk’ philosophy of emphasising 0.1% probability outcomes in which literally everyone dies over outcomes in which (say) merely 50% of people die, which might be much more likely.

^{^}

I dropped the ‘survival’ state that I originally described two posts ago because a) Luisa’s estimates that suggested it had a very low risk of extinction, and b) my sense that an event that killed >99.99% and <100% of the population was an extremely narrow target, and therefore c) its overall effect on the outcome seemed tiny. I do wonder whether I should reinstate it as a ‘hunter gatherer’ state distinct from agricultural, as a couple of people have suggested.

^{^}

If you don't use the civilisation count for top level transitions, then the top level will be functionally more or less equivalent to the simple calculator (except for having a finite number of possible civilisations).

^{^}

The code is somewhat modular, though less so than I’d like. Let me know if you want some help with inserting your own functions. Or, if you’re interested in helping make the process easier, see Contribute/submit feature requests section below.

^{^}

This magnification can be different in different civilisational states - for example, you might think the increased resource scarcity would be a minor impediment in advancing from a preindustrial state through to a time of perils, a major impediment in a time of perils, and no impediment at all in a multiplanetary state.

One could also fairly easily put conditionals in the code to give special treatment to one or two reboots: for example, to express the view that the first time around we’d still have enough coal reserves to make a substantial difference, and that the second reboot would be much harder, but in reboots after that no other resource would deplete enough to make nearly as large a difference if it did.

In theory, these magnifications could either increase or decrease our prospects after a catastrophe. In my own simulations, though, I assume that the natural economics of each civilisation using up the most valuable resources available at the time will lead to prospects inevitably declining over most reboots.

Even so, in some cases, our prospects seem to improve slightly if we reboot to a second or third time of perils (imagine e.g. a scenario where an economy powered by renewables is comparably as easy to build as a fossil fuel economy - at least in early reboots before we deplete key minerals - and the detritus of the previous civilisations make things even easier by serving as blueprints for many key technological advancements, perhaps more so for benign than destructive technologies).

But to reach such a scenario we might have to get through some post-catastrophe state from which our prospects would be substantially worse - so one would have to be cautious advocating for apocalypse, even under such assumptions.

^{^}

This leads to the awkwardly titled notion of a ‘linear regression algorithm’ in this programme, which has nothing to do with the statistical model of the same name.

^{^}

Given our lack of historical context for this, one could instead use GDP of individual nations to inform this view if you thought they would give a more nuanced picture.

^{^}

An arguably simpler way to represent ‘small but not miniscule chance of regressing further through a time of perils’ might have been a zipf distribution - essentially a discrete-valued Pareto distribution. I will probably add this as an option at some point, but it turns out to produce similar enough values to an equivalently parametered exponential algorithm, as evidenced on this graph (blue is zipf, green is exponential), that I think it would have very little effect on the calculator’s output vs the exponential algorithm. And for my taste, we know so little about how far relatively minor shocks might cause us to unravel that the somewhat more pessimistic mean algorithm captures my intuition better.

^{^}

If you think a multiplanetary state is irrelevant (e.g. you think AI will lock us in one way or the other), you can set the maximum transition probability to that state as 0 and raise the probability of transitioning directly from a time of perils to an existential security/interstellar state above its default 0 value.

^{^}

Rocky mass in the form of planetoids isn’t strictly a hard limit. A very advanced civilisation could theoretically construct something like O’Neill cylinders - but by the time even those were self-sustaining, it seems likely that we would both have started colonising other solar systems and be about as existentially secure as we would be likely to get.

^{^}

Strictly speaking there are three further parameters in ./calculators/full_cache/runtime_constants.py, but these determine the level of approximation of the potentially infinite Markov chains, and you can ignore them unless you want to adjust the trade-off between precision and run-time.

^{^}

Using the terminology I suggested here.

^{^}

The Wikipedia page describes fossil fuels as having an EROI of ~30, nuclear energy around 75-80, and most renewables below 20 (with photovoltaics between 4-7). This seems to be a highly contentious topic, with at least one paper claiming that EROI is actually higher for renewables. This question is outside the scope of this work, but seems urgent for longtermists to answer if they believe in either a relatively low risk of direct extinction or a relatively high risk of smaller technological regression since it will heavily influence both the number of times we’d be able to re-reach a time-of-perils-level technology and the length of time we’d have to spend in the time of perils before reaching safer states if we did.

Corentin Biteau’s Great Energy Descent series imagines an extreme version of the pessimistic view, in which the decline in EROI is inexorable and irreversible. I assert no insight here, except that a very much weaker version of this claim could still suggest a fragile world, or a world which will be fragile unless/until certain precautions are taken.

^{^}

There’s also the possibility that AGI replaces us with some entity (such as itself) that has consciousness, or some other trait that the user might consider to have moral value. It’s up to you when you choose the parameters to decide whether to account for this possibility in parameters that increase the probability of ‘extinction’, of ‘existential security’, or (perhaps less plausibly) of ‘business-as-usual’.

	Probability of becoming interstellar	Expected value (and therefore cost of extinction) from this state	Cost of transitioning to state, i.e. difference in expected value from current time of perils	Cost of transitioning to state as proportion of the cost of extinction
Current time of perils	0.52	0.52V	0	0
Preindustrial state†	0.23	0.23V	0.29V	0.56
Industrial state†	0.27	0.27V	0.25V	0.48
Future time of perils†	0.28	0.28V	0.24V	0.47
Multiplanetary state†	0.78	0.78V	-0.26V	-0.50

	Probability of becoming interstellar		Expected value (and therefore cost of extinction) from this state		Cost of transitioning to state, i.e. difference in expected value from current time of perils		Cost of transitioning to state as proportion of the cost of extinction
P(essimistic) or O(ptimistic) scenario	P	O	P	O	P	O	P	O
Current time of perils	0.38	0.70	0.38V	0.70V	0	0	0	0
Preindustrial state in first reboot†	0.22	0.59	0.22V	0.59V	0.17V	0.12V	0.44	0.17
Industrial state in first reboot†	0.25	0.69	0.25V	0.69V	0.14V	0.01V	0.35	0.02
Multiplanetary state in our current civilisation†	0.57	0.72	0.57V	0.72V	-0.19V	-0.02V	-0.50	-0.03

	Cost of event (i.e. difference in expected value from current time of perils)		Cost of event as proportion of the cost of extinction
P(essimistic) or O(ptimistic) scenario	P	O	P	O
Non-nuclear great power conflict (based on opportunity cost = counterfactual technological regression narrative)†	$7.3 * 10^{- 3} V$	$1.0 * 10^{- 3} V$	$1.9 * 10^{- 2}$	$1.5 * 10^{- 3}$
Non-nuclear great power conflict (based on narrative of differentially accelerating progress of harmful technologies)†	$4.7 * 10^{- 3} V$	$6.6 * 10^{- 4} V$	$1.2 * 10^{- 2}$	$9.4 * 10^{- 4}$
Counterfactually having averted the Covid pandemic†	$- 1.3 * 10^{- 3} V$	$- 1.1 * 10^{- 4} V$	$- 3.3 * 10^{- 3}$	$- 1.6 * 10^{- 3}$
Counterfactually saving one person’s life†	$- 1.8 * 10^{- 12} V$	$- 1.4 * 10^{- 13} V$	$- 4.7 * 10^{- 12}$	$- 2.0 * 10^{- 13}$

	Probability of becoming interstellar	Expected value (and therefore cost of extinction) from this state	Cost of transitioning to state, i.e. difference in expected value from current time of perils	Cost of transitioning to state as proportion of the cost of extinction
Current time of perils	0.62	0.62V	0	0
Preindustrial state in first reboot†	0.45	0.45V	0.17V	0.27
Industrial state in first reboot†	0.57	0.57V	0.06V	0.09
Multiplanetary state in our current civilisation†	0.73	0.73V	-0.1V	-0.16

	Cost of event (i.e. difference in expected value from base estimate)	Cost of event as proportion of the cost of extinction
Non-nuclear great power conflict (based on opportunity cost = counterfactual technological regression narrative)†	$1.2 * 10^{- 3} V$	$1.9 x 10^{- 3}$
Non-nuclear great power conflict (based on narrative of differentially accelerating progress of harmful technologies)†	$7.5 * 10^{- 3} V$	$1.2 * 10^{- 2}$
Counterfactually having averted the Covid pandemic†	$1.3 * 10^{- 4} V$	$2.1 * 10^{- 4}$
Counterfactually saving one person’s life†	$- 8.9 * 10^{- 13} V$	$- 1.4 * 10^{- 12}$

Two tools for rethinking existential risk

Tl;dr

Introduction

Who are the calculators for, and what questions do they help answer?

But won’t these numbers be arbitrary?

Interpreting the calculators

How to use the simple calculator

Example of output from the simple calculator

How to use the full calculator

Choosing your parameters using Desmos

Milestone contractions and expansions^[21] from post-industrial states

Regressing within a post-industrial state

Milestone regressions and expansions from a multiplanetary state

Going extinct from a pre-perils state

Choosing non-graphical parameters

Examples of output from the full calculator

Examples 1 & 2: my pessimistic and optimistic scenarios

Example 3: David Denkenberger’s assessment

Limitations/development roadmap

1) Model uncertainty

2) AGI

3) Usability

4) More detailed output

5) Runtime

6) Manual function selection

7) Better options to explore present counterfactuality

8) Minimal automated testing 😔

Contribute/submit feature requests

Share your results!

Acknowledgements

Two tools for rethinking existential risk

Tl;dr

Introduction

Who are the calculators for, and what questions do they help answer?

But won’t these numbers be arbitrary?

Interpreting the calculators

How to use the simple calculator

Example of output from the simple calculator

How to use the full calculator

Choosing your parameters using Desmos

Milestone contractions and expansions[21] from post-industrial states

Regressing within a post-industrial state

Milestone regressions and expansions from a multiplanetary state

Going extinct from a pre-perils state

Choosing non-graphical parameters

Examples of output from the full calculator

Examples 1 & 2: my pessimistic and optimistic scenarios

Example 3: David Denkenberger’s assessment

Limitations/development roadmap

1) Model uncertainty

2) AGI

3) Usability

4) More detailed output

5) Runtime

6) Manual function selection

7) Better options to explore present counterfactuality

8) Minimal automated testing 😔

Contribute/submit feature requests

Share your results!

Acknowledgements

Milestone contractions and expansions^[21] from post-industrial states