Which existential risk cause should you focus on? The cause where you have the largest impact on decreasing total existential risk. That's not the same as working on the cause where you have the largest impact when seen in isolation.
Model
Suppose there are $n$ existential risks, the $k$th having probability $p_k$ of ending the world. For each cause $k$ you can reduce the probability of the world ending from that cause by $d_k$, but only if you spend your whole career doing it.
For instance, suppose the risks are AI, biorisk, and asteroids. They have associated probabilities $p_{\text{AI}} = 0.9$, $p_{\text{bio}} = 0.1$, and $p_{\text{ast}} = 0.01$.[1] How much could you decrease the probability of extinction for each cause? You're pretty good at deflecting asteroids and killing viruses escaping from labs, but not that good at making humans lovable for AIs. Your probability reductions are, say, $d_{\text{AI}} = 5 \cdot 10^{-8}$, $d_{\text{bio}} = 10^{-7}$, and $d_{\text{ast}} = 10^{-7}$.
Risk type | Probability ($p_k$) | Probability reduction ($d_k$)
---|---|---
AI | 0.9 | $5 \cdot 10^{-8}$
Biorisk | 0.1 | $10^{-7}$
Asteroids | 0.01 | $10^{-7}$
Which career should you choose? It sounds plausible that you should be agnostic between the biorisk and asteroid paths. That's where you'll reduce the probability of extinction the most, after all. But we should do a decision-theoretic analysis of the problem to make sure.
Let's use the utility function where the world surviving has utility $1$ and the world ceasing to exist has utility $0$. Let $x$ be a vector with $x_k = 1$ if you choose action $k$ and $x_k = 0$ otherwise. Then you ought to solve the total utility maximization problem

$$\max_x \prod_{k=1}^{n} (1 - p_k + d_k x_k) \quad \text{subject to} \quad \sum_{k=1}^{n} x_k = 1.$$

Why? Because you don't care which event causes extinction, only that it doesn't happen. And the total probability of no extinction equals $\prod_{k=1}^{n}(1 - p_k + d_k x_k)$.
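To make the optimization concrete, here's a minimal sketch in Python that evaluates $\prod_k (1 - p_k + d_k x_k)$ for each possible career, using the illustrative numbers from the table above:

```python
# Illustrative numbers from the example table above (made-up figures, not estimates).
risks = {
    "AI":        {"p": 0.9,  "d": 5e-8},
    "Biorisk":   {"p": 0.1,  "d": 1e-7},
    "Asteroids": {"p": 0.01, "d": 1e-7},
}

def survival_probability(chosen: str) -> float:
    """P(no extinction) = prod_k (1 - p_k + d_k * x_k), with x_k = 1 only for the chosen cause."""
    prob = 1.0
    for cause, r in risks.items():
        reduction = r["d"] if cause == chosen else 0.0
        prob *= 1 - r["p"] + reduction
    return prob

for cause in risks:
    print(f"{cause:10s} P(no extinction) = {survival_probability(cause):.12f}")
```

The printed values only differ around the eighth decimal place, which already hints at the "virtually indistinguishable" point further down.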
Anyway, we can show that

- the optimal action, i.e., career path, is the one with the highest $d_k/(1 - p_k)$;
- the multiplicative improvement you're causing by choosing action $k$ is $1 + d_k/(1 - p_k)$.
Proof
Define $q = \prod_{j=1}^{n}(1 - p_j)$, the probability of survival if you do nothing.
The utility when taking action $k$ equals
$$\prod_{j \neq k}(1 - p_j)\,(1 - p_k + d_k) = q\left(1 + \frac{d_k}{1 - p_k}\right),$$
which is clearly maximized in the $k$ that maximizes $d_k/(1 - p_k)$.
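Back in the example, we can sanity-check this numerically (same illustrative numbers as in the table): ranking by $d_k/(1 - p_k)$ picks out the same career as brute-force maximization of the survival probability.

```python
import math

# Same illustrative numbers as in the table above.
risks = {"AI": (0.9, 5e-8), "Biorisk": (0.1, 1e-7), "Asteroids": (0.01, 1e-7)}

def survival_probability(chosen):
    """prod_k (1 - p_k + d_k * x_k), with x_k = 1 only for the chosen cause."""
    return math.prod(
        1 - p + (d if cause == chosen else 0.0) for cause, (p, d) in risks.items()
    )

# The d_k / (1 - p_k) rule and brute-force maximization pick the same career.
assert max(risks, key=survival_probability) == max(
    risks, key=lambda c: risks[c][1] / (1 - risks[c][0])
)

for cause, (p, d) in risks.items():
    benefit = d / (1 - p)
    print(f"{cause:10s} d/(1-p) = {benefit:.3e}, improvement factor = {1 + benefit:.9f}")
```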
Consequences
You need to take both the probability of extinction by cause and your ability to reduce that probability into account when you choose your career. If, for instance, the probability of AI ending the world ($0.9$) is higher than the probability of biorisk ending the world ($0.1$), you need to be at least $9$ times better at biorisk than at AI risk (in terms of reducing the probability) to justify working on biorisk. In general, if the probability of bio extinction is $p_{\text{bio}}$ and the probability of AI extinction is $p_{\text{AI}}$, you need to be $(1 - p_{\text{bio}})/(1 - p_{\text{AI}})$ times better at biorisk to justify doing biorisk instead of AI.
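With the probabilities from the table, the break-even calculation is a one-liner (a small sketch of the same arithmetic):

```python
# Working on biorisk beats working on AI exactly when
#   d_bio / (1 - p_bio) > d_ai / (1 - p_ai),
# i.e. when d_bio > d_ai * (1 - p_bio) / (1 - p_ai).
p_ai, p_bio = 0.9, 0.1
threshold = (1 - p_bio) / (1 - p_ai)
print(f"You need to be at least {threshold:.0f}x better at reducing biorisk.")  # 9x
```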
We can expand the table above to include the benefit of taking each action:
Risk type | Probability ($p_k$) | Probability reduction ($d_k$) | Benefit ($d_k/(1 - p_k)$)
---|---|---|---
AI | 0.9 | $5 \cdot 10^{-8}$ | $5 \cdot 10^{-7}$
Biorisk | 0.1 | $10^{-7}$ | $1.11 \cdot 10^{-7}$
Asteroids | 0.01 | $10^{-7}$ | $1.01 \cdot 10^{-7}$
So, the AI safety career is better than the asteroid career. But not by a lot, as the multiplicative improvement $1 + d_k/(1 - p_k)$ is virtually indistinguishable from $1$. But of course, a higher number is a higher number, and they do add up. If we only care about the part $d_k/(1 - p_k)$, which might be reasonable, doing the AI career is about $5$ times better than the asteroids career. Which is more impressive.
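In code, that comparison looks like this (same illustrative $d_k$ values as in the table; the ratio is where the "about $5$ times" figure comes from):

```python
# Benefit d_k / (1 - p_k) per cause, using the illustrative table numbers.
benefit = {"AI": 5e-8 / 0.1, "Biorisk": 1e-7 / 0.9, "Asteroids": 1e-7 / 0.99}

print(f"absolute gap: {benefit['AI'] - benefit['Asteroids']:.2e}")  # about 4e-7
print(f"relative gap: {benefit['AI'] / benefit['Asteroids']:.2f}")  # about 5
```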
A model with uncertainty
So, you say you have epistemic uncertainty about the probabilities of extinction from each cause? Perhaps you think your choice of entering a field may remove the risk entirely, not merely reduce it by a small number? (E.g., either you solve AI alignment, or you don't.)
That turns out not to matter, for the problem doesn't change much when you allow for uncertainty. Provided $p_j$, $p_k$, and $d_k$ are independent whenever $j \neq k$, we find that

$$E\left[\prod_{k=1}^{n}(1 - p_k + d_k x_k)\right] = \prod_{k=1}^{n}(1 - \bar{p}_k + \bar{d}_k x_k),$$

where $\bar{p}_k = E[p_k]$ and $\bar{d}_k = E[d_k]$. The problem is maximized in the action with the highest $\bar{d}_k/(1 - \bar{p}_k)$.
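For example, if you encode your epistemic uncertainty as a distribution over each $p_k$ and $d_k$, only the means end up mattering for the decision. A small sketch, assuming some purely illustrative priors whose means match the earlier table:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 100_000

# One pair of epistemic distributions per cause; draws are independent across
# causes, matching the assumption in the text. Priors are illustrative only,
# chosen so their means match the earlier point estimates.
samples = {
    "AI":        (rng.beta(9, 1, n),  rng.exponential(5e-8, n)),   # E[p] = 0.9
    "Biorisk":   (rng.beta(1, 9, n),  rng.exponential(1e-7, n)),   # E[p] = 0.1
    "Asteroids": (rng.beta(1, 99, n), rng.exponential(1e-7, n)),   # E[p] = 0.01
}

# Rank careers by E[d_k] / (1 - E[p_k]): only the means of your distributions matter.
for cause, (p, d) in samples.items():
    print(f"{cause:10s} E[d]/(1 - E[p]) = {d.mean() / (1 - p.mean()):.3e}")
```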
Footnotes
- ^
These probabilities sum to more than $1$, but that doesn't matter for our purposes. Think about them as the probabilities of independent events, and the event "the world ends" as an event that happens if at least one of them occurs.
Thanks for writing this, Jonas. As someone much below the lesswrong average at math, I would be grateful for a clarification of this sentence:

> Provided $p_j$, $p_k$, and $d_k$ are independent whenever $j \neq k$, we find that …
What do $j$ and $k$ refer to here? Moreover, is it a reasonable assumption that the uncertainties of existential risks are independent? It seems to me that many uncertainties run across risk types, such as the chance of recovery after civilisational collapse.
$j$ and $k$ are indices for the causes. I wrote $j \neq k$ because you don't have to assume that $d_k$ and $p_k$ are independent for the math to work. But everything else will have to be independent.
Maybe the uncertainties shouldn't be independent, but often they will be. Our uncertainty about the probability of AI doom is probably not related to our uncertainty about the probability of pandemic doom, for instance.
I don't understand what $d_k$ is. What do you mean by a "probability reduction" of $5 \cdot 10^{-8}$?
If the probability of extinction by cause $k$ is $p_k$ and the probability reduction for that cause is $d_k$, the probability of extinction from that cause becomes $p_k - d_k$ if you choose to focus on cause $k$.