Hi All
My first post here - please be kind!
I've recently been listening to and reading various arguments around EA topics that rely heavily on reasoning about a loosely defined notion of "expected value". Coming from a mathematics background, and having no grounding at all in utilitarianism, I've been left wondering why expected value is used as the default decision rule for evaluating the (speculative, future) effects of EA actions even when the probabilities involved are small (and uncertain).
I have seen a few people here refer to Pascal's mugging, and I agree with that critique of EV. But it doesn't seem to me that you need to go anywhere near that far before things break down. Naively (?), I'd say that if you invest your resources in an action with a 1 in a million chance of saving a billion lives, and a 999,999 in a million chance of having no effect, then (the overwhelming majority of the time) you haven't done anything at all. It only works if you can do the action many, many times, and get an independent roll of your million-sided die each time. To take an uncontroversial example, if the lottery were super-generous and paid out £100 trillion, but I had to work for 40 years to get one ticket, the EV of doing that might be, say, £10 million. But I still wouldn't win. So I'd actually get nothing if I devoted my life to that. Right...?
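To make my worry concrete, here's a quick sketch of the arithmetic with made-up numbers (a 1-in-10-million ticket, which is roughly what my figures above imply):

```python
import random

# Made-up numbers roughly matching my lottery example:
# a 1-in-10-million ticket paying out £100 trillion.
P_WIN = 1e-7
PAYOUT = 100e12  # £100 trillion

# The expected value of a single ticket is £10 million...
print(f"Expected value of one ticket: £{P_WIN * PAYOUT:,.0f}")

# ...but a single ticket (one working lifetime) almost always pays nothing.
random.seed(0)
wins = sum(random.random() < P_WIN for _ in range(1000))
print(f"Simulated lifetimes that won anything: {wins} out of 1000")
```

The expected value looks great, but essentially every simulated lifetime ends with nothing - which is exactly the gap between "high EV" and "will actually happen" that I'm asking about.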
I'm hoping the community here might be able to point me to something I could read, or just tell me why this isn't a problem, and why I should be motivated by things with low probabilities of ever happening but high expected values.
If anyone feels like humoring me further, I also have some more basic/fundamental doubts, which I expect are just things I don't know about utilitarianism but which often seem to be taken for granted. At the risk of looking very stupid, here they are:
- Why does anyone think human happiness/wellbeing/flourishing/worth-living-ness is a numerical quantity? Based on subjective introspection about my own "utility", I can identify some different states I might be in, and a partial order of preference (I prefer feeling contented to being in pain, but I can't rank all my possible states), but the idea that I could be, say, 50% happier seems to have no meaning at all.
- If we grant the first thing - that we have numerical values associated with our wellbeing - why does anyone expect there to be a single summary statistic (like a sum total, or an average) that can be used to combine everyone's individual values to decide which of two possible worlds is better? There seems to be debate about whether "total utilitarianism" is right, or whether some other measure is better. But why should there be such a measure at all?
- In practice, even if there is such a statistic, how does one use it? It's hard to deny the obviousness of "two lives saved is better than one", but as soon as you try to compare unlike things it immediately feels much harder and non-obvious. How am I supposed to use "expected value" to actually compare, in the real world, certainly saving 3 lives with a 40% chance of hugely improving the educational level of ten children (assuming these are the end outcomes - I'm not talking about whether educating the kids saves more lives later or something)? And then people want to talk about and compare values for whole future worlds many years hence - wow.
I have a few more I'm less certain about, but I'll stop for now and see how this lands. Cheers for reading the above. I'll be very grateful for explanations of why I'm talking nonsense, if you have the time and inclination!
Hey Ben, I think these are pretty reasonable questions and do not make you look stupid.
On Pascal's mugging in particular, I would consider this somewhat informal answer: https://nintil.com/pascals-mugging/ Though honestly, I don't find this super satisfactory, and it is something that still bugs me.
Having said that, I don't think this line of reasoning is necessary for answering your more practical questions 1-3.
Utilitarianism (and Effective Altruism) doesn't require that there's some specific metaphysical construct that is numerical and corresponds to human happiness. The utilitarian claim is just that some degree of quantification is, in principle, possible. The EA claim is that attempting to carry out this quantification leads to good outcomes, even if it's not an exact science.
GiveWell painstakingly compiles numerical cost-effectiveness estimates, but goes on to state that they don't view these as being "literally true". These estimates still end up being useful for comparing one charity to another. You can read more about this thinking here: https://blog.givewell.org/2017/06/01/how-givewell-uses-cost-effectiveness-analyses/
In practice, GiveWell makes all sorts of tradeoffs to attempt to compare goods like "improving education", "saving lives" or "increasing income". Sometimes this involves directly asking the targeted populations about their preferences. You can read more about their approach here: https://www.givewell.org/how-we-work/our-criteria/cost-effectiveness/2019-moral-weights-research
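As a toy illustration of the mechanics (the weights below are invented placeholders, not GiveWell's actual figures), the basic move is to convert unlike outcomes into a common unit of value and then compare expected values - including for your example of 3 certain lives vs. a 40% chance of educating ten children:

```python
# Toy illustration only: these moral weights are invented, not GiveWell's.
# The point is just the mechanics of putting unlike outcomes on one scale.
WEIGHT_LIFE_SAVED = 100.0      # value units per life saved (assumed)
WEIGHT_EDUCATION_BOOST = 5.0   # value units per child's education hugely improved (assumed)

# Option A: certainly save 3 lives.
ev_a = 1.0 * 3 * WEIGHT_LIFE_SAVED        # = 300.0

# Option B: 40% chance of hugely improving education for 10 children.
ev_b = 0.4 * 10 * WEIGHT_EDUCATION_BOOST  # = 20.0

print(f"EV of option A: {ev_a}, EV of option B: {ev_b}")
```

Of course, all the hard work is hidden in how those weights get set - which is what the moral weights research linked above is about.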
Finally, in the case of existential risk, it's often not necessary to make these kinds of specific calculations at all. By one estimate, the Earth alone could support something like 10^16 human lives, and the universe something like 10^34 human life-years, or up to 10^56 "cybernetic human life-years". This is all very speculative, but the potential gains are so large that it doesn't matter if we're off by 40%, or 40x. https://en.wikipedia.org/wiki/Human_extinction#Ethics
Returning to the original point, you might ask whether work on x-risk is then just a case of Pascal's mugging. Toby Ord gives the odds of human extinction in the next century at around 1/6. That's a pretty huge chance. We're much less confident about the odds of EA preventing this risk, but it seems reasonable to think that they're some normal number, i.e. much higher than 10^-10. In that case, EA has huge expected value. Of course that might all seem like fuzzy reasoning, but I think there's a pretty good case to be made that our odds are not astronomically low. You can see one version of this argument here: https://slatestarcodex.com/2015/08/12/stop-adding-zeroes/
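To put rough numbers on that contrast (every figure below is illustrative rather than anyone's actual estimate, apart from Ord's 1/6 and the 10^16 lives figure mentioned above):

```python
# Illustrative arithmetic only; the probability that a given effort makes
# the difference is an assumption picked to be "normal"-sized rather than tiny.
p_extinction = 1 / 6        # Ord's rough estimate for the next century
p_effort_averts_it = 1e-3   # assumed, "normal"-sized probability
lives_at_stake = 1e16       # lower-bound future-lives figure cited above

print(f"EV with a normal-sized probability: {p_extinction * p_effort_averts_it * lives_at_stake:.1e} lives")

# Versus a genuinely Pascalian probability:
print(f"EV with a 1e-20 probability:        {p_extinction * 1e-20 * lives_at_stake:.1e} lives")
```

The first number is astronomically large and the second is negligible, which is why the whole argument turns on whether the relevant probabilities are "normal" or Pascalian.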
Thank you - this is all very interesting. I won't try to reply to all of it, but just thought I would respond to agree on your last point. I think x-risk is worth caring about precisely because the probability seems to be in the "actually might happen" range. (I don't believe at all that anyone knows it's 1/6 vs. 1/10 or 1/2, but Toby Ord doesn't claim to either, does he?) It's when you get to the "1 in a million but with a billion payoff" range that I start to get skeptical, because then the thing in question actually just won't happen, barring many plays of the game.