Hide table of contents

Disclosure: I’m a technical product manager at Metaculus. Views expressed are my own.

Thanks to Nate Morrison, Ryan Beck, Christian Williams for feedback on this post, and Leonard Smith for clarifying my thinking.

Recently Arvind Narayanan and Sayash Kapoor published “AI existential risk probabilities are too unreliable to inform policy,” which argues, among other things, that forecasts on long-run, unique, or rare events, for systems that are not “purely physical,” should not inform policy. There are a lot of incongruities and fictions in this piece, but here I’ll focus on some of the reasons why I believe assigning probabilities for these kinds of events is useful in general and can indeed be useful for policy.

I argue that assigning probabilities, even to fundamentally uncertain, “unique” events, can help us: 

  1. Communicate about what we expect to happen with more nuance
  2. Reason about things that don’t have clear reference classes
  3. Put our reputation where our mouth is

Keep in mind that a big part of the value of x-risk forecasting exercises for policymakers lies in identifying the precursors to catastrophe, the scenarios to avoid. The details of the distribution of p(doom) forecasts are less important. Instead, we should ask: What is a given policymaker’s “risk appetite?” What constitutes an unacceptably high risk? What can we do that has the best chance of averting disaster? There are signals everywhere for those with the eyes to see.

Communicate about what we expect to happen with more nuance

First, everyone makes claims about the future all the time, whether or not they think of these as forecasts and whether or not the claims relate to something unprecedented, like existential risks posed by AI. Often these claims use phrases like “there is no chance” or “it’s a sure thing” or, in Narayanan and Kapoor’s case, they describe a risk as “negligible.” Different people interpret words very differently, however, which hampers our ability to communicate what we expect to happen.

Source: Heuer, R. J. (1999). Psychology of intelligence analysis. Center for the Study of Intelligence. via SAS blog post by Rick Wicklin. (Heuer asked 23 NATO officers what they thought each of these phrases meant. The shaded areas are the ranges Sherman Kent (1964) proposed that the U.S. intelligence community use.)

Probabilities represent an improvement over that kind of wishy-washy, estimative language. They help us express our beliefs with more clarity and nuance so we can ultimately have better discussions about our policy preferences. For example, maybe you and I disagree about AI x-risk: Say you think the chances are on the order of one in a million, and I’m at 10%. We agree, however, that if there were a hugely disruptive cyberattack enabled by AI that also shut down the U.S. power grid for a day, we would become more concerned! Perhaps then your AI x-risk increases to one in a thousand (a 10x update), and I move slightly up to 10.2%. In this scenario, our updates reflect our underlying forecasts, where you don’t expect (1% probability) an AI-supported cyberattack to disrupt your life in this way, and I do (80% probability). The relative changes in our forecasts illuminate this difference.[1]

Reason about things that don’t have clear reference classes

Importantly, many of the most significant risks we face do not have clear reference classes. Take nuclear escalation risk, for example, which James Acton wrote about for Metaculus last year. Per Acton, forecasting nuclear weapon use today, given the one historical use of nuclear weapons in an armed conflict and the several close calls since then, is about as hard as predicting this year’s election “with no more data than the outcomes of the contests prior to the overhaul of the U.S. electoral system in 1804.”

But, Acton is quick to say, that’s no reason not to try! Part of the value is in the exercise. We might ask ourselves (to use Acton’s example): If Ukraine were to try to take back Crimea, how would that affect our odds that the conflict takes a turn for the nuclear? If Russia were to use nuclear weapons, how would that affect our odds that the war escalates further?

That said, the Forecasting Research Institute (my previous employer) performed a follow-up study to the Existential Risk Persuasion tournament (XPT) called the Adversarial Collaboration on AI Risk, which revealed that much of the disagreement about AI x-risk in particular boiled down to fundamental differences in worldview. That’s no huge surprise nor an indictment of assigning probabilities to x-risk. Our assessment of the risk is a function of the information we have, what we’ve observed about the world, and the meaning we’ve made out of it. So of course that will vary widely from person to person, even within-group. The wisdom of the crowd is like an ensemble model, combining many people’s estimates into one.

On AI x-risk, Narayanan and Kapoor cite the fact that “we might get different numbers if the [XPT] tournament were repeated today” as a reason to distrust x-risk numbers. On the contrary, this is a feature, not a bug! Part of forecasting’s value is tracking changes over time in the crowd’s risk assessment. On the Russo-Ukrainian War, Metaculus published the question Will Russia invade Ukrainian territory before 2023 on December 11, 2021. In the following weeks, Foreign Policy wrote “Past attacks suggest Moscow probably won’t move on Ukraine” (December 15). On January 28, Biden said it was a “distinct possibility” (January 28, 2022, BBC). Al Jazeera published “No, Russia Will Not Invade Ukraine” (February 9, 2022). How should one synthesize these bits of information? It’s not at all clear.

By contrast, by the time 100 forecasters had given their input, on December 15, the community prediction was 34%. On January 28, the community prediction was 45%. On February 11, the community prediction climbed to 70% overnight. The invasion happened on February 24th.

Will Russia invade Ukrainian territory before 2023? on Metaculus. Resolved "yes" on February 24, 2022.

When we’re estimating the risk of an extinction-causing asteroid impact (or another purely physical system) in the next hundred, thousand, ten thousand years, we can probably all agree on what base rate to look at, and furthermore, our risk estimate is unlikely to change very much as we gather more information. However, while a robust case for very unlikely events like extinction-causing asteroid impacts is all well and good, how should we allocate resources between potential risks such as anticipating (and, if possible, preventing) asteroid impacts vs averting climate disaster vs mitigating risks from AI? I argue that a robust case for a very unlikely event is less decision-relevant than a rich body of speculative cases for larger-scale risks like those posed by AI.

And as new evidence rolls in — this can be benchmarks surpassed, new legislation, developments in Taiwan — we update. In this way, AI x-risk is not so different from the risk of escalation in the Russo-Ukrainian war. Registering what we expect to happen is valuable, and so is registering how we would update if something were to happen, like China invading Taiwan. How would it change your assessment of AI x-risk if China seized control of chip manufacturing in Taiwan?[2]

Put our reputation where our mouth is

Yes, people’s forecasts on x-risk run the gamut of the probability scale, but that’s to be expected! There are a whole host of things at play here, not least of which is the probability that world governments intervene. We have much to gain by homing in on the key uncertainties so that we can work toward better benchmarks for these, agree on red-lines, and imagine scenarios. Further, there is value to tracking who is consistently wrong on shorter-run AI topics.

If someone is consistently surprised by the pace of AI progress, maybe we shouldn’t trust them when they say AI is all hype. Inasmuch as Metaculus is consistently “right” (or righter than others) about the pace of AI progress, it seems like a good indicator that the community collectively understands the mechanisms at play here pretty well… i.e. that their models are pretty good!

Reference classes for AI x-risk like the extinction of less-intelligent species might seem outlandish today, but we’re already starting to see whose models predict AI advancements and consequences better. Is it more like the arrival of search engines, which save us time and don’t result in the extinction of the human race — or is it more like the advent of nuclear weapons, i.e., a threat requiring a global response and coordination? Or is it something in between?

When we’re talking about existential risks, we’ve got to be able to be more specific than “almost certainly not” or “very unlikely” or in Kapoor’s words, “not urgent or serious.” That’s what probabilities are for. If you’re someone who’s especially bullish or bearish on risks posed by AI, surely there is something that, were it to happen, would make you re-examine your view. Odds are there’s a Metaculus question for you!

Parting thoughts

Speculation is proposing outcomes, pathways, onramps to better or worse futures. Assigning probabilities to those outcomes, I argue, is different. When we lack numerical models, we don’t have to throw our hands up and… well, what would we do? We all have implicit models about what track the world is on, what’s worth worrying about, what requires intervention. Narayanan and Kapoor clearly have their own models of what’s going to happen with AI. Their model says restricting AI development would increase x-risk from AI (which invites the question, “by how much?”) and that there exist policies that are “compatible with a range of possible estimates of AI risk” (which invites the question, “what range?”).

Probabilistic predictions aren’t a substitute for scenario planning or qualitative predictions; rather, they’re a set of tools for augmenting discourse about what’s coming and what we should do about it. There is certainly a cost to policymakers’ intervening in AI development. I’m not here to take a stance on what the right course of action is. But I do believe that there is value to expressing our beliefs as probabilities and reasoning through short- and long-term risks — from asteroids to wars to AI.

  1. ^

    The Forecasting Research Institute’s report “Roots of Disagreement on AI Risk: Potential and Pitfalls of Adversarial Collaboration” identifies some such cruxes between people concerned about AI x-risk and skeptics of AI x-risk, as well as places where they’re in agreement on longer-run negative effects of AI on humans.

  2. ^

    Taiwan Semiconductor Manufacturing Company (TSMC) currently controls 61.7% of market share in the global semiconductor foundry market (Statista).

Comments3


Sorted by Click to highlight new comments since:

Executive summary: Assigning probabilities to uncertain future events, including AI existential risks, is valuable for communication, reasoning, and accountability, and can inform policy decisions despite limitations.

Key points:

  1. Probabilities allow for more nuanced communication about expectations compared to vague language.
  2. Probabilistic reasoning is useful even for events without clear reference classes, like AI risks or nuclear escalation.
  3. Tracking changes in probability estimates over time provides valuable information as new evidence emerges.
  4. Assigning probabilities helps identify key uncertainties and scenarios to avoid, which is crucial for policymakers.
  5. Probabilistic forecasts allow for accountability by tracking who consistently makes accurate predictions about AI progress.
  6. Expressing beliefs as probabilities complements other tools like scenario planning and qualitative predictions for policy discussions.

 

 

This comment was auto-generated by the EA Forum Team. Feel free to point out issues with this summary by replying to the comment, and contact us if you have feedback.

Hey Molly,

Thanks for the post! 
Definitely agree that it would be valuable to explicate our models of the world, especially numerically!

Heads up that you have a repeat paragraph immediately after the probability graph for the Russian invasion of Ukraine.

"When we’re estimating the risk of an extinction-causing asteroid impact..."

Thanks Ivan! More work on actually articulating our models of the world: incoming...! I don't think point forecasts accomplish this very well.

Curated and popular this week
Paul Present
 ·  · 28m read
 · 
Note: I am not a malaria expert. This is my best-faith attempt at answering a question that was bothering me, but this field is a large and complex field, and I’ve almost certainly misunderstood something somewhere along the way. Summary While the world made incredible progress in reducing malaria cases from 2000 to 2015, the past 10 years have seen malaria cases stop declining and start rising. I investigated potential reasons behind this increase through reading the existing literature and looking at publicly available data, and I identified three key factors explaining the rise: 1. Population Growth: Africa's population has increased by approximately 75% since 2000. This alone explains most of the increase in absolute case numbers, while cases per capita have remained relatively flat since 2015. 2. Stagnant Funding: After rapid growth starting in 2000, funding for malaria prevention plateaued around 2010. 3. Insecticide Resistance: Mosquitoes have become increasingly resistant to the insecticides used in bednets over the past 20 years. This has made older models of bednets less effective, although they still have some effect. Newer models of bednets developed in response to insecticide resistance are more effective but still not widely deployed.  I very crudely estimate that without any of these factors, there would be 55% fewer malaria cases in the world than what we see today. I think all three of these factors are roughly equally important in explaining the difference.  Alternative explanations like removal of PFAS, climate change, or invasive mosquito species don't appear to be major contributors.  Overall this investigation made me more convinced that bednets are an effective global health intervention.  Introduction In 2015, malaria rates were down, and EAs were celebrating. Giving What We Can posted this incredible gif showing the decrease in malaria cases across Africa since 2000: Giving What We Can said that > The reduction in malaria has be
Rory Fenton
 ·  · 6m read
 · 
Cross-posted from my blog. Contrary to my carefully crafted brand as a weak nerd, I go to a local CrossFit gym a few times a week. Every year, the gym raises funds for a scholarship for teens from lower-income families to attend their summer camp program. I don’t know how many Crossfit-interested low-income teens there are in my small town, but I’ll guess there are perhaps 2 of them who would benefit from the scholarship. After all, CrossFit is pretty niche, and the town is small. Helping youngsters get swole in the Pacific Northwest is not exactly as cost-effective as preventing malaria in Malawi. But I notice I feel drawn to supporting the scholarship anyway. Every time it pops in my head I think, “My money could fully solve this problem”. The camp only costs a few hundred dollars per kid and if there are just 2 kids who need support, I could give $500 and there would no longer be teenagers in my town who want to go to a CrossFit summer camp but can’t. Thanks to me, the hero, this problem would be entirely solved. 100%. That is not how most nonprofit work feels to me. You are only ever making small dents in important problems I want to work on big problems. Global poverty. Malaria. Everyone not suddenly dying. But if I’m honest, what I really want is to solve those problems. Me, personally, solve them. This is a continued source of frustration and sadness because I absolutely cannot solve those problems. Consider what else my $500 CrossFit scholarship might do: * I want to save lives, and USAID suddenly stops giving $7 billion a year to PEPFAR. So I give $500 to the Rapid Response Fund. My donation solves 0.000001% of the problem and I feel like I have failed. * I want to solve climate change, and getting to net zero will require stopping or removing emissions of 1,500 billion tons of carbon dioxide. I give $500 to a policy nonprofit that reduces emissions, in expectation, by 50 tons. My donation solves 0.000000003% of the problem and I feel like I have f
LewisBollard
 ·  · 8m read
 · 
> How the dismal science can help us end the dismal treatment of farm animals By Martin Gould ---------------------------------------- Note: This post was crossposted from the Open Philanthropy Farm Animal Welfare Research Newsletter by the Forum team, with the author's permission. The author may not see or respond to comments on this post. ---------------------------------------- This year we’ll be sharing a few notes from my colleagues on their areas of expertise. The first is from Martin. I’ll be back next month. - Lewis In 2024, Denmark announced plans to introduce the world’s first carbon tax on cow, sheep, and pig farming. Climate advocates celebrated, but animal advocates should be much more cautious. When Denmark’s Aarhus municipality tested a similar tax in 2022, beef purchases dropped by 40% while demand for chicken and pork increased. Beef is the most emissions-intensive meat, so carbon taxes hit it hardest — and Denmark’s policies don’t even cover chicken or fish. When the price of beef rises, consumers mostly shift to other meats like chicken. And replacing beef with chicken means more animals suffer in worse conditions — about 190 chickens are needed to match the meat from one cow, and chickens are raised in much worse conditions. It may be possible to design carbon taxes which avoid this outcome; a recent paper argues that a broad carbon tax would reduce all meat production (although it omits impacts on egg or dairy production). But with cows ten times more emissions-intensive than chicken per kilogram of meat, other governments may follow Denmark’s lead — focusing taxes on the highest emitters while ignoring the welfare implications. Beef is easily the most emissions-intensive meat, but also requires the fewest animals for a given amount. The graph shows climate emissions per tonne of meat on the right-hand side, and the number of animals needed to produce a kilogram of meat on the left. The fish “lives lost” number varies significantly by
Recent opportunities in Policy
20
Eva
· · 1m read