Sharing what context I’m able to: Our work in this space so far has mostly been around assessing both risks and effective safeguards for AI-biorisk and AI-cybersecurity risk.
Superforecasters and domain experts tend to be relatively aligned on these topics so far (e.g., see this study as one example, and a later update here). (We’ve completed private research on AI-cybersecurity risk and will be publishing some of it soon.)
[Relevant context/COI: I'm CEO at the Forecasting Research Institute (FRI), an organization which I co-founded with Phil Tetlock and others. Much of the below is my personal perspective, though it is informed by my work. I don't speak for others on my team. I’m sharing an initial reply now, and our team at FRI will share a larger post in future that offers a more comprehensive reflection on these topics.]
Thanks for the post — I think it's important to critically question the value of funds going to forecasting, and this post offers a good opportunity for reflection and discussion.
In brief, I share many of your concerns about forecasting and related research, but I'm also more positive on both its impact so far and its future expected impact.
A summary of some key points:
More detail on some select points below. This comment already got very long (!), so I’ll reserve more elaboration for a future, more comprehensive post.
Forecasting research has informed some very important decisions. Unfortunately, many of the details of the relevant evidence here cannot be made public. However, there is evidence of substantial public citation of this research, and some public evidence of affecting particular decisions.
A few examples of relevant impact include:
Some examples of more diffuse impacts — e.g., impact on public understanding of AI and research for policymakers or philanthropists, include:
For context: FRI has been operating for a little over 3 years, and we're accumulating substantially more momentum in terms of connections to top decision-makers as time goes on.
(To be clear: I am mostly discussing FRI here since it’s what I’m most familiar with.)
Forecasts about AI timelines and risk have had major effects on people’s career decisions and the broader AI discourse. AI 2027 underlies popular YouTube videos, 80,000 Hours advises people on career decisions based on timelines forecasts, Dario Amodei’s “country of geniuses in a datacenter by 2027” forecast informs a lot of Anthropic’s work and policy outreach, the AI Impacts survey on AI researchers’ forecasts of existential risk is highly cited, etc.
A major reason I got into this field is that many people are making very intense claims about the effect that AI will have on the world soon, and I want to bring as much rigor and reflection as possible to those claims. So far, it looks like most forecasters are substantially underestimating AI capabilities progress (with some exceptions, e.g. on uplift studies); the evidence on forecasts about AI adoption, societal impacts, and risk is less clear, but I expect we will have more evidence soon, particularly from the Longitudinal Expert AI Panel (LEAP), especially as some forecasters are predicting transformative change in the next few years.
As the expected impact and timing of AI progress is sharpened and clarified, talent and money can be allocated more efficiently.
Case study: Economic impacts of AI
In some cases, it looks to me like forecasting research is picking relatively low-hanging fruit.
The economic impact of AI is a prominent topic of public discussion right now, and it is likely that governments will spend many billions of dollars to address it in the coming years.
Currently, economists hold major sway in public policy about the economic impacts of AI. Perhaps you think top economists, as a group, are badly mistaken about the likely near-term impacts of AI, as some Epoch researchers and others believe. Perhaps you think they are likely to be fairly accurate, as Tyler Cowen, Séb Krier, or typical economists believe. It seems like a valuable common sense intervention to at least document what various groups believe, so that when we are making economic policy going forward we can rely on that evidence to determine who is trustworthy. I believe that studies like this one (and its follow-ups) will be the clearest evidence on the topic.
When thinking about the impact and cost-effectiveness of forecasting, I think it’s more appropriate to compare this work to public goods-oriented research organizations (e.g., Our World in Data, Epoch, etc.) and policy-oriented think-tank research (e.g. GovAI, IAPS, CSET, etc.).
I’ve been disappointed by most impact evaluation of think-tanks and public goods-oriented research that I’ve seen. I believe this is partly because it is very difficult to quantify the impact of this type of work because it has diffuse benefits. But, I still think it’s possible to do better and I would like FRI to do better on this front going forward.
That said, I still believe there are reasonable heuristics for why this research area could be highly cost-effective. There are many billions of dollars of philanthropic and government capital being spent on AI policy topics. If there is a meaningful indication that forecasting is changing people’s views on these questions (as I believe there is; see discussion above), it seems reasonable to me to spend a very small fraction of that capital on getting more epistemic clarity.
Forecasting research, and FRI’s research in particular, still has major areas for improvement.
Examples of a few key issues:
I will save other thoughts on how forecasting, and FRI’s research, could be made more useful to decision-makers for a future post.
But, to be clear: I have a lot of genuine uncertainty about whether forecasting research will be sufficiently impactful going forward. There are promising signs, and increasing momentum, but to more fully deliver on its promise, more improvements will be necessary.
On the value of FRI-style forecasting research in particular:
Finally, there are a few factors that have the potential to dramatically change the field going forward:
Thanks, Ryan, this is great. These are the kinds of details we are hoping for in order to inform future operationalizations of “AI takeover” and “existential catastrophe” questions.
For context: We initially wanted to keep our definition of “existential catastrophe” closer to Ord’s definition, but after a few interviews with experts and back-and-forths we struggled to get satisfying resolution criteria for the “unrecoverable dystopia” and (especially) “destruction of humanity’s longterm potential” aspects of the definition. Our ‘concerned’ advisors thought the “extinction” and “unrecoverable collapse” parts would cover enough of the relevant issues and, as we saw in the forecasts we’ve been discussing, it seems like it captured a lot of the risk for the ‘concerned’ participants in this sample. But, we’d like to figure out better operationalizations of “AI takeover” or related “existential catastrophes” for future projects, and this is helpful on that front.
Broadly, it seems like the key aspect to carefully operationalize here is “AI control of resources and power.” Your suggestion here seems like it’s going in a helpful direction:
“One way to operationalize this is that if the AIs in control wanted to kill or torture humans, they could easily do so.”
We’ll keep reflecting on this, and may reach out to you when we write “takeover”-related questions for our future projects and get into the more detailed resolution criteria phase.
Thanks for taking the time to offer your detailed thoughts on the outcomes you’d most like to see forecasted.
Hi Ryan,
Thanks for the comment!
Regarding “extinction”:
Just to be clear, the primary outcome we looked at (after considering various definitions and getting agreement from some key ‘concerned’ people) was “existential catastrophe,” defined as either extinction or “unrecoverable collapse,” with the latter defined as “(a) a global GDP of less than $1 trillion annually in 2022 dollars for at least a million years (continuously), beginning before 2100; or (b) a human population remaining below 1 million for at least a million years (continuously), beginning before 2100.”
However, we also sanity checked (see p. 14) our findings by asking about the probability that more than 60% of humans would die within a 5-year period before 2100. The median concerned participant forecasted 32%, and the median skeptic forecasted 1%. So, this outcome was considered much more likely by skeptics (median of 1% vs. 0.12% for existential catastrophe). But, a very large gap between the groups still existed. And it also did not seem that focusing on this alternative outcome made a major difference to crux rankings when we collected a small amount of data on it. So, for the most part we focus on the “existential catastrophe” outcome and expect that most of the key points in the debate would still hold for somewhat less extreme outcomes (with the exception of the debate about how difficult it is to kill literally everyone, though that point is relevant to at least people who do argue for high probabilities on literal extinction).
We also had a section of the report ("Survey on long-term AI outcomes") where we asked both groups to consider other severe negative outcomes such as major decreases in human well-being (median <4/10 on an "Average Life Evaluation" scale) and 50% population declines.
Do you have alternative “extremely bad” outcomes that you wish had been considered more?
Regarding “displacement” (footnote 10 on p. 6 for full definition):
We added this question in part because some participants and early readers wanted to explore debates about “AI takeover,” since some say that is the key negative outcome they are worried about rather than large-scale death or civilizational collapse. However, we found this difficult to operationalize and agree that our question is highly imperfect; we welcome better proposals. In particular, as you note, our operationalization allows for positive ‘displacement’ outcomes where humans choose to defer to AI advisors and is ambiguous in the ‘AI merges with humans’ case.
Your articulations of extremely advanced AI capabilities and energy use seem useful to ask about also, but do not directly get at the “takeover” question as we understood it.
Nevertheless, our existing ‘displacement’ question at least points to some major difference in world models between the groups, which is interesting even if the net welfare effect of the outcome is difficult to pin down. A median year for ‘displacement’ (as currently defined) of 2045 for the concerned group vs. 2450 for the skeptics is a big gap that illustrates major differences in how the groups expect the future to play out. This helped to inspire the elaboration on skeptics’ views on AI risk in the “What long-term outcomes from AI do skeptics expect?” section.
Finally, I want to acknowledge that one of the top questions we wished we asked related to superintelligent-like AI capabilities. We hope to dig more into this in follow-up studies and will consider the definitions you offered.
Thanks again for taking the time to consider this and propose operationalizations that would be useful to you!
(Below written by Peter in collaboration with Josh.)
It sounds like I have a somewhat different view of Knightian uncertainty, which is fine—I’m not sure that it substantially affects what we’re trying to accomplish. I’ll simply say that, to the extent that Knight saw uncertainty as signifying the absence of “statistics of past experience,” nuclear war strikes me as pretty close to a definitional example. I think we make the forecasting challenge easier by breaking the problem into pieces, moving us closer to risk. That’s one reason I wanted to add conventional conflict between NATO and Russia as an explicit condition: NATO has a long history of confronting Russia and, by and large, managed to avoid direct combat.
By contrast, the extremely limited history of nuclear war does not enable us to validate any particular model of the risk. I fear that the assumptions behind the models you cite may not work out well in practice and would like to see how they perform in a variety of as-similar-as-possible real world forecasts. That said, I am open to these being useful ways to model the risk. Are you aware of attempts to validate these types of methods as applied to forecasting rare events?
On the ignorance prior:
I agree that not all complex, debatable issues imply probabilities close to 50-50. However, your forecast will be sensitive to how you define the universe of "possible outcomes" that you see as roughly equally likely from an ignorance prior. Why not define the possible outcomes as: one-off accident, containment on one battlefield in Ukraine, containment in one region in Ukraine, containment in Ukraine, containment in Ukraine and immediately surrounding countries, etc.? Defining the ignorance prior universe in this way could stack the deck in favor of containment and lead to a very low probability of large-scale nuclear war. How can we adjudicate what a naive, unbiased description of the universe of outcomes would be?
As I noted, my view of the landscape is different: it seems to me that there is a strong chance of uncontrollable escalation if there is direct nuclear war between Russia and NATO. I agree that neither side wants to fight a nuclear war—if they did, we’d have had one already!— but neither side wants its weapons destroyed on the ground either. That creates a strong incentive to launch first, especially if one believes the other side is preparing to attack. In fact, even absent that condition, launching first is rational if you believe it is possible to “win” a nuclear war, in which case you want to pursue a damage-limitation strategy. If you believe there is a meaningful difference between 50 million dead and 100 million dead, then it makes sense to reduce casualties by (a) taking out as many of the enemy’s weapons as possible; (b) employing missile defenses to reduce the impact of whatever retaliatory strike the enemy manages; and (c) building up civil defenses (fallout shelters etc.) such that more people survive whatever warheads survive (a) and (b). In a sense “the logic of nuclear war” is oxymoronic because a prisoner’s dilemma-type dynamic governs the situation such that, even though cooperation (no war) is the best outcome, both sides are driven to defect (war). By taking actions that seem to be in our self-interest we ensure what we might euphemistically call a suboptimal outcome. When I talk about “strategic stability,” I am referring to a dynamic where the incentives to launch first or to launch-on-warning have been reduced, such that choosing cooperation makes more sense. New START (and START before it) attempts to boost strategic stability by establishing nuclear parity (at least with respect to strategic weapons). But its influence has been undercut by other developments that are de-stabilizing.
Thank you again for the thoughtful comments, and I’m happy to engage further if that would be clarifying or helpful to future forecasting efforts.
Thanks for the reply and the thoughtful analysis, Misha and Nuño, and please accept our apologies for the delayed response. The below was written by Peter in collaboration with Josh.
First, regarding the Rodriguez estimate, I take your point about the geometric mean rather than arithmetic mean and that would move my probability of risk of nuclear war down a bit — thanks for pointing that out. To be honest, I had not dug into the details of the Rodriguez estimate and was attempting to remove your downward adjustment from it due to "new de-escalation methods" since I was not convinced by that point. To give a better independent estimate on this I'd need to dig into the original analysis and do some further thinking of my own. I'm curious: How much of an adjustment were you making based on the "new de-escalation methods" point?
Regarding some of the other points:
Peter says: No, I live in Washington, DC a few blocks from the White House, and I’m not suggesting evacuation at the moment because I think conventional conflict would precede nuclear conflict. But if we start trading bullets with Russian forces, odds of nuclear weapons use goes up sharply. And, yes, I do believe risk is higher in Europe than in the United States. But for the moment, I’d happily attend a conference in London.
The bulk of our funding has gone toward AI-focused forecasting projects (e.g. LEAP, AI-biorisk, economic effects of AI) or ‘automating forecasting research’-type work that has the ultimate goal of assisting decisionmakers (e.g. ForecastBench), so I think this is most of what FRI should be evaluated on.
I’m not sure what comparison class people had in mind previously, but I agree it seems broadly correct to consider this work alongside other AI-related funding opportunities. As noted above, I’d argue that it is appropriate and valuable to have “AI measurement” as an important funding domain alongside areas like “AI governance,” “Technical AI safety research,” “AI field-building,” etc. It seems valuable for one part of the AI grantmaking portfolio to be generating evidence that can be used to sharpen views on AI timelines, to assess risk in various domains (bio, cyber, catastrophic risk), to assess magnitudes of benefits (for calibrating cost-benefit analyses on policies), and to predict the likelihood and impact of various policies (e.g. the effectiveness of DNA synthesis screening for biorisk), etc. This type of fundamental research can inform and support more effective action in the other domains.
I also think forecasting research can have direct impacts on AI governance via direct decision-making partnerships like I described above: i.e., directly partnering with and advising important government agencies and frontier AI companies, among others, on high-stakes decisions related to AI regulation, implementing effective safeguards to reduce AI-cyber risk, and more. We have already seen some early impacts along these lines, as previously mentioned.
I agree. Due to confidentiality, we have primarily shared details of our impact case studies with our funders and had them assess the value of the impact we are making. Establishing evidence of impact publicly is more challenging due to confidentiality considerations. But elsewhere in the thread people have mentioned citations as one reasonable metric for evidence of impact for research organizations that have more diffuse impacts. We have targets for growing our prominent citations over time to assess our impact, and I’ve shared examples of prominent citations to FRI research in my comment above. I also hope that over time, we can share more case studies publicly and provide more of the reasoning for why we believe we had an impact and whether it was positive. The benchmarks RFP case study described above is one example that can be discussed relatively publicly.
I broadly agree on these points. We are running longitudinal expert panels, partnering with important institutions to improve their decision-making, and automating forecasting research, so I see our work as distinct from online betting/forecasting platforms.