Hide table of contents

TLDR

We are creating a database to collect base rates for various categories of events. You can find the database here and can suggest new base rate categories for us to look into here.

Project Summary

The base rate database project collects base rates for different categories of events and makes them available to researchers, forecasters and philanthropic organisations. Its main goals are to develop better intuitions about the potential and limitations of reference class forecasting and to provide useful information to the public. The data will enable research that enhances our understanding of the kinds of circumstances in which reference forecasting is a promising approach, what kinds of methods of reference forecasting work best, how to construct reasonable reference classes, and what potential caveats and pitfalls are. In addition to the raw data we will collect qualitative feedback on individual reference classes and on the overall process of building a base rate database, adding context to the data and developing comprehensive knowledge to build upon in the future. We aim to select categories of base rates in a way that makes the information we collect useful to decision makers and philanthropic organisations. 

 

Introduction

If one wants to predict whether some event will happen in the future, it is often helpful to look at the past. One can ask: "Ignoring all the specifics of the current event I'm trying to predict, what would I predict just by looking at the base rate of similar events happening in the past?". This is called reference class forecasting and helps forecasters to obtain an 'outside view' on the forecasting question at hand. This outside view, of course, is usually complemented by the 'inside view': what are the specifics of the current event at hand that distinguish it from other events? 

Reference class forecasting is widely used among forecasters. To this date, however, there has been little systematic research done into how effective base rates are for forecasting future events, how they can best be used and what limitations apply. We aim to facilitate this research. 

 

Project outline

Goal

The main goal of this project is to develop a better understanding of the merits and limitations of reference class forecasting. 

A secondary goal is to collect information that may be useful for forecasters and EA stakeholders in the future. 

 

What we'll do

We want to achieve our goals by

  • asking experienced forecasters to compile a public database with base rates for various categories of events
  • collecting qualitative feedback on the process of collecting base rates, as well as the base rates themselves
  • using the database to conduct and facilitate quantitative and qualitative research, especially with regards to the performance of various reference class forecasting approaches 
  • inviting others (you!) to suggest base rate categories that we should look into through this form

 

Categories that we want to look into

We intend to look into categories as diverse as 

  • Violent and non-violent protests that have (or have not) led to regime change
  • Elections with small margins of victory
  • Zoonotic spillover events
  • Development of new antibiotics
  • ... 

You can find a list of all the categories on our radar here. You can suggest new categories here

 

Specific research questions

The database is meant to be a resource for anyone who is interested in reference class forecasting. Please do feel free to use it for your own research as well as to reach out to us. 

So far, we have thought of the following quantitative analyses we think may be promising: 

  • Comparison of the predictive performance of several reference forecasting approaches, for example:
    • Naive Laplace's rule with different priors (uniform, Jeffrey, Haldane)
    • Time invariant Laplace with different ways of treating the exponent
  • Analysing how useful reference forecasting is overall, for example by
    • constructing a reference class forecast based on the first x observations and scoring the forecast based on the last (n-x) observations
    • arriving at an estimate for the robustness of reference class forecasting by obtaining a distribution of scores for different approaches across different base rate categories  
    • checking how robust forecasts / estimates are to changes in the observation period / the number of data points used. 
    • specifically investigating the relationship between accuracy and the number of data points available by constructing a forecast based on the first X data points and subsequently adding more data points to check consistency. The distribution of robustness would itself provide a base rate for how useful base rates are. 
    • Identifying patterns that make a base rate useful or less useful (e.g. if there is a dynamic over time, simply looking at the base rate may not be enough)

We also aim to obtain a better qualitative understanding of reference class forecasting by asking that forecasters who collect the base rates to reflect on the process as well as the individual base rate categories, for example

  • How clear are criteria for inclusion / exclusion and the period that was looked at
  • How trustworthy is the data?
  • Are there any trends that can be identified? 
  • General thoughts / lessons learned

How you can help

Suggesting new categories

You can suggest new categories to include in the database here. Suggested categories should ideally be at least one of the following: 

  • helpful / useful / interesting
  • easy to collect

Providing feedback

If you have thoughts on anything presented here, please let us know in the comments or get in touch directly.  

 

 

 

Comments8


Sorted by Click to highlight new comments since:

Hey!

FYI, this pattern matches to an elegant EA meta project that sometimes goes wrong. Forecasting is not my domain and I have no object level opinions, just pointing out the potential failure mode just in case.

Upvoted, and I hope this goes well!

FWIW I'm one of the future users of this project and regularly chatting to this team.

My use case is for research, eg validating this approach with empirical data .

I expect this database will be useful in the future as a benchmark to test similar approaches, and the program probably justifies its (low) costs in those grounds alone.

Nice! I defer to your opinion

Love the idea of this! You could potentially turn the main table into a pivot table, to make it easier to filter for things 

Do you mean have the table be in a long-format rather than a wide-format? 

There is an inherent aim in this project to bring to surface salient data that can be recombined and organized for a radically new information-based layer of decision making

But there are some pros and cons to crowd source, you get stability & good work (wikipedia) but your signal-to-noise is bounded by the leaders or organizing principles of legitimacy. there will be factions that develop trying to skew data with some bias, making the overall project less legitimate even though there's transparency.

what is the target size of individuals working on this project?
is there a definite or indefinite timeline or termination of this project?
what can be learned about governance structure to isolate the distractions and optimize the bandwidth of the participants?

Currently there are 4 people (including me) working on the project. I focus on coordination, the other three are professional forecasters and focus on the data collection. At the moment we're aiming for wide feedback from anyone who would be interested in certain base rates, but we're not actively crowd-sourcing the collection process. 

[comment deleted]1
0
0
Curated and popular this week
 ·  · 20m read
 · 
Advanced AI could unlock an era of enlightened and competent government action. But without smart, active investment, we’ll squander that opportunity and barrel blindly into danger. Executive summary See also a summary on Twitter / X. The US federal government is falling behind the private sector on AI adoption. As AI improves, a growing gap would leave the government unable to effectively respond to AI-driven existential challenges and threaten the legitimacy of its democratic institutions. A dual imperative → Government adoption of AI can’t wait. Making steady progress is critical to: * Boost the government’s capacity to effectively respond to AI-driven existential challenges * Help democratic oversight keep up with the technological power of other groups * Defuse the risk of rushed AI adoption in a crisis → But hasty AI adoption could backfire. Without care, integration of AI could: * Be exploited, subverting independent government action * Lead to unsafe deployment of AI systems * Accelerate arms races or compress safety research timelines Summary of the recommendations 1. Work with the US federal government to help it effectively adopt AI Simplistic “pro-security” or “pro-speed” attitudes miss the point. Both are important — and many interventions would help with both. We should: * Invest in win-win measures that both facilitate adoption and reduce the risks involved, e.g.: * Build technical expertise within government (invest in AI and technical talent, ensure NIST is well resourced) * Streamline procurement processes for AI products and related tech (like cloud services) * Modernize the government’s digital infrastructure and data management practices * Prioritize high-leverage interventions that have strong adoption-boosting benefits with minor security costs or vice versa, e.g.: * On the security side: investing in cyber security, pre-deployment testing of AI in high-stakes areas, and advancing research on mitigating the ris
 ·  · 15m read
 · 
In our recent strategy retreat, the GWWC Leadership Team recognised that by spreading our limited resources across too many projects, we are unable to deliver the level of excellence and impact that our mission demands. True to our value of being mission accountable, we've therefore made the difficult but necessary decision to discontinue a total of 10 initiatives. By focusing our energy on fewer, more strategically aligned initiatives, we think we’ll be more likely to ultimately achieve our Big Hairy Audacious Goal of 1 million pledgers donating $3B USD to high-impact charities annually. (See our 2025 strategy.) We’d like to be transparent about the choices we made, both to hold ourselves accountable and so other organisations can take the gaps we leave into account when planning their work. As such, this post aims to: * Inform the broader EA community about changes to projects & highlight opportunities to carry these projects forward * Provide timelines for project transitions * Explain our rationale for discontinuing certain initiatives What’s changing  We've identified 10 initiatives[1] to wind down or transition. These are: * GWWC Canada * Effective Altruism Australia funding partnership * GWWC Groups * Giving Games * Charity Elections * Effective Giving Meta evaluation and grantmaking * The Donor Lottery * Translations * Hosted Funds * New licensing of the GWWC brand  Each of these is detailed in the sections below, with timelines and transition plans where applicable. How this is relevant to you  We still believe in the impact potential of many of these projects. Our decision doesn’t necessarily reflect their lack of value, but rather our need to focus at this juncture of GWWC's development.  Thus, we are actively looking for organisations and individuals interested in taking on some of these projects. If that’s you, please do reach out: see each project's section for specific contact details. Thank you for your continued support as we
 ·  · 3m read
 · 
We are excited to share a summary of our 2025 strategy, which builds on our work in 2024 and provides a vision through 2027 and beyond! Background Giving What We Can (GWWC) is working towards a world without preventable suffering or existential risk, where everyone is able to flourish. We do this by making giving effectively and significantly a cultural norm. Focus on pledges Based on our last impact evaluation[1], we have made our pledges –  and in particular the 🔸10% Pledge – the core focus of GWWC’s work.[2] We know the 🔸10% Pledge is a powerful institution, as we’ve seen almost 10,000 people take it and give nearly $50M USD to high-impact charities annually. We believe it could become a norm among at least the richest 1% — and likely a much wider segment of the population — which would cumulatively direct an enormous quantity of financial resources towards tackling the world’s most pressing problems.  We initiated this focus on pledges in early 2024, and are doubling down on it in 2025. In line with this, we are retiring various other initiatives we were previously running and which are not consistent with our new strategy. Introducing our BHAG We are setting ourselves a long-term Big Hairy Audacious Goal (BHAG) of 1 million pledgers donating $3B USD to high-impact charities annually, which we will start working towards in 2025. 1 million pledgers donating $3B USD to high-impact charities annually would be roughly equivalent to ~100x GWWC’s current scale, and could be achieved by 1% of the world’s richest 1% pledging and giving effectively. Achieving this would imply the equivalent of nearly 1 million lives being saved[3] every year. See the BHAG FAQ for more info. Working towards our BHAG Over the coming years, we expect to test various growth pathways and interventions that could get us to our BHAG, including digital marketing, partnerships with aligned organisations, community advocacy, media/PR, and direct outreach to potential pledgers. We thin
Recent opportunities in Forecasting
20
Eva
· · 1m read