Grokking “Semi-informative priors over AI timelines”

[anonymous]

17 min read · Jun 12, 2022

Comments 1

poppinfresh

Thanks for this, I think it deepened my understanding of Tom's model. It looks like a lot of work went into this post and I appreciate you taking the time to make your analysis so intelligible!

Green boxes correspond to inputs, red boxes are assumptions or limitations, and blue boxes are classed as “other”.

I’ve written a summary of the report as part of this sequence, if you’re interested!

One way to think about this is as a distinction between “inside view” and “outside view” approaches (though see also this post). Cotra’s bioanchors report takes an inside view: it roughly assumes that training compute is the biggest bottleneck to building TAI, and quantifies how much compute we’ll need to train a transformative model. Davidson’s semi-informative priors report instead specifies very little about how AI development works, leaning more heavily on reference classes from similar technologies and a general Bayesian framework.

This is a variation of the sunrise problem, which was the original problem that Pierre-Simon Laplace was trying to solve.
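
As a minimal sketch of the rule this footnote refers to: Laplace's rule of succession starts from a uniform prior over the unknown per-trial success probability, and after observing `successes` in `trials` it predicts that the next trial succeeds with probability (successes + 1) / (trials + 2). The function name and the example counts below are illustrative assumptions, not taken from the report.

```python
from fractions import Fraction


def rule_of_succession(successes: int, trials: int) -> Fraction:
    """Laplace's rule: P(next trial succeeds) under a uniform prior."""
    return Fraction(successes + 1, trials + 2)


# Sunrise-style case: every observed trial was a success.
print(rule_of_succession(10_000, 10_000))  # 10001/10002

# AGI-style case: every observed trial (e.g. each year) was a failure.
print(rule_of_succession(0, 50))  # 1/52
```

With no observations at all, the rule gives 1/2, which is one reason the uniform prior looks too aggressive for something like AGI.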

This is of course a somewhat dubious assumption, and we’ll come back to this later on.

Indeed, looking only at the base rate of successful first trials would run into a sparsity problem: there’s just not enough historical data!

We could also think about the number of virtual trials rather than virtual successes, but Davidson decides against this. Loosely speaking, if we use virtual trials, then it’s not as easy to separate out the effects of the first-trial probability and the effects from observed failed trials (more).

The prior is defined using a Beta distribution parameterised by (1) the number of virtual successes, and (2) the inverse of the first-trial probability. See here for more information.

The “plausibility of the prior” focuses on the shape of the Beta distribution, e.g. whether you should expect the probability density to be larger in the interval [0, 1/1000] or [1/1000, 2/1000]. On the other hand, the “plausibility of the update” looks at how your expected probability of building AGI next year should change given the outcomes of newly observed trials. For example (borrowing from the report), “If you initially thought the annual chance of developing AGI was 1/100, 50 years of failure is not that surprising and it should not reduce your estimate down as low as 1/600”.
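
The quoted example can be checked with a small sketch. It assumes the standard Beta-Bernoulli setup: a Beta(α, β) prior where α is the number of virtual successes and the first-trial probability equals α / (α + β), so that after n failed trials the expected success probability of the next trial is α / (α + β + n). The function name is hypothetical, and the choice of 1 versus 0.1 virtual successes is only for illustration.

```python
from fractions import Fraction


def next_trial_probability(first_trial_prob, virtual_successes, failures):
    """Expected P(success on the next trial) after `failures` failed trials.

    Assumes a Beta(alpha, beta) prior with alpha = virtual_successes and
    alpha / (alpha + beta) = first_trial_prob.
    """
    alpha = virtual_successes
    beta = virtual_successes * (1 / first_trial_prob - 1)
    return alpha / (alpha + beta + failures)


ftp = Fraction(1, 100)  # initial annual chance of AGI from the quote

# One virtual success: 50 years of failure gives a moderate update.
print(next_trial_probability(ftp, Fraction(1), 50))  # 1/150

# 0.1 virtual successes: the same evidence drags the estimate to 1/600.
print(next_trial_probability(ftp, Fraction(1, 10), 50))  # 1/600
```

Under these assumptions, fewer virtual successes means the prior is held less firmly, so the same 50 failures produce a much larger update. This is the kind of behaviour the “plausibility of the update” criterion is meant to rule in or out.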

This approach also applies to researcher-years and compute years, and is described in more detail here.

Incidentally, this is a claim that’s central to another of Open Philanthropy’s Worldview Investigations, Forecasting TAI with biological anchors, which I’ve discussed in another post.

Note that this doesn’t imply that there’s an infinite probability of developing AGI in the first researcher-year of effort, because it’s not true that we’re starting from the “zero” level of AI technological development. Essentially, the regime start-time is not about “when the level of AI technological development started increasing”; see this footnote for more discussion.

For example, we would like our prediction for $P(\text{AGI within 10 years})$ to remain the same even if we use a trial definition of 1 month instead of 1 year. Although using a trial definition of 1 month would ordinarily lead to more total observed trials and thus more updating, this effect is cancelled out by choosing a different first-trial probability.
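
To illustrate how this cancellation can work, here is a sketch under stated assumptions: a Beta(α, β) prior with one virtual success (α = 1), 50 years of observed failure, and a yearly first-trial probability of 1/100. Switching to monthly trials multiplies the number of observed and future trials by 12; rescaling β by 12 as well (equivalent to picking a smaller monthly first-trial probability) leaves the 10-year forecast unchanged. The rescaling rule and all names are illustrative and not necessarily the report's exact procedure.

```python
import math


def prob_success_within(alpha, beta, failures_so_far, future_trials):
    """P(at least one success in the next `future_trials` trials),
    given a Beta(alpha, beta) prior and `failures_so_far` failed trials."""
    p_all_fail = 1.0
    for k in range(future_trials):
        # Posterior-predictive failure probability for each future trial.
        p_all_fail *= 1.0 - alpha / (alpha + beta + failures_so_far + k)
    return 1.0 - p_all_fail


# Yearly trials: first-trial probability 1/100 means beta = 99 when alpha = 1.
yearly = prob_success_within(1.0, 99.0, failures_so_far=50, future_trials=10)

# Monthly trials: 12x as many trials, with beta rescaled by 12
# (i.e. a monthly first-trial probability of 1 / (1 + 12 * 99)).
monthly = prob_success_within(1.0, 12 * 99.0, failures_so_far=600, future_trials=120)

print(round(yearly, 4), round(monthly, 4))
print(math.isclose(yearly, monthly, rel_tol=1e-9))
```

With α = 1 the product telescopes, so both forecasts equal 10/159; for other numbers of virtual successes the match under this simple rescaling is only approximate.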

More concretely, suppose you think that several different update rules (corresponding to e.g. different numbers of virtual successes) all seem reasonable, and you’re uncertain which to use. One approach is to weight the results for the different choices of update rule, and use these rules to update the forecasts based on evidence. But we might also be interested in updating how we weight the update rules themselves, which is where the hyper prior comes in (more).
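
One way to picture the hyper prior is as ordinary Bayesian model averaging: each update rule gets a prior weight, and the weights are rescaled by how well each rule predicted the failures observed so far. The sketch below assumes each rule is a Beta prior defined by a first-trial probability and a number of virtual successes; the two rules, their equal starting weights, and the function names are hypothetical choices for illustration rather than the report's settings.

```python
def failure_likelihood(first_trial_prob, virtual_successes, failures):
    """P(observing `failures` consecutive failed trials) under one update rule."""
    alpha = virtual_successes
    beta = virtual_successes * (1 / first_trial_prob - 1)
    likelihood = 1.0
    for k in range(failures):
        likelihood *= 1.0 - alpha / (alpha + beta + k)
    return likelihood


def update_rule_weights(rules, prior_weights, failures):
    """Reweight update rules in proportion to prior weight times likelihood."""
    unnormalised = [
        weight * failure_likelihood(ftp, v, failures)
        for (ftp, v), weight in zip(rules, prior_weights)
    ]
    total = sum(unnormalised)
    return [u / total for u in unnormalised]


# Two hypothetical rules: (first-trial probability, virtual successes).
rules = [(1 / 100, 1.0), (1 / 1000, 1.0)]
weights = update_rule_weights(rules, [0.5, 0.5], failures=50)
print([round(w, 3) for w in weights])
```

The rule with the higher first-trial probability found 50 failures more surprising, so it ends up with less than half of the weight.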

These numbers were extracted using WebPlotDigitizer.

Depending on your point of view, this may not be very compelling evidence – e.g. you might think that the ramp up to AGI would be extremely fast due to the discovery of a “secret sauce”.

You can also have a look at the full report if you want to get into the details!

P(AGI by 2036), by trial definition:

| Trial definition | Low-end | Central estimate | High-end |
| --- | --- | --- | --- |
| Calendar-year | 1.5% | 4% | 9% |
| Researcher-year | 2% | 8% | 15% |
| Compute trial | 2% | 15% | 25% |

| P(AGI by 2030) | P(AGI by 2050) | P(AGI by 2100) |
| --- | --- | --- |
| ~6% | ~11% | ~20% |

| Cumulative probability of AGI | 10% | 50% | 90% |
| --- | --- | --- | --- |
| Year reached | ~2044 | >2100 | >2100 |

Executive Summary

Motivation

Laplace’s Rule of Succession

Making the priors less uninformative

Semi-informative priors demystified

First-trial probability

Number of virtual successes

Regime start time

Trial definition

Putting things together: Final distribution

Model Extensions

Final Distribution

Conclusion