Grokking “Semi-informative priors over AI timelines”

Grokking “Semi-informative priors over AI timelines”

[anonymous]

17 min readJun 12, 2022

Comments 1

Sorted by

New & upvoted

poppinfresh

Thanks for this, I think it deepened my understanding of Tom's model. It looks like a lot of work went into this post and I appreciate you taking the time to make your analysis so intelligible!

Comments

Curated and popular this week

What would an animal-aligned AI be aligned to?

Aidan Kankyoku, Anima International·1w ago·Curated 3d ago·15m read

This is a crosspost from the new Animal Welfare Alignment Newsletter by Anima International. You can subscribe on Substack if you are interested in following these efforts. Audio reading also available on Substack. The goals of this post are to: 1. Raise a question I see as crucially important to the goal of aligning AI to animal welfare...

178

The first video from Giving What We Can's new channel is out now!

JustinPortela·5d ago·1m read

Hello! I'm Justin Portela. I got hired by GWWC to make YouTube videos after AI in Context did such a kickass job. My channel is using that same cinematic, high-production value beauty to talk about everything in the EA universe that isn't AI. ...

New round of digital minds funding opportunities at Longview

zdgroff, Longview Philanthropy·6d ago·2m read

This is a linkpost for Request for Proposals: Research and Applied Work on Digital Minds. I'm glad to announce a request for proposals for research and applied work on digital minds at Longview Ph...

^{^}

Green boxes correspond to inputs, red boxes are assumptions or limitations, and blue boxes are classed as “other”.

^{^}

I’ve written a summary of the report as part of this sequence, if you’re interested!

^{^}

One way to think about this is as a distinction between “inside view” and “outside view” approaches (however see also this post). Cotra’s bioanchors report takes an inside view, roughly based on the assumption that training compute is the biggest bottleneck to building TAI, and quantifying how much we’ll need to be able to train a transformative model. Davidson’s semi-informative priors report instead specifies very little about how AI development works, leaning more heavily on reference classes from similar technologies and a general Bayesian framework.

^{^}

This is a variation of the sunrise problem, which was the original problem that Pierre-Simon Laplace was trying to solve.

^{^}

This is of a course a somewhat dubious assumption, and we’ll come back to this later on.

^{^}

Indeed, looking only at the base rate of successful first trials alone would have a big problem of sparsity – there’s just not enough historical data!

^{^}

We could also think about the number of virtual trials rather than virtual successes, but Davidson decides against this. Loosely speaking, if we use virtual trials, then it’s not as easy to separate out the effects of the first-trial probability and the effects from observed failed trials (more).

^{^}

The prior is defined using a Beta distribution parameterised by (1) the number of virtual successes, and (2) the inverse of the first-trial probability. See here for more information.

^{^}

The “plausibility of the prior” focuses on the shape of the Beta distribution, e.g. whether or not you should expect the probability density to be larger in the interval [0, 1/1000] or [1/1000, 2/1000]. On the other hand, the “plausibility of the update” looks at your expected probability of building AGI next year should change given the outcomes of newly observed trials. For example (borrowing from the report), “If you initially thought the annual chance of developing AGI was 1/100, 50 years of failure is not that surprising and it should not reduce your estimate down as low as 1/600”.

^{^}

This approach also applies to researcher-years and compute years, and is described more here.

^{^}

Incidentally, this is a claim that’s central to another of Open Philanthropy’s Worldview Investigations, Forecasting TAI with biological anchors, which I’ve discussed in another post.

^{^}

Note that this doesn’t imply that there’s an infinite probability of developing AGI in the first researcher-year of effort, because it’s not true that we’re starting from the “zero” level of AI technological development. Essentially, the regime start-time is not about “when the level AI technological development started increasing” – see this footnote for more on discussion.

^{^}

For example, we would like our prediction for $P (AGI within 10 years)$ to remain the same even if we use a trial definition of 1 month instead of 1 year. Although using a trial definition of 1 month would ordinarily lead to more total observed trials and thus more updating, this effect is cancelled out by choosing a different first-trial probability.

^{^}

More concretely, suppose you think that several different updates rules (corresponding to e.g. different numbers of virtual successes) all seem reasonable, and you’re uncertain what to do. One approach is to weight the results for the different choices of update rules, and use these rules to update the forecasts based on evidence. But we might also be interested in updating how we weight the update rules, which is where the hyper prior comes in (more).

^{^}

These numbers were extracted using WebPlotDigitizer.

^{^}

Depending on your point of view, this may not be very compelling evidence – e.g. you might think that the ramp up to AGI would be extremely fast due to the discovery of a “secret sauce”.

^{^}

You can also have a look at the full report if you want to get into the details!

	P(AGI by 2036)
Trial definition	Low-end	Central estimate	High-end
Calendar-year	1.5%	4%	9%
Researcher-year	2%	8%	15%
Compute trial	2%	15%	25%

P(AGI by 2030)	P(AGI by 2050)	P(AGI by 2100)
~6%	~11%	~20%

10%	50%	90%
~2044	>2100	>2100

Grokking “Semi-informative priors over AI timelines”

Grokking “Semi-informative priors over AI timelines”

Executive Summary

Motivation

Laplace’s Rule of Succession

Making the priors less uninformative

Semi-informative priors demystified

First-trial probability

Number of virtual successes

Regime start time

Trial definition

Putting things together: Final distribution

Model Extensions

Final Distribution

Conclusion