Roodman's Thoughts on Biological Anchors

lukeprog

Roodman's Thoughts on Biological Anchors

lukeprog

2 min read · Sep 14, 2022

Comments 8

Sorted by

New & upvoted

Zach Stein-Perlman

I disagree with Roodman's criticism quoted here. Cotra's approach involves estimating that there's an X% chance that the first achievable TAI will look like A, a Y% chance like B, and so on. Some anchors (e.g., short-horizon neural network and long-horizon neural network) are obviously incompatible; whatever the future looks like, they won't both describe the first achievable TAI. Multiplying them is clearly not meaningful; Roodman's proposed "restriction that the various frameworks agree" makes no sense. (Multiplying them would be correct if Cotra's different anchors represented something like different information-sources on necessary-and-sufficient-conditions-for-TAI, but that's not what her anchors represent.)

(I suspect I may be missing something.)

Will Aldred

Roodman's proposed "restriction that the various frameworks agree" makes no sense.

I'm with you. I think Roodman must disagree with the idea of giving probabilies to different (and necessarily conflicting) models of the world, but to me this seems like an odd position to hold. I might also be missing something.

Daniel_Eth

Agree with what you're saying. This part of the review in particular stood out to me:

In pure Bayesian reasoning, if one has several uncertain measurements of the same value, each represented by a probability distribution...

Since Cotra isn't presenting the different anchors as all-things-considered estimates, but instead more like different hypotheses. Consider the evolutionary anchor – Cotra could have divided the compute requirements in this anchor by a scaling factor for how much more efficient she believes human-directed SGD (or similar) will be compared to how efficient evolution was at finding intelligence, yielding an all-things-considered estimate of how much compute will be necessary for TAI, but instead she leaves the value as is and considers it a soft upper bound.

JoshuaBlake

The approach of Cotra criticised here could be interpreted as Bayesian model averaging I think. This seems fine, maybe Roodman disagrees, but I think he needs to expand a bit.

Michael St Jules 🔸

If we’re looking at AI meeting any of multiple thresholds, you could take the minimum of the random variables representing the dates the thresholds are passed. If it's supposed to meet all, you'd take the maximum. You could pick subsets of dates to take mins or maxes of to do this with, and mix probabilistically by sampling each distribution.

(Maybe this was already done? It's been a while since I thought about the report.)

Vasco Grilo🔸

In Bayesian reasoning, if two distributions for the same parameter are normal, then their combination is too; its mean is the average of the two primary means, weighting by the respective precisions (inverse variances).

I think this refers to the inverse-variance method. I am not sure under which conditions it should be applied, but it minimises the variance of a weighted mean of 2 estimates of the same variable of interest.

matthew.vandermerwe

I'd love to see the Guesstimate model linked in the report, but the link doesn't work for me.

Pablo

In case this is useful to others, here is a working link. (Thanks to David Roodman for fixing it.)

Comments