On Priors

MichaelDickens

On Priors

MichaelDickens

14 min readApr 26, 2016

Comments 26

Sorted by

New & upvoted

CarlShulman

10y

For a model like this to be effective, we need to choose a good prior belief.

Outside the model, your inchoate 'prior' has to include credence in all the models you could be convinced of by evidence.

A model that fixes the distribution of effectiveness by assumption is unable to accommodate evidence that the distribution is otherwise.

For example, evidence that the Earth is old and will be habitable for hundreds of millions of years is evidence that many kinds of impacts on the world may have big long-run effects. Likewise for astronomical evidence of the resources of the Solar System and other stars.

When you take that into account it increases the expected impact of almost every action, e.g. increasing GDP by $1 has astronomical waste implications. It also has implications for the relative effects of different interventions.

To represent this you need to have a model with uncertainty over the shape of the distribution, e.g. a mixture model of multiple candidate distributions whose weights are updated with evidence.

different priors for different interventions...It’s possible to have a larger effect when helping a group that people with power care less about

Likewise, neglectedness of different areas is subject to empirical inquiry, and much of the evidence we collect in prioritization and evaluation bears on it.

This framework treats a certain subset of evidence about neglectedness very differently from other evidence about neglectedness, or other charity features. All other kinds of evidence, including other evidence about neglectedness (market inefficiencies, biases, taboos, new discoveries, whatever) are discounted severely for top performers, becoming negligible after a certain point, while these differences go undiscounted.

Someone else might make a framework with different priors for direct intervention spending, vs lobbying, vs scientific research, which would similarly favor evidence about intervention type over other evidence for top performers.

MichaelDickens

10y

Couple of important points you're making here.

On your first point, instead of using a single prior distribution I could do a weighted combination of multiple distributions. There are two ways to do this: either have a prior be a combination distribution, or compute multiple posteriors with different distributions and take their weighted average. Not sure which one correctly handles this uncertainty. I haven't done the math but I'd expect that either way, a formulation with distribution probabilities 90% log-normal/10% Pareto will give much more credence to high cost-effectiveness estimates than a pure log-normal. I don't believe it would change the results much to assign small probability to distributions with thinner tails than log-normal (e.g. normal or exponential).

On your second point, yeah I'm including some extra information in the prior, which is kinda wishy-washy. I realize this is suboptimal, but it's better than anything else I've come up with, and probably better than not using a quantitative model at all. Do you know a better way to handle this?

DanielFilan

10y

On your first point, instead of using a single prior distribution I could do a weighted combination of multiple distributions. There are two ways to do this: either have a prior be a combination distribution, or compute multiple posteriors with different distributions and take their weighted average. Not sure which one correctly handles this uncertainty.

Not sure what you mean by a 'combination distribution', but I think something like Carl's suggestion is correct: have a hierarchical model where the type of distribution over effectiveness that you will use is itself a random variable, which the distribution over effectiveness has as a 'hyperparameter'. You could also add a level to the hierarchy by having a distribution over the probabilities for each type of distribution. That being said, it might be convenient to fix these probabilities since it's difficult to put all the evidence you have access to in the model. Probabilistic programming languages are a convenient way to handle such hierarchical models, if you're interested, I recommend checking out this tutorial for an introduction focussing on applications in psychology.

MichaelDickens

10y

Not sure what you mean by a 'combination distribution'

I mean that your prior probability density is given by $P(X) = w_{Pareto} P_{Pareto}(X) + w_{lognorm} P_{lognorm}(X)$ for weights $w$. (You can read LaTeX right?)

DanielFilan

10y

Sure. I think a better thing to do (which I think what Carl is suggesting) is to have a prior distribution over x (the effectiveness of a randomly chosen intervention), and interventionDistribution (a categorical distribution over different shapes you think the space of interventions might have). So P(x, 'Pareto') = P('Pareto') P(x | 'Pareto') = w_{Pareto} P_{Pareto}(x) and P(x, 'logNormal') = P('logNormal') P(x | 'logNormal') = w_{logNormal} P_{logNormal}(x). Then, for the first intervention you see, your prior density over effectiveness is indeed P(x) = w_{Pareto} P_{Pareto}(x) + w_{logNormal} P_{logNormal}(x), but after measuring a bunch of interventions, you can update your beliefs about the empirical distribution of effectivenesses.

MichaelDickens

10y

I wasn't sure if this article was the sort of thing the EA Forum audience is interested in, so let me know. I figured it's better to post and get feedback than to not post and never know.

Habryka [Deactivated]

10y

I am very much in favor of posts like this and would love there to be a lot more posts like this.

Vidur Kapur

10y

I'm very interested in this sort of stuff, though a bit of the maths is beyond me at the moment!

RyanCarey

10y

I'd suggest looking at Carl and Toby's comments on this GiveWell post if you're interested in formulating priors.

CarlShulman

10y

Also note that whereas Holden rejected the Charity Doomsday Argument, clarifying he was talking about relative standing of charities including all flow-through effects (where a big future increases the impact of most interventions astronomically, although some more than others), Dickens embraces it:

I don’t find it plausible that I should be indifferent between $1 to AI safety and $94,200,000,000,000,000 to GiveDirectly...This only considers GiveDirectly’s direct effects and not its flow-through effects, but I still find it implausible that GiveDirectly’s direct effects could matter so much less in expectation than [the flow-through effects of] AI safety work

The specific interventions are a red herring here, it's saying the future won't be big and subject to any effect of our actions (like asteroid defense, or speeding up colonization by 1 day).

This post is also relevant.

Peter Wildeford

10y

I chose values (0.1, 0.3, 0.8, 2.0) because I believe these are approximately the order-of-magnitude standard deviations on the estimates for GiveDirectly, AMF, animal advocacy (in particular corporate campaigns), and x-risk (in particular AI safety) respectively

I'm probably just bad at math, but does a 2.0 SD vs. a 0.1 SD imply that GD is ~20x more robust than AI safety?

Can you elaborate more on how these values are set?

MichaelDickens

10y

It depends on what you mean by "robust".

I chose 0.8 by writing cost-effectiveness estimates of corporate campaigns using 80% credence intervals for the inputs and then calculating the standard deviation of the result. I didn't quite do it that way for GD and AMF; I tried to estimate the standard deviation from looking at GiveWell's historical estimates and its the variation in its employees' current estimates. This was a somewhat rough process but I believe 0.1 and 0.3 are approximately correct.

Peter Wildeford

10y

Also, in your model, QALY improvement is a particularly important cell, but I don't see much quantitative discussion (though there is some qualitative discussion) in the OpenPhil posts. How did you arrive at your number of 0.5 to 1.5, normally distributed? Do you give any credence to Hsiung's view that cage-free is net negative toward hens (though disputed heavily by Bollard)? Do you give any credence to the anti-welfarist argument that cage-free has bad long-term effects on creating a society complacent to some level of harm toward animals?

MichaelDickens

10y

I give some credence to both those things, yes. The anti-welfarist argument doesn't affect this calculation because this calculation only looks at the direct effects of cage-free campaigns, but it does affect my estimate of the long-term value of the campaigns.

The number for QALY improvement is mostly based on my best guess and other people's best guesses; it's hard to say with high accuracy what number we should use.

Peter Wildeford

10y

Thanks. Can you elaborate on what a 1 QALY improvement means in this context? Each chicken's overall life is improved by 1 QALY?

MichaelDickens

10y

It means that the chicken's life is 1 chicken-QALY better per year. There's a separate figure to adjust chicken sentience to human sentience, where I assume that 1 chicken QALY is worth about 0.3 human QALYs.

Peter Wildeford

10y

What does 1 QALY per year mean? Isn't 1 QALY per year already the difference between non-existence and an ideal, healthy life?

MichaelDickens

10y

Yes, so that means if the difference between cage-free and caged is 1 QALY then it's as big a difference as between non-existence and healthy life. So like if I were living on a factory farm for a year, and you gave me the option to reduce my lifespan by 1 year but I get to spend my year on a factory farm without a battery cage, that seems like a reasonable deal to me.

Peter Wildeford

10y

How does the estimate go above 1 QALY/year? Isn't that the maximum possible?

MichaelDickens

10y

No. 1 QALY/year is how good a normal life is. But a life could be better than that, and a life could be more bad than a good life is good. If I'd be willing to give up 10 years of normal life to avert 1 year on a factory farm, then that means a year on a factory farm is worth -10 QALYs.

Peter Wildeford

10y

I can see your model of cage-free campaigns here, how do you translate that standard deviation (37K on one run, 80K on another) into 0.8?

MichaelDickens

10y

The Guesstimate model isn't great, you should look at the one on my spreadsheet instead. My most up-to-date estimate actually has a sigma of 0.56. It's not actually a standard deviation, it's the standard deviation of the log-base-10 of the distribution, which means the difference between the mean and one standard deviation above the mean is 0.56 orders of magnitude.

JesseClifton

10y

Seems like you ought to conduct the analysis with all of the reasonable priors to see how robust your conclusions are, huh?

MichaelDickens

10y

Yeah, I've done some sensitivity analysis to see how the choice of prior affects results. I talk about this some in this essay. In my spreadsheet (which I haven't published yet but will soon), I calculate posteriors for both log-normal and Pareto priors.

RyanCarey

10y

The included estimates might not actually be that accurate, so it might not make sense to use them to construct a prior.

My understanding was that the DCPP estimates were very rough, (for example, one of the estimates for deworming was way off in terms of education benefits, wasn't it) so it wouldn't really make sense to call it a prior based on empirical/field observation.

MichaelDickens

10y

Yep, that's part of the reason why I'm not using those results.

Comments

s	GiveDirectly	AMF	animals	x-risk
0.10	1.03	10.77	1.94e+04	9.69e+39
0.30	1.22	9.36	1.50e+04	7.51e+39
0.80	2.09	6.67	2667.24	1.31e+39
2.00	4.32	6.60	45.21	2.98e+34

s	GiveDirectly	AMF	animals	x-risk
0.10	1.03	10.63	1.90e+04	9.48e+39
0.30	1.23	8.51	1.24e+04	6.21e+39
0.80	1.92	4.99	743.02	3.36e+38
2.00	2.83	3.69	10.10	6.16e+30

s	GiveDirectly	AMF	animals	x-risk
0.10	1.03	10.47	1.85e+04	9.24e+39
0.30	1.24	7.74	9780.87	4.89e+39
0.80	1.80	4.02	185.91	6.16e+37
2.00	2.25	2.74	5.23	1.53e+26

s	GiveDirectly	AMF	animals	x-risk
0.10	1.03	10.21	1.75e+04	8.76e+39
0.30	1.25	6.78	6076.03	3.03e+39
0.80	1.68	3.24	33.64	2.07e+36
2.00	1.88	2.19	3.42	9.42e+16

s	GiveDirectly	AMF	animals	x-risk
0.10	1.00	10.03	1.37e+04	2.89e+38
0.30	1.00	5.83	1453.86	2.58e+29
0.80	1.00	1.96	16.15	1.72e+11
2.00	1.00	1.15	1.79	225.39

On Priors

On Priors

Introduction

Distribution Shape

Intuitive approach

Empirical approach

Theoretical approach

Discussion of approaches

Setting Parameters

Median

Reverse-Engineering Parameters

Pareto prior

Log-normal prior

Observations

Setting Different Priors for Different Interventions

Conclusions

Notes

s	GiveDirectly	AMF	animals	x-risk
0.10	1.00	10.74	1.81e+04	4.02e+39
0.30	1.00	9.02	8828.75	4.98e+36
0.80	1.00	4.32	419.35	2.46e+24
2.00	1.00	1.62	7.25	1.00e+08

s	GiveDirectly	AMF	animals	x-risk
0.10	1.00	10.88	1.91e+04	6.65e+39
0.30	1.00	10.03	1.37e+04	2.89e+38
0.80	1.00	6.47	2231.27	1.39e+31
2.00	1.00	2.37	35.35	2.51e+14

Category	Median
developed-world poor	0.1
global poor	1
factory-farmed animals	10
far future humans	10
wild animals (now and far future)	100
sentient computer programs	100