This is a linkpost for https://confusopoly.com/2019/04/03/the-optimizers-curse-wrong-way-reductions/.
Summary
I spent about two and a half years as a research analyst at GiveWell. For most of my time there, I was the point person on GiveWell’s main cost-effectiveness analyses. I’ve come to believe there are serious, underappreciated issues with the methods the effective altruism (EA) community at large uses to prioritize causes and programs. While effective altruists approach prioritization in a number of different ways, most approaches involve (a) roughly estimating the possible impacts funding opportunities could have and (b) assessing the probability that possible impacts will be realized if an opportunity is funded.
I discuss the phenomenon of the optimizer’s curse: when assessments of activities’ impacts are uncertain, engaging in the activities that look most promising will tend to have a smaller impact than anticipated. I argue that the optimizer’s curse should be extremely concerning when prioritizing among funding opportunities that involve substantial, poorly understood uncertainty. I further argue that proposed Bayesian approaches to avoiding the optimizer’s curse are often unrealistic. I maintain that it is a mistake to try to understand all uncertainty in terms of precise probability estimates.
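To make the curse concrete, here is a minimal simulation (my own illustration, not from the original post). It assumes a set of options with identical true impacts and unbiased but noisy estimates; selecting whichever option looks best still produces systematic disappointment.

```python
# Toy model of the optimizer's curse. Assumptions (mine, for illustration):
# every option has the same true impact, and our estimates are unbiased
# with independent Gaussian noise.
import random

random.seed(0)

N_OPTIONS = 20      # funding opportunities under consideration
TRUE_IMPACT = 1.0   # every option has the same true impact
NOISE_SD = 0.5      # standard deviation of the unbiased estimation error
N_TRIALS = 10_000

winner_estimates = []
for _ in range(N_TRIALS):
    # Unbiased but noisy estimate for each option.
    estimates = [TRUE_IMPACT + random.gauss(0, NOISE_SD) for _ in range(N_OPTIONS)]
    # We fund whichever option looks best.
    winner_estimates.append(max(estimates))

mean_winner_estimate = sum(winner_estimates) / N_TRIALS
print(f"mean estimate of the chosen option: {mean_winner_estimate:.2f}")
print(f"true impact of the chosen option:   {TRUE_IMPACT:.2f}")
# Typical output: the chosen option's estimate (~1.93) far exceeds its true
# impact (1.00), even though every individual estimate was unbiased.
```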
I go into a lot more detail in the full post.
Late to the party, but I was re-reading this as it relates to another post I'm working on, and I realised I have a question. (Note that I say “you” a lot in this comment, but I'd also be interested in anyone else's thoughts on my questions.) You write:
That makes sense to me, and seems a very worthwhile point. (It actually seems to me it might have been worth emphasising more, as I think a casual reader could think this post was a critique of formal/explicit/quantitative models in particular.)
But then in a footnote, you add:
I'm not sure I understand what you mean by that, or whether it's true/makes sense. It seems to me that, ultimately, if we're engaging in a process that effectively provides a ranking of how good the options seem (whether based on cost-effectiveness estimates or just how we "feel" about them), and there's uncertainty involved, and we pick the option that seems to come out on top, the optimizer's curse will be relevant. Even if we use multiple separate informal ways of looking at the problem, we still ultimately end up with a top-ranked option, and, given that that option ended up on top, we should still expect that errors have inflated its apparent value (whether in numerical terms or in terms of how we feel) more than average. Right?
Or did you simply mean that using multiple perspectives makes the various errors and uncertainties more likely to balance out (in the same sort of way that converging lines of evidence based on different methodologies make us more confident that we've really found something real), so that there'd effectively be less uncertainty and the significance of the optimizer's curse would be smaller? (This seems to fit with "the risk of postdecision surprise may be reduced".)
If that's what you meant, that seems reasonable to me, but it seems that we could get the same sort of benefits just by doing something like gathering more data or improving our formal models. (Though of course that may often be more expensive and difficult than cluster thinking, so highlighting that we also have the option of cluster thinking does seem useful.)
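For what it's worth, here's a rough sketch (again my own illustration, under the same toy assumptions as above) of that "balancing out" reading: if each option is scored by several independent, unbiased perspectives and we rank by the average score, the winner's overestimate shrinks roughly in proportion to one over the square root of the number of perspectives, which is the same mechanism by which gathering more data would help.

```python
# Sketch: averaging k independent, unbiased perspectives per option cuts the
# effective noise by a factor of sqrt(k), shrinking the winner's overestimate.
import random

random.seed(0)

N_OPTIONS, TRUE_IMPACT, NOISE_SD, N_TRIALS = 20, 1.0, 0.5, 5_000

def mean_winner_score(k: int) -> float:
    """Average score of the top-ranked option when each option is
    assessed by k independent, unbiased perspectives."""
    total = 0.0
    for _ in range(N_TRIALS):
        scores = [
            sum(TRUE_IMPACT + random.gauss(0, NOISE_SD) for _ in range(k)) / k
            for _ in range(N_OPTIONS)
        ]
        total += max(scores)  # we still pick whatever ranks highest
    return total / N_TRIALS

for k in (1, 4, 16):
    print(f"{k:>2} perspectives -> winner's mean score: {mean_winner_score(k):.2f}")
# Typical output: the winner's overestimate above the true impact of 1.0
# shrinks roughly like 1/sqrt(k): about 0.9, 0.45, and 0.23.
```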
Just saw this comment, I'm also super late to the party responding to you!
Totally agree! Honestly, I had several goals with this post, and I almost completely failed on two of them:
- Arguing why utilitarianism can't be the foundation of ethics.
- Without talking much about AI, explaining why I don't think people in the EA community are being reasonable when they suggest there's a de
... (read more)