Impact above Replacement

Replaceability is different from counterfactuals. Pearl, Glymour, and Jewell (2016) describe a counterfactual as "an 'if' statement in which the 'if' portion is untrue or unrealized". Counterfactual reasoning in general involves tallying up all the ways a thing would've gone differently. Replaceability is a special kind of counterfactual reasoning, dealing only with the use (or non-use) of a scarce resource. ↩︎
True, ice time makes this a more subtle calculation. Signing the star left winger means the near-star-calibre left winger gets pushed down to the second line, meaning their (considerable) impact is reduced. But I think it serves as an example of these kinds of considerations mattering in practice. ↩︎
I frame it as "how much effort you add", not "how much impact you have", because impact also depends on other things, in particular the problem areas' relative scale (defined, after 80,000 Hours, as Good Done ÷ % of Problem Solved) and solvability (% of Problem Solved ÷ % Increase in Effort). Focusing on effort alone is cleaner as we can bracket those other concepts. As far as this post is concerned, all problems have the same scale and solvability.

NB. 80,000 Hours calls this "Increase in Resources", but since I'm already using the word "resource" to refer to labour, time and money, I'm calling it "Increase in Effort" instead. ↩︎
Some posts, like this one, point to income and researcher citation count as evidence of this (emphasis mine): "If job performance is like income, or the number of citations people have on academic papers, it is more like a log normal distribution[.] That is, most aspiring academics have few citations, while some have thousands, tens of thousands, or even hundreds of thousands. [...] We're very unsure about this question, and would like to see more research into it. Some evidence we've seen suggests that output is normally distributed even in 'complex' jobs, like being a doctor. However, for the most difficult and creative work, like academic research, we suspect that the variance is high in the tails. Even there, it's hard to be confident since many measures of output (such as citation count) are likely to overstate differences in productivity."

As alluded to in the quoted passage, I think income and citation count aren't good evidence. Even if talent is normally distributed (i.e. follows a bell curve), salary and citations could well have heavy tails due to nonlinear effects later in the causal chain. The Matthew effect – where having an advantage gets you further advantages – applies here too, as well-cited papers are more likely to get further citations independently of quality, and richer people are more likely to get more money regardless of talent. ↩︎
Benjamin Todd calls this a "shared aim community". ↩︎
This model implicitly takes neglectedness and personal fit into account – neglectedness (and importance) is captured by a job's impact level, and personal fit is captured by a person's talent level. ↩︎
Page (2018) writes: "In some cases, we may know the mean of the distribution and also know that all values must be positive. Given those constraints, the maximal entropy distribution must have a long tail, and as we spread the distribution across more values, we must balance high values with many low-value outcomes."

I don't know the means of these talent distributions, but it does seem likely to me that (a) talent can't be negative and (b) the distance between the average talent and zero talent is smaller than the distance between the average talent and the greatest talent. That seems like a pretty good justification for a heavy-tailed distribution. ↩︎
Note that this means that, if the number of people exceeds the number of jobs, you'll tend to have an above-average talent level. If there are 10x as many people as jobs, for example, you're randomly selected from the top 10% of the talent distribution (that is, from above the 90th percentile). ↩︎
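The selection effect in the last note can be checked with a small simulation. This is only a rough sketch under assumptions that are mine, not the post's: talent is drawn from a log-normal distribution with arbitrary parameters, and the jobs go to the most talented people. The function name `hired_talent` is hypothetical.

```python
import random


def hired_talent(num_people, num_jobs, rng):
    """Draw talent levels; return (talents of those hired, all talents)."""
    # Assumption: talent is log-normal with arbitrary parameters (mu=0, sigma=1).
    talents = [rng.lognormvariate(0.0, 1.0) for _ in range(num_people)]
    # Assumption: the available jobs go to the most talented people.
    ranked = sorted(talents, reverse=True)
    return ranked[:num_jobs], talents


rng = random.Random(0)  # fixed seed so the run is repeatable
hired, everyone = hired_talent(num_people=1000, num_jobs=100, rng=rng)

# With 1,000 people, the 900th-lowest talent marks the 90th percentile.
cutoff = sorted(everyone)[899]
population_mean = sum(everyone) / len(everyone)
hired_mean = sum(hired) / len(hired)

print(f"90th-percentile cutoff: {cutoff:.2f}")
print(f"lowest hired talent:    {min(hired):.2f}")
print(f"population mean talent: {population_mean:.2f}")
print(f"hired mean talent:      {hired_mean:.2f}")
```

With 10x as many people as jobs, everyone hired sits at or above the 90th-percentile cutoff, so picking "you" at random from the people with jobs is the same as picking at random from the top 10%.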

Decision	Scarce resource
Choosing a job	Salary, opportunities for direct impact and support from employer and colleagues
Asking someone to be a mentor	Mentor's time and energy
Applying for a grant	Grant, grantmaker's time and grantmaker's connections
Marketing a resource to the community	Prestige and community attention

#	Field A	Field B	Naive	Single Comparison	Replacement	God
1	1K ppl (medium talent), 100 jobs	1K ppl (medium talent), 100 jobs	49.9%	50.7%	50.0%	50.1%
2	1K ppl (high talent), 100 jobs	1K ppl (medium talent), 100 jobs	69.6%	62.3%	61.8%	62.4%
3	1K ppl (medium talent), 100 jobs	100 ppl (medium talent), 10 jobs	50.7%	26.0%	53.4%	49.4%
4	1K ppl (medium talent), 100 jobs	200 ppl (medium talent), 100 jobs	67.9%	55.8%	56.2%	56.5%

Summary

What Replaceability Is

Four Views on Impact

Why Does Replaceability Matter?

Simulations

The Replacement View Seems Useful, Maybe

Appendix: Monte Carlo Simulations

References