This is a special post for quick takes by Mo Putera. Only they can create top-level comments. Comments here also appear on the Quick Takes page and All Posts page.

Why did India's happiness ratings consistently drop so much over time even as its GDP per capita rose?

Epistemic status: confused. Haven't looked into this for more than a few minutes

My friend recently alerted me to an observation that puzzled him: this dynamic chart from Our World in Data's happiness and life satisfaction article, showing how India's self-reported life satisfaction dropped an astounding 1.20 points (from 4.97 to 3.78) between 2011 and 2021, even as its GDP per capita rose 51% (from I$4,374 to I$6,592 in 2017 prices): 

(I included China for comparison to illustrate the sort of trajectory I expected to see for India.)

The sliding year scale on OWID's chart shows how this drop has been consistent and worsening over the years. This picture hasn't changed much recently: the most recent 2024 World Happiness Report reports a 4.05 rating averaged over the 3-year window 2021-23, only slightly above the 2021 rating.

A 1.20-point drop is huge. For context, it's 10x(!) the effect of doubling income (+0.12 LS points, per Clarke et al. 2018 p. 199, via HLI's report), and is comparable to major negative life events like widowhood and extended unemployment: 

The effect of life events on life satisfaction

Given India's ~1.4 billion population, such a large drop is alarming: very roughly ballparked, something like ~5 billion LS-years lost since 2011. For context, and keeping in mind that LS-years and DALYs aren't the same thing, the entire world's DALY burden is ~2.5 billion DALYs p.a. 
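For concreteness, here's one way a figure of that size can be reproduced. This is a minimal sketch with my own assumed numbers (taking the ~1.2-point drop at face value and crediting it for only the last few years), not necessarily how the original ballpark was made:

```python
# Very rough BOTEC, not the original calculation: assume the full ~1.2-point
# drop has persisted for ~3 years and ignore the smaller losses accrued
# during the 2011-2021 ramp-up (which would push the total higher).
population = 1.4e9        # India's population
ls_drop = 1.2             # life-satisfaction points lost vs the 2011 level
years_at_full_gap = 3     # assumed duration at the full gap
ls_years_lost = population * ls_drop * years_at_full_gap
print(f"~{ls_years_lost / 1e9:.0f} billion LS-years")  # ~5 billion
```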

But – again caveating with my lack of familiarity with the literature and extremely cursory look into this – I haven't seen any writeup look into this, which makes me wonder if it's not a 'real issue'? For instance, the 2021 WHR just says

Since 2006-08, world well-being has been static, but life expectancy increased by nearly four years up to 2017-19 (we shall come to 2020 later). The rate of progress differed a lot across regions. The biggest improvements in life expectancy were in the former Soviet Union, in Asia, and (the greatest) in Sub-Saharan Africa. And these were the regions that had the biggest increases in WELLBYs. In Asia, the exception is South Asia, where India has experienced a remarkable fall in Well-being which more than outweighs its improved life expectancy.

That's it: no elaboration, no footnotes, nothing.

So what am I missing? What's going on here? 

A quick search turned up this WEF article (based on Ipsos data and research, not the WHR's Gallup World Poll, so take it with a grain of salt) pointing to

  • increased internet access -> pressure to portray airbrushed lives on social media & a feeling that 'their lives have become meaningless' 
  • covid-19 mitigation-induced isolation curtailing activities that improve wellbeing (employment, socializing, going to school, exercising and accessing health services)
  • urban migration to seek work -> traffic congestion, noise and pollution, demanding bosses -> less sleep and exercise -> higher anxiety and worsening health 

But I'm not sure these factors are differential (i.e. that they, for instance, happen much more in India than elsewhere, such that they explain the difference in wellbeing-vs-development trajectories over 2011-24)?

Interesting! I think figure 2.1 here provides a partial answer. According to the FAQ: 

"the sub-bars show the estimated extent to which each of the six factors (levels of GDP, life expectancy, generosity, social support, freedom, and corruption) is estimated to contribute to making life evaluations higher in each country than in Dystopia. Dystopia is a hypothetical country with values equal to the world’s lowest national averages for each of the six factors (see FAQs: What is Dystopia?). The sub-bars have no impact on the total score reported for each country but are just a way of explaining the implications of the model estimated in Table 2.1. People often ask why some countries rank higher than others—the sub-bars (including the residuals, which show what is not explained) attempt to answer that question."

India seems to score very low on social support, compared to similarly ranked countries.

I did some googling and found this, which shows the sub-factors over time for India. Looks like social support declined a lot, but is now increasing again. 

I haven't checked whether it declined more than in other countries and, if it has, I'm not sure why it has.  

Thank you for the pointer!

Your second link helped me refine my line of questioning / confusion. You're right that social support declined a lot, but the sum of the six key variables (GDP per capita, etc) still mostly trended upwards over time, huge covid dip aside, which is what I'd expect in the India development success story. 

It's the dystopia residual that keeps dropping: from 2.275 - 1.83 = 0.445 in 2015 (i.e. Indians reported 0.445 points higher life satisfaction than you'd predict using the model) to 0.979 - 1.83 = -0.85 more recently. That's an absolute plummeting of life satisfaction across a sizeable fraction of the world's population which, for some reason, isn't explained by the six key variables. Hm... 
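To spell out where those numbers come from, here's my reading of the WHR decomposition behind the chart linked above, with 1.83 read as the Dystopia value used there (a sketch of the arithmetic, not the report's own code):

```python
# WHR-style decomposition, as I understand it:
#   ladder score = Dystopia value + six factor contributions + residual
# The chart reports a combined "Dystopia + residual" sub-bar, so the residual
# is that sub-bar minus the Dystopia value.
dystopia_value = 1.83                        # assumed Dystopia value for these years
residual_2015 = 2.275 - dystopia_value       # +0.445: above what the six factors predict
residual_recent = 0.979 - dystopia_value     # -0.851: below what the six factors predict
print(round(residual_recent - residual_2015, 2))  # ~ -1.3-point swing unexplained by the six variables
```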

(please don't feel obliged to respond – I appreciate the link!)

Could this be related to the rising level of inequality in happiness levels in Asia? (See the graph on page 44 of the WHR2024). It can be assumed that the benefits of GDP growth are not evenly distributed, and increasing inequalities trigger frustration and a decrease in well-being in the majority of the population (since to a certain extent, the sense of welfare is relative).

This is how Our World in Data explains a similar phenomenon in the US: "Income inequality in the US is exceptionally high and has been on the rise in the last four decades, with incomes for the median household growing much more slowly than incomes for the top 10%. As a result, trends in aggregate life satisfaction should not be seen as paradoxical: the income and standard of living of the typical US citizen have not grown much in the last couple of decades."

Yeah rising inequality is a good guess, thank you – the OWID chart also shows the US experiencing the same trajectory direction as India (declining average LS despite rising GDP per capita). I suppose one way to test this hypothesis is to see if China had inequality rise significantly as well in the 2011-23 period, since it had the expected LS-and-GDP-trending-up trajectory. Probably a weak test due to potential confounders... 

This WHO press release was a good reminder of the power of immunization – a new study, forthcoming in The Lancet, reports that (liberally quoting / paraphrasing the release)

  • global immunization efforts have saved an estimated 154 million lives over the past 50 years, 146 million of them children under 5 and 101 million of them infants 
  • for each life saved through immunization, an average of 66 years of full health were gained – with a total of 10.2 billion full health years gained over the five decades
  • measles vaccination accounted for 60% of the lives saved due to immunization, and will likely remain the top contributor in the future 
  • vaccination against 14 diseases has directly contributed to reducing infant deaths by 40% globally, and by more than 50% in the African Region
    • the 14 diseases: diphtheria, Haemophilus influenzae type B, hepatitis B, Japanese encephalitis, measles, meningitis A, pertussis, invasive pneumococcal disease, polio, rotavirus, rubella, tetanus, tuberculosis, and yellow fever
  • fewer than 5% of infants globally had access to routine immunization when the Expanded Programme on Immunization (EPI) was launched 50 years ago in 1974 by the World Health Assembly; today 84% of infants are protected with 3 doses of the vaccine against diphtheria, tetanus and pertussis (DTP) – the global marker for immunization coverage
  • there's still a lot to be done – for instance, 67 million children missed out on one or more vaccines during the pandemic years

As someone predisposed to like modeling, I found the key takeaway from Justin Sandefur's Asterisk essay PEPFAR and the Costs of Cost-Benefit Analysis to be this corrective reminder (emphasis mine, focusing on what changed my mind):

Second, economists were stuck in an austerity mindset, in which global health funding priorities were zero-sum: $300 for a course of HIV drugs means fewer bed nets to fight malaria. But these trade-offs rarely materialized. The total budget envelope for global public health in the 2000s was not fixed. PEPFAR raised new money. That money was probably not fungible across policy alternatives. Instead, the Bush White House was able to sell a dramatic increase in America’s foreign aid budget by demonstrating that several billion dollars could, realistically, halt an epidemic that was killing more people than any other disease in the world. 

...

A broader lesson here, perhaps, is about getting counterfactuals right. In comparative cost-effectiveness analysis, the counterfactual to AIDS treatment is the best possible alternative use of that money to save lives. In practice, the actual alternative might simply be the status quo, no PEPFAR, and a 0.1% reduction in the fiscal year 2004 federal budget. Economists are often pessimistic about the prospects of big additional spending, not out of any deep knowledge of the budgeting process, but because holding that variable fixed makes analyzing the problem more tractable. In reality, there are lots of free variables.

More detail:

Economists’ standard optimization framework is to start with a fixed budget and allocate money across competing alternatives. At a high-level, this is also how the global development community (specifically OECD donors) tends to operate: foreign aid commitments are made as a proportion of national income, entirely divorced from specific policy goals. PEPFAR started with the goal instead: Set it, persuade key players it can be done, and ask for the money to do it.

Bush didn’t think like an economist. He was apparently allergic to measuring foreign aid in terms of dollars spent. Instead, the White House would start with health targets and solve for a budget, not vice versa. ... Economists are trained to look for trade-offs. This is good intellectual discipline. Pursuing “Investment A” means forgoing “Investment B.” But in many real-world cases, it’s not at all obvious that the realistic alternative to big new spending proposals is similar levels of big new spending on some better program. The realistic counterfactual might be nothing at all.

In retrospect, it seems clear that economists were far too quick to accept the total foreign aid budget envelope as a fixed constraint. The size of that budget, as PEPFAR would demonstrate, was very much up for debate.

When Bush pitched $15 billion over five years in his State of the Union, he noted that $10 billion would be funded by money that had not yet been promised. And indeed, 2003 marked a clear breaking point in the history of American foreign aid. In real-dollar terms, aid spending had been essentially flat for half a century at around $20 billion a year. By the end of Bush’s presidency, between PEPFAR and massive contracts for Iraq reconstruction, that number hovered around $35 billion. And it has stayed there since. 

Compared to normal development spending, $15 billion may have sounded like a lot, but exactly one sentence after announcing that number in his State of the Union address, Bush pivoted to the case for invading Iraq, a war that would eventually cost America something in the region of $3 trillion — not to mention thousands of American and hundreds of thousands of Iraqi lives. Money was not a real constraint.

Tangentially, I suspect this sort of attitude (Iraq invasion notwithstanding) would naturally arise out of a definite optimism mindset (that essay by Dan Wang is incidentally a great read; his follow-up is more comprehensive and clearly argued, but I prefer the original for inspiration). It seems to me that Justin has this mindset as well, cf. his analogy to climate change in comparing economists' carbon taxes and cap-and-trade schemes vs progressive activists pushing for green tech investment to bend the cost curve. He concludes: 

You don’t have to give up on cost-effectiveness or utilitarianism altogether to recognize that these frameworks led economists astray on PEPFAR — and probably some other topics too. Economists got PEPFAR wrong analytically, not emotionally, and continue to make the same analytical mistakes in numerous domains. Contrary to the tenets of the simple, static, comparative cost-effectiveness analysis, cost curves can sometimes be bent, some interventions scale more easily than others, and real-world evidence of feasibility and efficacy can sometimes render budget constraints extremely malleable. Over 20 years later, with $100 billion dollars appropriated under both Democratic and Republican administrations, and millions of lives saved, it’s hard to argue a different foreign aid program would’ve garnered more support, scaled so effectively, and done more good. It’s not that trade-offs don’t exist. We just got the counterfactual wrong.

Aside from his climate change example above, I'd be curious to know what other domains economists are making analytical mistakes in w.r.t. cost-benefit modeling, since I'm probably predisposed to making the same kinds of mistakes. 

The 1,000-ton rule is Richard Parncutt's suggestion for reframing the political message of the severity of global warming in particularly vivid human rights terms; it says that someone in the next century or two is prematurely killed every time humanity burns 1,000 tons of carbon. 

I came across this paper while (in the spirit of Nuno's suggestion) trying to figure out the 'moral cost of climate change', so to speak, driven by my annoyance that e.g. climate charity BOTECs reported $ per ton of CO2-eq averted, in contrast to (say) the $ per death averted bottom line of GHW charities, since I don't intrinsically care about averting CO2-equivalent emissions the way I do about averting deaths. (To be clear, I understand why the BOTECs do so and would do the same for work; this is for my own moral clarity.)

Parncutt's derivation is simple: burning a trillion tons of carbon will cause ~2 °C of anthropogenic global warming, which will in turn cause 1 - 10 million premature deaths a year "for a period of several centuries", something like this:  

Modelling the rise in global mean surface temperature (GMST) as a function of carbon burned is already very hard; Parncutt doesn't try to model premature deaths as a function of GMST but just makes a semi-quantitative order-of-magnitude estimation anchored extensively at the lower and upper ends to various catastrophic outcomes discussed in the literature on climate change, and assumes a lognormal distribution around a billion future deaths with a 10x range for worst-vs-best case scenario, which over time looks 'very approximately' like this:

The lower line represents deaths due to poverty without AGW. As the negative effect of AGW overtakes the positive effect of development, the death rate will increase, as shown by the upper line. In a more accurate model, the upper line might be concave upward on the left (exponential increase) and concave downward on the right (approaching a peak).

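Spelling out the arithmetic behind the rule with the round numbers quoted above (the midpoints are my own choices; Parncutt's paper does the estimation more carefully):

```python
# Back-of-envelope reconstruction of the 1,000-ton rule from the quoted figures.
carbon_burned = 1e12     # tons of carbon, enough for ~2 degrees C of warming
deaths_per_year = 3e6    # assumed midpoint of the 1-10 million deaths/year range
centuries = 3            # "a period of several centuries" (assumed)
total_deaths = deaths_per_year * centuries * 100   # ~1e9, i.e. roughly a billion deaths
tons_per_death = carbon_burned / total_deaths      # ~1,000 tons of carbon per death
print(round(tons_per_death))
```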

Based on the 1,000-ton rule, Pearce & Parncutt suggest the 'millilife' as "an accessible unit of measure for carbon footprints that is easy to understand and may be used to set energy policy to help accelerate carbon emissions reductions". A millilife is a measure of intrinsic value defined to be 1/1000th of a human life; the 1,000-ton rule says that burning a ton of fossil carbon destroys a millilife. This lets Pearce & Parncutt make statements like these, at an individual level (all emphasis mine):

For example in Canada, which has some of the highest yearly carbon emissions per capita in the world at around 19 tons of CO2 or 5 tons of carbon per person, roughly 5 millilives are sacrificed by an average person each year. As the average Canadian lives to be about 80, he/she sacrifices about 400 millilives (0.4 human lives) in the course of his/her lifetime, in exchange for a carbon-intensive lifestyle

and 

... an average future AGW-victim in a developing country will lose half of a lifetime or 30–40 life-years, as most victims will be either very young or very old. If the average climate victim loses 35 life-years (or 13,000 life-days), a millilife corresponds to 13 days. 

Stated in another way: if a person is responsible for burning a ton of fossil carbon by flying to another continent and back, they effectively steal 13 days from the life of a future poor person living in the developing world. If the traveler takes 1000 such trips, they are responsible for the death of a future person.

and for "large-scale energy decisions":

... the Adani Carmichael coalmine in Queensland, Australia, is currently under construction and producing coal since 2021. Despite massive protests over several years, it will be the biggest coalmine ever. Its reserves are up to 4 billion tons of coal, or 3 billion tons of carbon. If all of that was burned, the 1000-tonne rule says it would cause the premature deaths of 3 million future people. Given that the 1000-tonne rule is only an order-of-magnitude estimate, the number of caused deaths will lie between one million and 10 million. ... Many of those who will die are already living as children in the Global South; burning Carmichael coal will cause their future deaths with a high probability. Should energy policy allow that to occur?

Pearce & Parncutt then use the 1,000-ton rule and millilife to make various suggestions. Here's one:

Under what circumstances might a government ban or outlaw an entire corporation or industry, considered a legal entity or person—for example, the entire global coal industry? ...

Ideally, a company should not cause any human deaths at all. If it does, those deaths should be justifiable in terms of improvements to the quality of life of others. For example, a company that builds a bridge might reasonably risk a future collapse that would kill 100 people with a probability of 1%. In that case, the company accepts that on average one future person will be killed as a result of the construction of the bridge. It may be reasonable to claim that the improved quality of life for thousands or millions of people who cross the bridge justifies the human cost.

Fossil fuel industries are causing far more future deaths than that, raising the question of the point at which the law should intervene. As a first step to solving this problem, it has been proposed a rather high threshold (generous toward the corporations) is appropriate. A company does not have the right to exist if its net impact on human life (e.g., a company/industry might make products that save lives like medicine but do kill a small fraction of users) is such that it kills more people than it employs. This requirement for a company’s existence is thus:

Number of future premature deaths/year < Number of full-time employees (1)

This criterion can be applied to an entire industry. If the industry kills more people than it employs, then primary rights (life) are being sacrificed for secondary rights (jobs or profits) and the net benefit to humankind is negative. If an industry is not able to satisfy Equation (1), it should be closed down by the government

... the coal industry kills people by polluting the air that they breathe. ... In the U.S., about 52,000 human lives are sacrificed per year to provide coal-fired electricity. ... In the U.S., coal employed 51,795 people in 2016. Since the number of people killed is greater than the number employed, the U.S. coal industry does not satisfy Equation (1) and should be closed down. This conservative conclusion does not include future deaths caused by climate change due to burning coal. 

One more energy policy suggestion (there's many more in the paper): 

Applying asset forfeiture laws (also referred to as asset seizure) to manslaughter caused by AGW. These laws enable the confiscation of assets by the U.S. government as a type of criminal-justice financial obligation that applies to the proceeds of crime. Essentially, if criminals profit from the results of unlawful activity, the profits (assets) are confiscated by the authorities. 

This is not only a law in the U.S. but is in place throughout the world. For example, in Canada, Part XII.2 of the Criminal Code, provides a national forfeiture régime for property arising from the commission of indictable offenses. Similarly, ‘Son of Sam laws’ could also apply to carbon emissions. In the U.S., Son of Sam laws refer to laws designed to keep criminals from profiting from the notoriety of their crimes and often authorize the state to seize funds earned by the criminals to be used to compensate the criminal’s victims.

If that logic of asset forfeiture is applied to fossil fuel company investors who profit from carbon-emission-related manslaughter, taxes could be set on fossil fuel profits, dividends, and capital gains at 100% and the resultant tax revenue could be used for energy efficiency and renewable energy projects or to help shield the poor from the most severe impacts of AGW. ...

Such AGW-focused asset forfeiture laws would also apply to fossil fuel company executive compensation packages. Energy policy research has shown that it is possible to align energy executive compensation with careful calibration of incentive equations such that the harmful effects of emissions can be prevented through incentive pay. Executives who were compensated without these safeguards in place would have their incomes seized the same as other criminals benefiting materially from manslaughter.

I have no (defensible) opinion on these suggestions; curious to know what anyone thinks. 

[Question] How should we think about the decision relevance of models estimating p(doom)?

(Epistemic status: confused & dissatisfied by what I've seen published, but haven't spent more than a few hours looking. Question motivated by Open Philanthropy's AI Worldviews Contest; this comment thread asking how OP updated reminded me of my dissatisfaction. I've asked this before on LW but got no response; curious to retry, hence repost) 

To illustrate what I mean, switching from p(doom) to timelines: 

  • The recent post AGI Timelines in Governance: Different Strategies for Different Timeframes was useful to me in pushing back against Miles Brundage's argument that "timeline discourse might be overrated", by showing how choice of actions (in particular in the AI governance context) really does depend on whether we think that AGI will be developed in ~5-10 years or after that. 
  • A separate takeaway of mine is that decision-relevant estimation "granularity" need not be that fine-grained, and in fact is not relevant beyond simply "before or after ~2030" (again in the AI governance context). 
  • Finally, that post was useful to me in simply concretely specifying which actions are influenced by timelines estimates.  

Question: Is there something like this for p(doom) estimates? More specifically, following the above points as pushback against the strawman(?) that "p(doom) discourse, including rigorous modeling of it, is overrated":

  1. What concrete high-level actions do most alignment researchers agree are influenced by p(doom) estimates, and would benefit from more rigorous modeling (vs just best guesses, even by top researchers e.g. Paul Christiano's views)?
  2. What's the right level of granularity for estimating p(doom) from a decision-relevant perspective? Is it just a single bit ("below or above some threshold X%") like estimating timelines for AI governance strategy, or OOM (e.g. 0.1% vs 1% vs 10% vs >50%), or something else?
    • I suppose the easy answer is "the granularity depends on who's deciding, what decisions need making, in what contexts", but I'm in the dark as to concrete examples of those parameters (granularity i.e. thresholds, contexts, key actors, decisions)
    • e.g. reading Joe Carlsmith's personal update from ~5% to >10% I'm unsure if this changes his recommendations at all, or even his conclusion – he writes that "my main point here, though, isn't the specific numbers... [but rather that] here is a disturbingly substantive risk that we (or our children) live to see humanity as a whole permanently and involuntarily disempowered by AI systems we’ve lost control over", which would've been true for both 5% and 10%

Or is this whole line of questioning simply misguided or irrelevant?


Some writings I've seen gesturing in this direction:

  • harsimony's argument that Precise P(doom) isn't very important for prioritization or strategy ("identifying exactly where P(doom) lies in the 1%-99% range doesn't change priorities much") amounts to the 'single bit granularity' answer
    • Carl Shulman disagrees, but his comment (while answering my 1st bullet point) isn't clear in the way the different AI gov strategies for different timelines post is, so I'm still left in the dark – to (simplistically) illustrate with a randomly-chosen example from his reply and making up numbers, I'm looking for statements like "p(doom) < 2% implies we should race for AGI with less concern about catastrophic unintended AI action, p(doom) > 10% implies we definitely shouldn't, and p(doom) between 2-10% implies reserving this option for last-ditch attempts", which he doesn't provide
  • Froolow's attempted dissolution of AI risk (which takes Joe Carlsmith's model and adds parameter uncertainty – inspired by Sandberg et al's Dissolving the Fermi paradox – to argue that low-risk worlds are more likely than non-systematised intuition alone would suggest) 
    • Froolow's modeling is useful to me for making concrete recommendations for funders, e.g. (1) "prepare at least 2 strategies for the possibility that we live in one of a high-risk or low-risk world instead of preparing for a middling-ish risk", (2) "devote significantly more resources to identifying whether we live in a high-risk or low-risk world", (3) "reallocate resources away from macro-level questions like 'What is the overall risk of AI catastrophe?' towards AI risk microdynamics like 'What is the probability that humanity could stop an AI with access to nontrivial resources from taking over the world?'", (4) "When funding outreach / explanations of AI Risk, it seems likely it would be more convincing to focus on why this step would be hard than to focus on e.g. the probability that AI will be invented this century (which mostly Non-Experts don’t disagree with)". I haven't really seen any other p(doom) model do this, which I find confusing 
  • I'm encouraged by the long-term vision of the MTAIR project "to convert our hypothesis map into a quantitative model that can be used to calculate decision-relevant probability estimates", so I suppose another easy answer to my question is  just "wait for MTAIR", but I'm wondering if there's a more useful answer to the "current SOTA" than this. To illustrate, here's (a notional version of) how MTAIR can help with decision analysis, cribbed from that introduction post: 

This question was mainly motivated by my attempt to figure out what to make of people's widely-varying p(doom) estimates, e.g. in the appendix section of Apart Research's website, beyond simply "there is no consensus on p(doom)". I suppose one can argue that rigorous p(doom) modeling helps reduce disagreement on intuition-driven estimates by clarifying cruxes or deconfusing concepts, thereby improving confidence and coordination on what to do, but in practice I'm unsure if this is the case (reading e.g. the public discussion around the p(doom) modeling by Carlsmith, Froolow, etc), so I'm not sure I buy this argument, hence my asking for concrete examples.

One of the more surprising things I learned from Karen Levy's 80K podcast interview on misaligned incentives in global development was how her experience directly contradicted a stereotype I had about for-profits vs nonprofits: 

Karen Levy: When I did Y Combinator, I expected it to be a really competitive environment: here you are in the private sector and it’s all about competition. And I was blown away by the level of collaboration that existed in that community — and frankly, in comparison to the nonprofit world, which can be competitive. People compete for funding, and so very often we’re fighting over slices of the same pie. Whereas the Y Combinator model is like, “We’re making the pie bigger. It’s getting bigger for everybody.”

My assumption had been that the opposite was true. 

The following table is from Scott Alexander's post, which you should check out for the sources and (many, many) caveats. 

This table can’t tell you what your ethical duties are. I'm concerned it will make some people feel like whatever they do is just a drop in the bucket - all you have to do is spend 11,000 hours without air conditioning, and you'll have saved the same amount of carbon an F-35 burns on one airstrike! But I think the most important thing it could convince you of is that if you were previously planning on letting yourself be miserable to save carbon, you should buy carbon offsets instead. Instead of boiling yourself alive all summer, spend between $0.04 and $2.50 an hour to offset your air conditioning use.

I like John Salter's post on schlep blindness in EA (inspired by Paul Graham's eponymous essay), whose key takeaway is 

Pay close attention to ideas that repel other people for non-impact-related reasons, but not you. If you can get obsessed about something important that most people find horribly boring, you're uniquely well placed to make a big impact.

Unfortunately it's bereft of concrete examples. The closest to a shortlist he shares is in this comment:

  • Horrible career moves e.g. investigating the corrupt practices of powerful EAs / Orgs
  • Boring to most people e.g. compiling lists and data
  • Low status outside EA e.g. welfare of animals nobody cares about (e.g. shrimp)
  • Low status within EA e.g. global mental health
  • Living in relatively low quality of living areas e.g. fieldwork in many African countries

(I disagree with some of these; e.g. the first bullet seems contradicted by the propensity for forum drama on adjacent topics, and as someone who likes compiling lists and data I don't actually see much low-hanging fruit for me to contribute here due to the work of e.g. Hamish)

I'd be keen to learn other examples. He does give this advice to brainstorm examples:

What work do you wish someone else would do?

although in my case it's not useful because I either just end up doing it (or trying, failing, and learning why), or discover that it's already been done better than I could (e.g. Rethink Priorities' new CCM).

That said, I still think the original takeaway is a useful reminder. 

I'm curious what people who're more familiar with infinite ethics think of Manheim & Sandberg's What is the upper limit of value?, in particular where they discuss infinite ethics (emphasis mine):

Bostrom’s discussion of infinite ethics is premised on the moral relevance of physically inaccessible value. That is, it assumes that aggregative utilitarianism is over the full universe, rather than the accessible universe. This requires certain assumptions about the universe, as well as being premised on a variant of the incomparability argument that we dismissed above, but has an additional response which is possible, presaged earlier. Namely, we can argue that this does not pose a problem for ethical decision-making even using aggregative ethics, because the consequences of any ethical decision can have only a finite (difference in) value. This is because the value of a moral decision relates only to the impact of that decision. Anything outside of the influenced universe is not affected, and the arguments above show that the difference any decision makes is finite.

I first read their paper a few years ago and found their arguments for the finiteness of value persuasive, as well as their collectively-exhaustive responses in section 4 to possible objections. So ever since then I've been admittedly confused by claims that the problems of infinite ethics still warrant concern w.r.t. ethical decision-making (e.g. I don't really buy Joe Carlsmith's arguments for acknowledging that infinities matter in this context, same for Toby Ord's discussion in a recent 80K podcast). What am I missing?

Sandberg's recent 80K podcast interview transcript has this quote:

Rob Wiblin: OK, so the argument is something like valuing is a process that requires information to be encoded, and information to be processed — and there are just maximum limits on how much information can be encoded and processed given a particular amount of mass and given a finite amount of mass and energy. So that ultimately is going to set the limit on how much valuing can be done physically in our universe. No matter what things we create, no matter what minds we generate, there’s going to be some finite limit there. That’s basically it?

Anders Sandberg: That’s it. In some sense, this is kind of trivial. I think some readers would no doubt feel almost cheated, because they wanted to know that metaphysical limit for value, and we can’t say anything about that. But it seems very likely that if value has to have to do with some entity that is doing the valuing, then there is always going to be this limit — especially since the universe is inconveniently organised in such a way that we can’t get hold of infinite computational power, as far as we know.

I just learned about Tom Frieden via Vadim Albinsky's writeup Resolve to Save Lives Trans Fat Program for Founders Pledge. His impact in sheer lives saved is astounding, and I'm embarrassed I didn't know about him before: 

The CEO of RTSL, Tom Frieden, likely prevented tens of millions of deaths by creating an international tobacco control initiative in a prior role that may have been much more cost effective than most of our top recommended charities. ...

We believe that by leveraging his influence with governments, and the relatively low cost of advocating for regulations to improve health, Tom Frieden has the potential to again save a vast number of lives at a low cost. 

How many more? Albinsky estimates:

RTSL is aiming to save 94 million lives over 25 years by advocating for countries to implement policies to reduce non-communicable diseases. We believe the industrially-produced trans fat elimination program is the most cost-effective of their initiatives. ... Even after very conservative discounts to RTSL's impact projections we estimate this program to be more cost effective than most of our top global health and development recommendations.

Tangentially, if a "Borlaug" is a billion lives saved, then Frieden's impact is probably on the scale of ~100 milliBorlaugs (to nearest OOM). Bill and Melinda likely have had similar impact. This makes me wonder who else I don't know about who's done ~100 milliBorlaugs of good. 

(It's arguably unfair to wholly attribute all those lives saved to Frieden, and I am honestly unsure what credit attribution makes most sense, but applying the same logic to Borlaug you can no longer really say he saved a billion lives.)

Notes from Ozy Brennan's On capabilitarianism 

  • Martha Nussbaum's first-draft list of central capabilities (for humans)
    • Life
    • Bodily health
    • Bodily integrity
    • Senses, Imagination, and Thought
    • Emotions
    • Practical reason
    • Affiliation 
    • Other species
    • Play
    • Control over political & material environment
  • the Five Freedoms (for animals) 
    • Freedom from hunger and thirst
    • Freedom from discomfort
    • Freedom from pain, injury, and disease
    • Freedom to express normal behavior
    • Freedom from fear and distress

Some notes from trying out Rethink Priorities' new cross-cause cost-effectiveness model (CCM) from their post, for personal reference:

Format of each line below: cost-effectiveness in DALYs per $1k (90% CI) / share of simulation results with positive, negative, or no effects / range (min to max) of cost-effectiveness in weighted DALYs per $1k under the model's alternative risk-aversion profiles and weighting schemes 

  • GHD: 
    • US govt GHD: 1 (range: 0.85 - 1.22) / 100% positive / risk 1 - 1
    • Cash: 1.7 (range 1.1 - 2.5) / 100% positive / risk 1 - 2
    • GW bar: 21 (range: 11 - 42) / 100% positive / risk 16 - 21 (OP bar has ~similar figures)
    • Good intervention (per OP & GW): 39 (range: 15 - 67) / 100% positive / risk 31 - 39
  • AW - generic interventions: 
    • Black soldier fly: 5.6 (range: 95% below 11.4) / 16% positive, 84% no effect / risk 0 - 6
    • Shrimp: 7.8 (range: 95% below 8.0) / 19% positive, 81% no effect / risk 0 - 8
    • Carp: 36 (range: 95% below  145) / 31% positive, 69% no effect / risk 2 - 36
    • Chicken: 719 (range: 95% below 2,100) / 81% positive, 19% no effect / risk 221 - 717
  • x-risk:
    • Portfolio of biorisk projects ($15-30M budget, 60% chance no effect, 70% effect is positive): 132 (middle 99.9% of expected utility is 0) / >99.9% no effect / risk 0 - 132
    • Nanotech safety megaproject ($10-30M budget, 90% chance no effect, 70% effect is positive): 73 (middle 99.9% of EU is 0) / >99.9% no effect / risk -10 - 73
    • AI misalignment megaproject ($8-28B budget, 97.3% chance no effect, 70% effect is positive): 154 (middle 99.9% of EU is 27, 99% is 0) / >99.6% no effect / risk -56 - 154
  • Some things that jumped out at me (caveating that I don't work in any of these areas):
    • I'm a little surprised that only chicken campaigns are modeled as clearly higher EV (OOM-wise) than GHD interventions considered good by GW & OP's lights, while interventions for other nonhuman animals fall short
    • I'm also surprised that chickens > all other nonhuman animals on both EV and p(+ve simulation outcome). There's some discussion indicating that cage-free work is much lower EV now than previously, although I'm not sure if it changes the takeaway (and in any case funding prioritization shouldn't be purely EV-based)
    • I'm surprised yet again that a >$10B AI misalignment megaproject is modeled as having no effect in >99.6% of simulations. I probably hadn't internalized the 'hits' in 'hits-based giving' as well as I should have, since my earlier gut intuition (based on no data whatsoever) was that a near-Manhattan-scale megaproject would surely have some effect in >10% of possible worlds
    • I didn't expect the model to say chickens > misaligned AI, unsafe nanotech and biorisk from a risk-neutral EV perspective. That said, the x-risk inputs are in some sense just placeholders, so I don't put much weight in this

In any case, I'd be curious to see how the CCM is taken into consideration by funders and other stakeholders going forward.

I thought I had mostly internalized the heavy-tailed worldview from a life-guiding perspective, but reading Ben Kuhn's searching for outliers made me realize I hadn't. So here are some summarized reminders for posterity:  

  • Key idea: lots of important things in life are generated by multiplicative processes resulting in heavy-tailed distributions – jobs, employees / colleagues, ideas, romantic relationships, success in business / investing / philanthropy, how useful it is to try new activities  
  • Decision relevance to living better, i.e. what Ben thinks I should do differently:
    • Getting lots of samples improves outcomes a lot, so draw as many samples as possible (see the sketch after this list)
    • Trust the process and push through the demotivation of super-high failure rates (instead of taking them as evidence that the process is bad)
    • But don't just trust any process; it must have 2 parts: (1) a good way to tell if a candidate is an outlier ("maybe amazing" below) (2) a good way to draw samples 
    • Optimize less, draw samples more (for a certain type of person)
    • Filter for "maybe amazing", not "probably good", as they have different traits
    • Filter for "ruling in" candidates, not "ruling out" (e.g. in dating)
    • Cultivate an abundance mindset to help reject more candidates early on (to find 99.9th percentile not just 90th)
    • Think ahead about what outliers look like, to avoid accidentally rejecting 99.9th percentile candidates out of miscalibration, by asking others based on their experience 
  • My reservations with Ben's advice, despite thinking they're mostly sound and idea-generating:
    • "Stick with the process through super-high failure rates instead of taking them as evidence that the process is bad" feels uncomfortably close to protecting a belief from falsification
    • Filtering for "maybe amazing", not "probably good" makes me uncomfortable because I'm not risk-neutral (e.g. in RP's CCM I'm probably closest to "difference-making risk-weighted expected utility = low to moderate risk aversion", which for instance assesses RP's default AI risk misalignment megaproject as resulting in, not averting, 300+ DALYs per $1k)
    • Unlike Ben, I'm a relatively young person in a middle-income country, and the abundance mindset feels privileged (i.e. not as much runway to try and fail) 
  • So maybe a precursor / enabling activity for the "sample more" approach above is "more runway-building": money, leisure time, free attention & health, proximity to opportunities(?)
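
Here's a toy simulation of the "draw more samples" point (my own illustrative parameters, not from Ben's post): when candidate quality is heavy-tailed (e.g. lognormal, as multiplicative processes produce), the best of N keeps improving substantially as N grows, whereas under a thin-tailed (normal) distribution it quickly plateaus.

```python
import random

random.seed(0)

def best_of(n, draw):
    """Best candidate found after drawing n samples."""
    return max(draw() for _ in range(n))

# Toy candidate-quality distributions with roughly matched means (arbitrary parameters):
heavy_tailed = lambda: random.lognormvariate(0, 1)  # multiplicative process -> heavy tail
thin_tailed = lambda: random.gauss(1.65, 1)         # additive process -> thin tail

for n in (10, 100, 1000):
    print(n, round(best_of(n, heavy_tailed), 1), round(best_of(n, thin_tailed), 1))
# Typical output: the heavy-tailed "best" keeps growing a lot as n increases,
# while the thin-tailed "best" gains only a little past n = 100.
```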

Michael Dickens' 2016 post Evaluation Frameworks (or: When Importance / Neglectedness / Tractability Doesn't Apply) makes the following point I think is useful to keep in mind as a corrective:

INT has its uses, but I believe many people over-apply it. 

Generally speaking (with some exceptions), people don’t choose between causes, they choose between interventions. That is, they don’t prioritize broad focus areas like global poverty or immigration reform. Instead, they choose to support specific interventions such as distributing deworming treatments or lobbying to pass an immigration bill. The INT framework doesn’t apply to interventions as well as it does to causes. In short, cause areas correspond to problems, and interventions correspond to solutions; INT assesses problems, not solutions.

(aside: Michael Plant makes the same point in chapters 5 & 6 of his PhD thesis as per Edo Arad's post, using it as a starting point to develop a systematic cause prio approach he called 'cause mapping')

In most cases, we can try to directly assess the true marginal impact of investing in an intervention. These assessments will never be perfectly accurate, but they generally seem to tell us more than INT does. ... 

How can we estimate an intervention’s impact more directly? To develop a better framework, let’s start with the final result we want and work backward to see how to get it.

Dickens' post has more; the framework they end up with is this:

which (somewhat less practically, they note) could be fine-grained further:

I also appreciated that Dickens actually used this framework to guide their giving decision (more details in their post).

Just came across Max Dalton's 2014 writeup Estimating the cost-effectiveness of research into neglected diseases, part of Owen Cotton-Barratt's project on estimating cost-effectiveness of research and similar activities. Some things that stood out to me:

  • High-level takeaways
    • ~100x 95% CI range (mostly from estimates of total current funding to date, and difficulty of continuing with research), so figures below can't really argue for change in priorities so much as compel further research 
      • This uncertainty is a lower bound, including only statistical uncertainty and not model uncertainty 
    • Differing returns to research are largely driven by disease burden size, so look at diarrheal diseases, malaria, hookworm, ascariasis, trichuriasis, lymphatic filariasis, meningitis, typhoid, and salmonella – i.e. nothing too surprising 
  • Estimated figures: 
    • 13.9 DALYs/$1k for the sector as a whole (vs ~20 DALYs/$1k for GWWC top charities back in 2014), 95% CI 1.43-130 DALYs/$1k 
    • Median estimates: diarrheal disease e.g. cholera and dysentery 121 DALYs/$1k, salmonella infections 74 DALYs/$1k, worms ~50 DALYs/$1k, leprosy 0.058 DALYs/$1k
    • Most of the top diseases have ~100x 95% CI range, except salmonella whose range is ~3,000x(!) 

List of charities providing humanitarian assistance in the Israel-Hamas war mentioned in response to this request, for posterity and ease of reference:
