Bio

Participation
3

Director of Research at CEARCH: https://exploratory-altruism.org/

I construct cost-effectiveness analyses of various cause areas, identifying the most promising opportunities for impactful work.

Previously a teacher in London, UK.

Sequences
1

Mental Health Report (CEARCH)

Comments
110

Topic contributions
1

I wonder whether it is worth for me to invest significant time in writing comments like mine above. It seems that they are often downvoted, and that I can sometimes tell before hand when this is going to be case. So, to the extent karma is a good proxy for what people value, I wonder whether I am just spending signicant time on doing something which has little value.

I am sad to see your comment getting downvotes as I do think it contributes a lot of value to the discussion.

I can guess why you might be getting them. You often respond to cause-prio posts with "what about corporate campaigns for chicken welfare?", and many people now probably switch off and downvote when they see this. Maybe just keep the chicken comparison to one line and link to your original post/comment?

Also, you comment is 3200 words long - about 3x longer than the actual post. I think a 200-word summary-of-the-comment with bullet points would be really useful for readers who have only read this post and are unable to pick up the finer points of your modeling critique.

On animal welfare

  1. I think that if you adopt RP's moral weight estimates and reject speciesism, it is almost inevitable that the most cost-effective interventions to improve wellbeing will be animal welfare interventions.
  2. My understanding is that CEARCH is not against evaluating animal welfare interventions in principle, but in practice we are not doing so while comparisons between human and animal welfare remain so shaky. Our research direction is also partly driven by the value of information, ie. how much resources we can plausibly redirect and the impact this will have. Maybe this is too deterministic of me, but I feel that banging the drum about corporate chicken campaigns will only open so many wallets.

Hi Mike,

Firstly, thanks to you and all of ALLFED for your willingness to let me prod and poke at your work in the past year.

You make some excellent points and I think they will help readers to decide where they stand on the important cruxes here.

We assign a higher probability that a nuclear conflict occurs compared to your estimates, and also assume that conditional on a nuclear conflict occurring that higher detonation totals are likely. This raises the likelihood and severity of nuclear winters versus your estimates.

For anyone wanting to get up to speed on my nuclear winter model, plus a quick intro to why nuclear cooling is so uncertain, see my just-released nuclear winter post.

  • Weightings of Metaculus, XPT vs individual estimates: we do place high weightings on individual's estimates, which is not ideal. The main reason for this is that the Metaculus and XPT estimates are not calibrated to a 100+ detonation nuclear conflict involving US and/or Russia.
    • The Metaculus estimate still forms around 20% of the total weight. Overall, it seems credible that my final estimate (0.10%) is around a third of the Metaculus estimate (0.30%) for 100+ detonations, given that most experts I spoke to considered India/Pakistan to be one of the main threats of a 100+ detonation nuclear exchange. 
    • The XPT estimate only forms around 2% of the total weight.
  • Distribution of the number of detonations in a 100+ detonation conflict. I have done my best to incorporate different sources of evidence. One expert I spoke to broadly agreed with you, Mike, and guessed that in a 100+ conflict involving US/Russia there would be 80-90% risk of over 1000 detonations. Another expert was less pessimistic and placed higher weight on the possibility of 'moderate' escalation involving hundreds of weapons.
    • I re-ran the numbers just looking at conflicts with over 1000 detonations and found that the cooling levels increase only modestly: the 75th-percentile cooling level increases from 0.66 degrees (100+ detonations) to 1.24 degrees (1000+ detonations). The initial assumption about soot volumes is far more important (see below)
  • Detonation-soot relationship: Yes, the results are extremely sensitive to the initial assumption about how much soot is produced in a small nuclear exchange targeting cities. When I re-ran the numbers under the more pessimistic assumption that soot levels were 10%-100% those predicted by Toon et al., 75th-percentile cooling increased from 0.66 degrees to 3 degrees.

We estimate that the expected mortality from supervolcanic eruptions (VEI 8+) would be comparable to VEI 7 eruptions, so their inclusion could increase cost effectiveness significantly.

I don't exclude supereruptions; I estimate that the right tail of my volcanic cooling model already accounts for them.

We feel that you are selling short the importance of research in building resilience to nuclear winters in particular and ASRSs in general [...] Overall, we see research as the foundation on which you then build the policy work and other actions. Broadening and strengthening this foundation is therefore vital in allowing the work that finally effects change to occur - it isn’t an either/or.

I want to be clear that I recommend that funders prioritize policy advocacy over R&D on the margin at this point in time. I totally agree that advocacy on such an uncertain topic can only be effective if it is grounded in research, and that ALLFED's research will very likely form the foundations of policy work in this area for years to come.

One key takeaway from my analysis is that mild and moderate scenarios form a larger proportion of the threat than the lore of nuclear winter might suggest. Resilient foods would likely have a role to play in these scenarios, but I think the calories at stake in distribution and adaptation are likely to be more pivotal.

One reason for my focus on resilient food pilot studies is that they are a possible next step for ALLFED if it were to receive a funding boost. ALLFED has been ticking along with modest but reliable core funding for some time now, and perhaps I am guilty of taking its theoretical research for granted.

Feel free to set the record straight and give some indication of the kinds of work ALLFED might be interested in accepting funding for.

Good point.

I looked at WTO agreements early in the research process and eventually decided that WTO advocacy was probably not, on the margin, the best way forward.

Food stockholding

I focused on the consequences of the AoA for food stockpiling (known as "stockholding"), the most urgent concern being that it may dissuade countries from stockholding as much as they otherwise would (as suggested by Adin Richards here). Although food reserves would never be big enough to get us through a full catastrophe, they would buy time for countries to adapt to the cooling shock.

The feedback I got from someone with experience at the WTO since the 1990's was

  • The AoA probably isn't holding countries back from stockholding. It does not make any restrictions on how much a country can hold, only on how much a country can spend on subsidizing domestically-produced stocks [while technically true, a lot of authors certainly seem to believe that the AoA affects stockholding, eg. here].
  • This AoA appears to be counterfactually reducing subsidies for only a handful of products in a few countries. India is the only country to have notified exceeding its limit, and only for rice.
  • The AoA seems intractable. The G33, a coalition of LMICs, have pushed for change with no success

On the other hand, basic amendment to the AoA seems obviously needed (imho). The original agreement does not properly allow for inflation. It does not make adequate exceptions for very low-income countries (whose market share is so low that allowing them to subsidize farmers would not be very distortionary).

Overall the WTO seems deadlocked at the moment and suffering a crisis of legitimacy. Tit-for-tat between US and China has led to a breakdown of trust in the organization.

If the US was on-side for AoA amendment, it is possible that the other dissenting countries would fall in line. But it is not clear that the US can be influenced on this. The US is doing fine with the system as it currently is, and has other ways of subsidizing domestic agriculture.

Other WTO theories of change

I don't know much about the implications of the AoA beyond stockholding. 

The most important things is that we ensure trade continues in a catastrophe, which seems congruent with the AoA. The second most important thing is that countries are able to quickly adapt their food systems in a crisis. In a major catastrophe I think all WTO rules would go out of the window. But could the AoA prevent countries from preparing?

  • It may hold them back from stockholding, or producing much more food than the market demands. But these measures would likely be way too expensive for governments to support in the name of preparing for a 1-in-400 event, anyway.
  • It may prevent them from using agri subsidies to foster emerging, resilient food sources that are not economically viable in normal times. But these food sources are at an early stage of development - pilot studies at most - and my guess is that they could be pursued under academic or innovation budgets.

I may well be missing something. Are there other ways the AoA could frustrate resilience work?

Hi Vasco. Thanks for all of your help giving feedback on the report and the modeling underpinning the CEA.

I am going to focus on the main points that you make. I hope to explain why I chose not to adopt the changes you mention in your comment and also to highlight some key weaknesses and limitations of my model.

Points I address (paraphrasing what you said):

  • By asking domain experts you probably got an overestimate for "probability that advocacy succeeds". You should have also asked people in other fields.
  • Although you mention fungibility, you don't account for it in cost-effectiveness estimates. You should be more explicit that fungibility undermines cost-effectiveness of grants that perform other, less effective interventions.
  • You overestimate the mortality effects of mild cooling events: if we apply your model to the 1815 eruption, we get higher mortality rates than actually occurred.
  • Poverty is a strong predictor of famine mortality. If your persistence estimate relied only on poverty & malnutrition burden indicators, the full-term benefits of policy advocacy would be significantly lower

By asking domain experts you probably got an overestimate for "probability that advocacy succeeds". You should have also asked people in other fields.

  • I agree that domain experts are likely to overestimate the probability of successful policy advocacy in their space.
  • In my defence, only two of the seven experts I consulted for estimates worked in food resilience specifically. The geometric mean of their estimates was 30%; only slightly higher than the group average of 24%. The other experts would be best classified as GCR experts (so still likely to be overly optimistic)
  • The difficulty is that people who are not domain experts are (by definition) not well-informed. I don't think people at GiveWell will have an accurate understanding of prospects for ASRS resilience policy advocacy. Especially because this is a very small field.

Although you mention fungibility, you don't account for it in cost-effectiveness estimates. You should be more explicit that fungibility undermines cost-effectiveness of grants that perform other, less effective interventions.

I think that in ideal circumstances, fungibility should be accounted for in cost-effectiveness analysis. But since it depends on the organization receiving the funding, I decided not to do quantitative estimates of fungibility effects in this report. Maybe we will do so when we evaluate specific grants in this area.

I agree that funding to orgs who only do one highly cost-effective thing is generally less fungible.

You overestimate the mortality effects of mild cooling events: if we apply your model to the 1815 eruption, we get higher mortality rates than actually occurred.

I love this analysis, thanks for doing it.

First, let me say that yes, my model is very sensitive to mortality estimates in mild cooling scenarios. My estimate may be too high, but I believe there are compelling reasons not to be confident of this.

To illustrate my model for mortality in a mild cooling event (1-2.65 degrees cooling):

  • 2% probability of 6% mortality (no adaptation, no food trade)
  • 18% probability of 3% mortality (no food trade)
  • 80% probability of no mortality
  • This gives an average of approximately 0.6% mortality

My counterarguments are as follows:

  • On a broad level, I think that a 'panic' scenario could include countries banning food exports to secure domestic supplies. This would be catastrophic for some food-importing countries even in normal climate conditions. The same cannot be said for the world of 1815, where almost all food was consumed locally and very few people lived far from agricultural areas.
  • I think the comparison with 1815 is well worth doing. However, there are a number of reasons why the validity of the comparison is limited:
    • A global 1% mortality event in 1815 may not have even been noticed. We have to patch together estimates of famine mortality in 1815 because there was almost no systematic documentation at the time. People were not aware of any global phenomenon, so nobody was trying to "join up the dots" and tally the full famine impact that year. Especially if much of the effects were felt in South Asia, East Asia or Africa, it is possible that a major-yet-distributed famine could have gone unnoticed
    • Relatively few people in the world of 1815 relied on food imports, as mentioned above[1]. Breakdown in international food trade is the main famine mechanism in modern-day agricultural catastrophes, but it barely applies to the agrarian economies of 1815.
    • Local famine effects may not have been worse in Indonesia. Undoubtedly, the effects of ash blanketing crops would have been worse near the eruption. But stratospheric soot quickly circulates around the world. Furthermore, Indonesia has a warm climate and would not have been at risk from completely failed harvests through unseasonal frost. Multiple annual harvests are common in the tropics; in higher latitudes, a failed crop leaves farmers without food for almost a year.
    • Cooling damage is highly superlinear. The Pinatubo eruption of 1991 caused 0.5 degrees of cooling and is not associated with important declines in agricultural productivity. Thus we might expect the expected burden of an 1815-level cooling event (0.8 to 1.3 degrees cooling) to be far lower than a 1-2.65 degree cooling event[2].

Poverty is a strong predictor of famine mortality. If your persistence estimate relied only on poverty & malnutrition burden indicators, the full-term benefits of policy advocacy would be significantly lower.

To push back:

  • A world with no extreme poverty and no nutritional diseases would still be vulnerable to global agricultural catastrophes. The advantage of my "grain production per capita" metric is that it has no such edge-case problems: more grain is always good for food security.
  • The famine/poverty relationship is strong in normal times. There are a number of reasons that it may become less strong in an agricultural catastrophe
    • Many of the global poor are subsistence farmers in warm countries. This demographic will be close to food supplies and away from the risk of frosts etc.. They may be better-positioned to survive than their middle-class compatriots in cities.
    • People at the bottom of the pile are the most at-risk of post-catastrophe famine, regardless of whether they are in absolute poverty. A wealthier world would simply have higher food prices, leaving the poorest without enough to eat.
    • As described above, the main cause of famine in my model is the breakdown of food trade. It is not clear that progress against poverty and nutritional diseases is an indication that populations are less dependent on food trade. If anything, development is probably associated with increased reliance on trade.

Thanks again for the detailed feedback!!

 

  1. ^

    Admittedly, this would have made some people more vulnerable as it was difficult to relieve famine-stricken areas.

  2. ^

    A counterargument could be that Western Europe appears to have had particularly bad summer cooling in 1816 - as well as better record-keeping than much of the world - and their famines were not so bad. On the other hand, spring cooling may be more important, as late frosts can ruin harvests of wheat, potatoes etc.

Great tool! I couldn't figure how to go back when I mis-clicked an answer, and ended up losing my progress.

I'll rewrite completely because I didn't explain myself very clearly

  • 10,000 participants is possible since they are using Whatsapp, in a large country, and recruiting users does not seem to be a bottleneck
  • 10,000 participants is relevant as it represents the scale they might hope to expand to at the next stage
  • Presumably they used the number 10,000 to estimate the cost-per-treatment by finding the marginal cost per treatment and adding 1/10,000th of their expected fixed costs.
    • So if they were to expand to 100,000 or 1,000,000 participants, the cost-per-treatment would be even lower.

My guess is that a WhatsApp-based MH intervention would be almost arbitrarily scalable. 10 000 participants ($300,000) may reflect the scale of the grants they are looking for.

I would like to push back slightly on your second point: Secondly, isn't it a massive problem that you only look at the 27% that completed the program when presenting results?

By restricting to the people who completed the program, we get to understand the effect that the program itself has. This is important for understanding its therapeutic value.

Retention is also important - it is usually the biggest challenge for online or self-help mental health interventions, and it is practically a given that many people will not complete the course of treatment. 27% tells us a lot about how "sticky" the program was. It lies between the typical retention rates of pure self-help interventions and face-to-face therapy, as we would expect for an in-between intervention like this.

More important than effect size and retention - I would argue - is the topline cost-effectiveness in depression averted per $1,000 or something like that. This we can easily estimate from retention rate, effect size and cost-per-treatment.

Zakat is paid on wealth, not income, so GDP is not a great proxy. Globally there appears to be $450tr in wealth and $100tr in GDP, so perhaps multiplying GDP by 4.5 gives a decent estimate for wealth.

Also global GDP increased 43% between 2010 and 2022.

Thanks for sharing your calculations!

I'd recommend against thinking along the lines of what Muslim wealth would have to be to make the 1tr figure plausible, given that the figure seems to be made up. But I definitely agree with the idea of forming multiple estimates using different approaches.

Some ideas for getting a good figure:

  • The combined GDP of Organisation of Islamic Cooperation countries is 10tr, about 10% of global GDP. This does not include most muslims in countries where they are a minority (importantly, India, Europe and North America) but it does include non-muslims in OIC countries
    • Straight off the bat we can see that $1tr is implausible. 
    • If we assume that OIC countries have 10% of global wealth, this would be $40-70Trn according to your estimate. This implies $1-1.75bn of wealth eligible for Zakat.
    • Where is all that wealth? Given that, as you say, a big chunk of wealth is help by the top 1%, I would guess that most of the wealth is in shares of companies, plus property and other physical assets. The value of such assets can easily be hidden or obfuscated by those who do not wish to pay Zakat on it.
  • It seems like Zakat is enforced by law in Saudi, and collected & used by the state(!) [source 1, 2]. So the $18bn figure is probably quite reliable. 
    • Saudi GDP is about 1% of global GDP. Assuming Saudi therefore has 1% of global wealth (probably an underestimate because oil), and that 20% is a hard limit on the share of global wealth owned by muslims, I would guess 20*$18bn = $360bn is a very hard upper limit on global Zakat
    • More realistically I would assume that 10% of global wealth is owned by Muslims eligible for Zakat and that payment rates outside of Saudi are much lower - I'll say 75% lower[1]. Then global Zakat would be $18bn*(1+9*0.25) = $85.5bn
    • Importantly, it seems like Zakat paid in Saudi is not influencable as it is kept by the state. Therefore my best estimate of the amount of influencable Zakat is $67.5bn worldwide.
  • As a sense-check: where are the Gaza billions? The war in Gaza was a huge huge issue in the Muslim world this past Ramadan, and presumably a lot of Zakat donations went to helping Gazans. If it was ins the tens or hundreds of billions (ie at least 10% of $200+bn), that would be $10,000s for each person in Gaza, which should be easy to spot. But it is not coming up in some brief googling.
  1. ^

    I imagine the main factors are (1) simply underpaying or not paying because it must hurt to give away 2.5% of one's wealth each year and (2) fudging by underrating one's wealth (not counting or being naive about the value of one's house, livestock, car etc.)

Load more