Hide table of contents

As the effective altruism (EA) movement continues to grow, so does our emphasis on rigorous impact measurement. While this focus on evidence-based giving is commendable, it's crucial to recognize the significant challenges that social sector organizations face in measuring their impact. 


As someone involved in social sector impact measurement and evaluation for a full career, I would point to some difficulties with the current direction of travel and suggest that the EA community's expectations sometimes need adjustment.

 

The Real-World Challenges of Impact Measurement
 

1. Resource Constraints

Many nonprofits operate on shoestring budgets with limited staff. Conducting thorough impact evaluations can be expensive and time-consuming, potentially diverting resources from direct program work. For a small organization providing urgent services, spending a significant portion of their budget on measurement might mean serving fewer people in need.

 

2. Complexity of Social Issues

Unlike controlled laboratory experiments, social interventions often deal with complex, interconnected problems. Isolating the impact of a single program can be extremely difficult. For instance, an education program's success might depend on factors like family income, nutrition, and community support - all of which are hard to control for.

 

3. Long Time Horizons

Some interventions, particularly those addressing systemic issues, may take years or even decades to show measurable results. This timeframe often exceeds typical funding and reporting cycles. Climate change mitigation efforts, for example, might not show clear impacts for generations.

 

4. Diverse Stakeholder Expectations

Different funders, beneficiaries, and regulatory bodies may have varying (and sometimes conflicting) ideas about what constitutes meaningful impact and how it should be measured. An organization might find itself trying to satisfy multiple, incompatible reporting requirements.

 

5. Lack of Standardization

Unlike financial accounting, there are no universally accepted standards for social impact measurement, making comparisons across organizations challenging. This lack of standardization can make it difficult for donors to make informed decisions.
 

6. Data Collection Challenges

Gathering reliable data in resource-poor or crisis-affected areas can be logistically difficult and sometimes dangerous. Imagine trying to conduct a detailed survey in a conflict zone or a remote rural area with poor infrastructure.

 

Why EA Expectations Can Be Unrealistic

 

Given these challenges, some common expectations in the EA community can be out of step with the realities of the social sector:

 

1. Overemphasis on Quantitative Metrics

While numbers are important, they don't tell the whole story. The EA community sometimes undervalues qualitative data or anecdotal evidence that can provide crucial context.

 

2. Preference for RCTs

Randomized controlled trials (RCTs) are powerful, but they're not always feasible or appropriate. Expecting RCT-level evidence for every intervention is often unrealistic and can bias us towards easily measurable interventions.

 

3. Focus on Short-Term Outcomes

Our desire for clear, attributable impact can lead to a focus on short-term, easily measurable outcomes at the expense of important long-term or systemic changes.

 

4. Underestimating Complexity

Social change is messy and non-linear. Our quest for clear cause-and-effect relationships can sometimes oversimplify complex realities.

 

5. Neglecting Local Knowledge

In our pursuit of "objective" data, we may undervalue the insights of local communities and practitioners who have deep, contextual understanding of the issues.

 

6. Problematic Cross-Domain Comparisons

While it's tempting to directly compare the cost-effectiveness of diverse interventions (e.g., global health vs. climate change mitigation), such comparisons often rely on questionable assumptions and may not be truly meaningful.
 

7. Insufficient Appreciation of Capacity Building

Efforts to improve an organization's ability to deliver and measure impact in the long term may not show immediate results but are crucial for sustained change.

 

Bridging the Gap

 

To address these challenges and set more realistic expectations, the EA community could:
 

1. Invest in Capacity Building

Instead of simply demanding better data, we should help organizations develop their impact measurement capabilities. This could involve funding measurement-focused staff positions or providing training and resources.
 

2. Embrace Diverse Evidence

We need to recognize the value of multiple types of evidence, including qualitative and mixed-methods approaches. A compelling case study or ethnographic research can provide insights that numbers alone cannot.

 

3. Fund Long-Term Evaluations

We should support studies that track impact over many years, acknowledging that significant change takes time. This might mean committing to longer funding cycles or creating pooled funds for longitudinal research.

 

4. Collaborate with Practitioners

Working closely with those on the ground can help us develop practical, context-appropriate measurement approaches. We should view social sector workers as partners, not just as subjects of evaluation.

 

5. Advocate for Shared Standards

Supporting efforts to develop flexible, widely-applicable frameworks for impact measurement in the social sector can help make evaluation more accessible and comparable.

 

Towards More Effective Altruism

 

By recognizing these challenges and adjusting our expectations, we can foster a more constructive dialogue between the EA community and the broader social sector. This doesn't mean abandoning our commitment to evidence and effectiveness. Rather, it means expanding our understanding of what constitutes valuable evidence and meaningful impact.

 

As we refine our approach to impact measurement, we'll be better equipped to support truly effective interventions - even when their impacts are complex, long-term, or hard to measure. This more nuanced perspective will ultimately lead to more effective and sustainable positive impact, bringing us closer to the goals that drive the EA movement.

 

Remember, the aim isn't just to measure impact effectively, but to create real, lasting positive change in the world. By bridging the gap between ideals and realities, we can do both.


 

Comments


No comments on this post yet.
Be the first to respond.
Curated and popular this week
LintzA
 ·  · 15m read
 · 
Cross-posted to Lesswrong Introduction Several developments over the past few months should cause you to re-evaluate what you are doing. These include: 1. Updates toward short timelines 2. The Trump presidency 3. The o1 (inference-time compute scaling) paradigm 4. Deepseek 5. Stargate/AI datacenter spending 6. Increased internal deployment 7. Absence of AI x-risk/safety considerations in mainstream AI discourse Taken together, these are enough to render many existing AI governance strategies obsolete (and probably some technical safety strategies too). There's a good chance we're entering crunch time and that should absolutely affect your theory of change and what you plan to work on. In this piece I try to give a quick summary of these developments and think through the broader implications these have for AI safety. At the end of the piece I give some quick initial thoughts on how these developments affect what safety-concerned folks should be prioritizing. These are early days and I expect many of my takes will shift, look forward to discussing in the comments!  Implications of recent developments Updates toward short timelines There’s general agreement that timelines are likely to be far shorter than most expected. Both Sam Altman and Dario Amodei have recently said they expect AGI within the next 3 years. Anecdotally, nearly everyone I know or have heard of who was expecting longer timelines has updated significantly toward short timelines (<5 years). E.g. Ajeya’s median estimate is that 99% of fully-remote jobs will be automatable in roughly 6-8 years, 5+ years earlier than her 2023 estimate. On a quick look, prediction markets seem to have shifted to short timelines (e.g. Metaculus[1] & Manifold appear to have roughly 2030 median timelines to AGI, though haven’t moved dramatically in recent months). We’ve consistently seen performance on benchmarks far exceed what most predicted. Most recently, Epoch was surprised to see OpenAI’s o3 model achi
Dr Kassim
 ·  · 4m read
 · 
Hey everyone, I’ve been going through the EA Introductory Program, and I have to admit some of these ideas make sense, but others leave me with more questions than answers. I’m trying to wrap my head around certain core EA principles, and the more I think about them, the more I wonder: Am I misunderstanding, or are there blind spots in EA’s approach? I’d really love to hear what others think. Maybe you can help me clarify some of my doubts. Or maybe you share the same reservations? Let’s talk. Cause Prioritization. Does It Ignore Political and Social Reality? EA focuses on doing the most good per dollar, which makes sense in theory. But does it hold up when you apply it to real world contexts especially in countries like Uganda? Take malaria prevention. It’s a top EA cause because it’s highly cost effective $5,000 can save a life through bed nets (GiveWell, 2023). But what happens when government corruption or instability disrupts these programs? The Global Fund scandal in Uganda saw $1.6 million in malaria aid mismanaged (Global Fund Audit Report, 2016). If money isn’t reaching the people it’s meant to help, is it really the best use of resources? And what about leadership changes? Policies shift unpredictably here. A national animal welfare initiative I supported lost momentum when political priorities changed. How does EA factor in these uncertainties when prioritizing causes? It feels like EA assumes a stable world where money always achieves the intended impact. But what if that’s not the world we live in? Long termism. A Luxury When the Present Is in Crisis? I get why long termists argue that future people matter. But should we really prioritize them over people suffering today? Long termism tells us that existential risks like AI could wipe out trillions of future lives. But in Uganda, we’re losing lives now—1,500+ die from rabies annually (WHO, 2021), and 41% of children suffer from stunting due to malnutrition (UNICEF, 2022). These are preventable d
 ·  · 9m read
 · 
TL;DR In a sentence:  We are shifting our strategic focus to put our proactive effort towards helping people work on safely navigating the transition to a world with AGI, while keeping our existing content up. In more detail: We think it’s plausible that frontier AI companies will develop AGI by 2030. Given the significant risks involved, and the fairly limited amount of work that’s been done to reduce these risks, 80,000 Hours is adopting a new strategic approach to focus our efforts in this area.   During 2025, we are prioritising: 1. Deepening our understanding as an organisation of how to improve the chances that the development of AI goes well 2. Communicating why and how people can contribute to reducing the risks 3. Connecting our users with impactful roles in this field 4. And fostering an internal culture which helps us to achieve these goals We remain focused on impactful careers, and we plan to keep our existing written and audio content accessible to users. However, we are narrowing our focus as we think that most of the very best ways to have impact with one’s career now involve helping make the transition to a world with AGI go well.   This post goes into more detail on why we’ve updated our strategic direction, how we hope to achieve it, what we think the community implications might be, and answers some potential questions. Why we’re updating our strategic direction Since 2016, we've ranked ‘risks from artificial intelligence’ as our top pressing problem. Whilst we’ve provided research and support on how to work on reducing AI risks since that point (and before!), we’ve put in varying amounts of investment over time and between programmes. We think we should consolidate our effort and focus because:   * We think that AGI by 2030 is plausible — and this is much sooner than most of us would have predicted 5 years ago. This is far from guaranteed, but we think the view is compelling based on analysis of the current flow of inputs into AI
Recent opportunities in Building effective altruism