
1

Many readers will be familiar with Peter Singer’s Drowning Child experiment:

On your way to work, you pass a small pond. On hot days, children sometimes play in the pond, which is only about knee-deep. The weather’s cool today, though, and the hour is early, so you are surprised to see a child splashing about in the pond.

As you get closer, you see that it is a very young child, just a toddler, who is flailing about, unable to stay upright or walk out of the pond. You look for the parents or babysitter, but there is no one else around. 

The child is unable to keep her head above the water for more than a few seconds at a time. If you don’t wade in and pull her out, she seems likely to drown. Wading in is easy and safe, but you will ruin the new shoes you bought only a few days ago, and get your suit wet and muddy. By the time you hand the child over to someone responsible for her, and change your clothes, you’ll be late for work. 

What should you do?

Olivia recently thought of a new version of the thought experiment.

As you get closer, you see that it is a very young child, just a toddler, who is flailing about, unable to stay upright or walk out of the pond. You look for the parents or babysitter, and this time, they are around. So are many lifeguards.

At first, you breathe a sigh of relief. But then, you notice something strange: no one is moving. The child continues to drown.

The qualified experts don’t even seem to be noticing. You try to grab their attention—you scream at the lifeguards. But they don’t move.

You’ve never pulled a child out of the water. You’re not even sure you could save the child. Surely, this should be the responsibility of someone more capable than you.

But the lifeguards remain still. 

What should you do?

2

People often ask us for advice as they consider next steps in their careers. Sometimes, we suggest ambitious things that go beyond someone’s default action space.

For example, we might ask Alice, a college junior, if she has considered trying to solve the alignment problem from first principles, founding a new organization in an area unrelated to her major, or doing community-building in a part of the world that she hasn’t visited.

Alice responds, “Wait, why would I do that? There must be people way more qualified than me—isn’t this their responsibility?”

--- 

The issue is not that Alice has decided against any of these options. Any of them might be an awful fit. The issue is that she doesn’t seriously consider them, that she has not given herself permission to seriously evaluate them. She assumes that someone more qualified than her is going to do it.

In EA, that’s often just not the case. It’s good to check, of course. Sometimes, there are competent teams who are taking care of things. 

But sometimes, the imagined team with years of experience is not coming to save us. There is just us. We either build the refuges, or they don’t get built. We either figure out how to align the AI, or we don’t.

Sometimes, there is no plan that works. Sometimes, there is no standard job or internship that you can slot into to ensure a bright future.

3

Before jumping into the pond, you should be mindful of some caveats.

Sometimes, there are competent lifeguards. There are people more capable than you who will do the job. 

Sometimes, there is a good reason why the lifeguards aren’t intervening. Maybe the child isn’t actually drowning. Or maybe jumping into the pond would make the child drown faster. 

Sometimes, jumping in means other lifeguards are less likely to dive in. They might assume that you have the situation under control. They might not want to step on your toes. 

Try to notice these situations. Be mindful of alternative hypotheses for why no one seems to be jumping in. And be on the lookout for ways to mitigate the downsides.

But sometimes, the child is drowning, and the lifeguards aren’t going to save them. 

Either you will save them, or they will drown.

Comments

This is a great story! Good motivational content.

But I do think, in general, a mindset of "only I can do this" is inaccurate and has costs. There are plenty of other people and communities in the world attempting to do good, and often succeeding. I think EAs have been a small fraction of the success in reducing global poverty over the last few decades, for example.

Here are a few costs that seem plausible to me:

  • Knowing when and why others will do things significantly changes estimates of the marginal value of acting. For example, if you are starting a new project, it's reasonably likely that even if you have a completely new idea, other people are in similar epistemic situations to you and will soon stumble upon the same idea. So to estimate your counterfactual impact, you might want to estimate how much earlier something occurs because you made it occur, rather than purely the impact of the thing occurring (see the toy sketch after this list). More generally, neglectedness is a key part of estimating your marginal impact, and estimating neglectedness relies heavily on an understanding of what others are focusing on; usually at least a few people are doing things in a similar space to you.

  • Also, knowing when and why others will do things affects strategic considerations. The fact that few non-EAs work in many of the places where we now try to do good is a result of our attempts to find neglected areas. But, especially in the case of x-risk, we can expect others to begin to do good work in these areas as time progresses (see e.g. AI discussions around warning shots). The extent to which this is the case affects what is valuable to do now.
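To make the first point concrete, here's a toy model of counterfactual impact as acceleration. All the numbers and variable names below are invented purely for illustration, not estimates of anything real:

```python
# Toy model: if someone else would have started the same project anyway,
# just later, your counterfactual impact is closer to the value of the
# acceleration than to the full value of the project.
# All numbers are made up for illustration only.

annual_benefit = 100.0     # value produced per year once the project exists
project_lifetime = 10.0    # years the project remains relevant
years_accelerated = 1.5    # how much earlier it exists because you acted

naive_impact = annual_benefit * project_lifetime            # credits the whole project to you
counterfactual_impact = annual_benefit * years_accelerated  # credits only the acceleration

print(f"Naive estimate:          {naive_impact:.0f}")           # 1000
print(f"Counterfactual estimate: {counterfactual_impact:.0f}")  # 150
```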

I really like these nuances. I think one of the problems with the drowning child parable, and early EA thinking more generally, is that it was (and still is, to a large extent) very focused on the actions of the individual.

It's definitely easier and more accurate to model individual behavior, but I think we (as a community) could do more to improve our models of group behavior even though it's more difficult and costly to do so. 

Minor, but 

Many readers will be familiar with Peter Singer’s Drowning Child experiment:

Should be 

Peter Singer's Drowning Child thought experiment.  

A  "Drowning Child experiment" will be substantially more concerning

This comment co-written with Jake McKinnon:

The post seems obviously true when the lifeguards are the general experts and authorities, who just tend not to see or care about the drowning children at all. It's more ambiguous when the lifeguards are highly-regarded EAs.

  • It's super important to try to get EAs to be more agentic and skeptical that more established people "have things under control." In my model, the median EA is probably too deferential and should be nudged in the direction of "go save the children even though the lifeguards are ignoring them." People need to be building their own models (even if they start by copying someone else's model, which is better than copying their outputs!) so they can identify the cases where the lifeguards are messing up.
  • However, sometimes the lifeguards aren't saving the children because the water is full of alligators or something. Like, lots of the initial ideas that very early EAs have about how to save the child are in fact ignorant about the nature of the problem (a common one is a version of "let's just build the aligned AI first"). If people overcorrect to "the lifeguards aren't doing anything," then when the lifeguards tell them why their idea is dangerous, they'll ignore them.

The synthesis here is something like: it's very important that you understand why the lifeguards aren't saving the children. Sometimes it's because they're missing key information, not personally well-suited to the task, exhausted from saving other children, or making a prioritization/judgment error in a case where you have some reason to think your judgment is better. But sometimes it's the alligators! Most ideas for solving problems are bad, so your prior should be that if you have an idea, and it's not being tried, probably the idea is bad; if you have inside-view reasons to think that it's good, you should talk to the lifeguards to see if they've already considered it or think you will do harm.

Finally, it's worth noting that even when the lifeguards are competent and correctly prioritizing, sometimes the job is just too hard for them to succeed with their current capabilities. Lots of top EAs are already working on AI alignment in not-obviously-misguided ways, but it turns out that it's a very very very hard problem, and we need more great lifeguards! (This is not saying that you need to go to "lifeguard school," i.e. getting the standard credentials and experiences before you start actually helping, but probably the way to start helping the lifeguards involves learning what the lifeguards think by reading them or talking to them so you can better understand how to help.)

Good comment!!


Most ideas for solving problems are bad, so your prior should be that if you have an idea, and it's not being tried, probably the idea is bad;


A key thing here is to be able to accurately judge whether the idea would be harmful if tried. "Prior is bad idea != EV is negative". If the idea is a random research direction, it probably won't hurt anyone if you try it. On the other hand, for example, certain kinds of community coordination attempts deplete a common resource and interfere with other attempts, so the fact that no one else is acting is a reason to hesitate.
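As a rough illustration, here's a toy expected-value calculation with made-up probabilities and payoffs (not a claim about any real project):

```python
# Toy expected-value comparison: two ideas share the same high prior of
# being bad (worthless), but differ in what failure costs.
# All numbers are made up for illustration.

def expected_value(p_good: float, upside: float, downside_if_bad: float) -> float:
    """EV of trying an idea: chance it works times the upside, plus
    chance it fails times whatever harm the failure causes."""
    return p_good * upside + (1 - p_good) * downside_if_bad

# A random research direction: failure mostly wastes your own time.
print(expected_value(p_good=0.1, upside=100, downside_if_bad=0))    # 10.0  -> worth trying

# A botched coordination attempt that depletes a common resource.
print(expected_value(p_good=0.1, upside=100, downside_if_bad=-50))  # -35.0 -> hesitate
```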

Going to people who you think maybe ought to be acting, and asking them why they're not, is probably something that should be encouraged and welcomed? I expect in most cases the answer will be "lack of time" rather than anything more substantial.

In terms of thinking about why solutions haven't been attempted, I'll plug Inadequate Equilibria, though it probably provides a better explanation for why problems in the broader world haven't been addressed. I don't think the EA world is yet in an equilibrium, and so things don't get done because {it's genuinely a bad idea; it seems like the kind of thing you shouldn't do unilaterally and no one has built consensus; sheer lack of time}.

EA groups often get criticized by university students for "not doing anything." The answer usually given (which I think is mostly correct!) is that the vast majority of your impact will come from your career, and university is about gaining the skills you need to do that. I usually say that EA will help you make an impact throughout your life, including after you leave college; the actions people usually think of as "doing things" in college (like volunteering), though they may be admirable, generally don't.

Which is why I find it strange that the post doesn't mention the possibility of becoming a lifeguard.

In this story, the lifeguards aren't noticing. Maybe they're complacent. Maybe they don't care about their jobs very much. Maybe they just aren't very good at noticing. Maybe they aren't actually lifeguards at all, and they just pretend to be lifeguards. Maybe the entire concept of "lifeguarding" is just a farce.

But if it's really just that they aren't noticing, and you are noticing, you should think about whether it really makes sense to jump into the water and start saving children. Yes, the children are drowning, but no, you aren't qualified to save them. You don't know how to swim that well, you don't know how to carry children out of the water, and you certainly don't know how to do CPR. If you really want to save lives, go get some lifeguard training and come back and save far more children.

But maybe the children are dying now, and this is the only time they're dying, so once you become a lifeguard it will be too late to do anything. Then go try saving children now!

Or maybe going to lifeguard school will destroy your ability to notice drowning children. In that case, maybe you should try to invent lifeguarding from scratch.

But unless all expertise is useless and worthless, which it might be in some cases, it's at least worth considering whether you should be focused on becoming a good lifeguard.

Thanks for this post! I always appreciate a pretty metaphor, and I generally agree that junior EAs should be less deferential and more ambitious. Maybe most readers will in fact mostly take away the healthy lesson of "don't defer", which would be great! But I worry a bit about the urgent tone of "act now, it's all on you", which I think can lead in some unhealthy directions.

To me, it felt like a missing mood within the piece was concern for the reader's well-being. The concept of heroic responsibility is in some ways very beautiful and important to me, but I worry that it can very easily mess people up more than it causes them to do good. (Do heroic responsibility responsibly, kids.)

When you feel like there are no lifeguards, and drowning children are everywhere, it's easy to exhaust yourself before you even get to the point of saving anyone at all. I've seen people burn themselves out over projects that, while promising, were really not organized with their sustainable well-being in mind.

If I were to write a version of this piece that reflected my approach to doing good, maybe I'd try to find a different metaphor that framed it more as an iterated game, to make it more natural to say something about conserving your strength / nurturing yourself / marathon-not-a-sprint. 

Some other comments I particularly resonated with: @levin's point about negative side effects due to unilateralist, uninformed action, and @VaidehiAgarwalla's point about implicitly reflecting an Eliezerish view of AI risk. I think the latter is part of what triggered my worry about this post potentially crushing people under the weight of responsibility.

A comment from a friend (I've paraphrased a bit): 

In this post two things stand out: 

  1. This advice seems to be particularly targeted at college students / undergraduates / people early in their careers (based on Section 2), and I expect many undergraduates might read this post.
  2. Your post links to 2 articles from Eliezer Yudkowsky's / MIRI's perspective on AI alignment, which is a (but importantly, not the only) perspective on alignment research that is particularly dire. Also, several people working on alignment do in fact have plans (link to Vanessa Kosoy), even if they are skeptical the plans will work.

The way that these articles are linked assumes they are an accepted view, or presents them in a fairly unnuanced way, which seems concerning, especially coupled with the framing of "we have to save the world" (which Benjamin Hilton has commented on).

How much should you do ‘off your own bat’ (to use the British cricket idiom)? Well, most value comes from people working in their roles, or from working with others to create change, but sometimes there are opportunities that would be missed without an individual going out on a limb.

The real problem is that in large-scale problems like AI safety, progress is usually continuous, not discrete. Thus we can talk about partial alignment, which realistically is the best EA/LessWrong can do. I don't expect them to ever be able to get AI to be particularly moral or to not destabilize society, but existential catastrophe is likely to be avoided.

Also, I'm going to steal part of Vaidehi Agarwalla's comment and improve upon it here:

Your post links to 2 articles from Eliezer Yudkowsky's / MIRI's perspective on AI alignment, which is a (but importantly, not the only) perspective on alignment research, and one that is an outlier in its direness. We have good reason to believe that this is caused by unnecessary discreteness in their framing of the AI alignment problem.
