The Top AI Safety Bets for 2023: GiveWiki’s Latest Recommendations

Dawn Drescher

The Top AI Safety Bets for 2023: GiveWiki’s Latest Recommendations

Comments 4

Sorted by

New & upvoted

calebp

I feel very confused as to how I should update based on these results, how the ranking was made and what the ranking means.

A few concrete questions

are credits tracking cost-effectiveness on the current margin or something else?
is a project with 2 credits twice as [cost-effective] as a project with 1 credit?

Is the qualitative summary a pretty good depiction of the main points that contributed to that project's scores?

E.g.

AI Safety Support AI Alignment Slack: Invaluable for information distribution. One evaluator mentioned the numerous times that they found out about opportunities through this Slack.

Lots of Links page: “The best collection of resources we currently have,” but with a big difference between the quality and the impact score: “It could be better organized and more up to date (even at a time when it was still maintained).”

Dawn Drescher

First it could make sense not to focus too much on the credits. The ranking has to bottom out somewhere, and that's where the credits come into it, to establish a track record for our donors. The ranking itself is better thought of as the level of endorsement of a project weighed by the track record of the endorsing donors.

We're still thinking about how we want to introduce funding goals and thus some approximation of short-term marginal utility. At the moment all projects discount donations at the same rate. Ideally we'd be able to use something like the S-Process to generate marginal utility curves that discount the score “payout” that donors can get. I've experimented with funding goals around $100k per project, and 10x sharper discounts afterwards, but it hadn't made enough of a difference that would've legitimized the increased complexity and assumptions. Maybe we'll revive that feature as a configurable funding goal at some point. But there is also the fundamental problem that we don't have access to complete lists of donations, so less popular, less well-maintained projects would seemingly have higher marginal utility just because their donation records are more incomplete. That would be an annoying incentive to introduce. Those problems paired with the minor, unconvincing results of my experiments have caused me not to prioritize this yet.

But when it comes to the credits, the instructions to the evaluators are probably a good guide:

“Imagine that you’re given a budget of 1,000 impact credits to allocate across projects (not artifacts). (a) Please allocate them to the projects in proportion to how impactful you think they were. (b) There’s no transaction cost and no change in marginal utility (the first credit is worth the same as the last to a project). (c) We’ll … average your scores, multiply the averages with the number of evaluators, normalize, and then allocate that product to minimize the impact of recusals.”
And further down: “Note that this is a purely retroactive evaluation. (a) You can ignore the tractability of producing an output since they’ve all been produced. (b) Likewise please ignore the cost at which the output was produced. (c) Do consider neglectedness, though, and consider how likely some equivalent output would’ve been produced anyway had it not been for the given project. (d) Consider the ex ante expected utility. A bullshit project mustn’t get a high score because it somehow got unpredictably lucky. (Fictional examples.)”

So like everything in our evaluation, the credits are retroactive too, so they are not about the current margin. One reason to ignore costs is that we don't have the data, though we might request or estimate it next time around. But the other reason is that the donors to overly expensive projects have already gotten “punish” for their nonoptimal investment through the opportunity cost that they've paid. Intuitively it seems to me like it would be double-counting to also reduce the credits that they receive.

calebp

So is it reasonable to interpret your process as saying FAR was similarly impactful to AI safety events over the last year?

Dawn Drescher

AI Safety Events is one of the projects where we expanded the time window because they were on a hiatus in earlier 2023. The events that got evaluated were from 2022. Otherwise yes. (But just to be clear, this is about the retroactive evaluation results mentioned at the bottom of the post.)

Comments

More from the author

With the Future of the World in Your Hands, Think for 6.77 Years!

Dawn Drescher·11mo ago·12m read

Play Regrantor: Move up to $250,000 to Your Top High-Impact Projects!

Dawn Drescher, Greg_Colbourn ⏸️·3y ago·2m read

Are there highly leveraged donation opportunities to prevent wars and dictatorships?

Dawn Drescher·4y ago·1m read

Curated and popular this week

What would an animal-aligned AI be aligned to?

Aidan Kankyoku, Anima International·1w ago·Curated 2d ago·15m read

This is a crosspost from the new Animal Welfare Alignment Newsletter by Anima International. You can subscribe on Substack if you are interested in following these efforts. Audio reading also available on Substack. The goals of this post are to: 1. Raise a question I see as crucially important to the goal of aligning AI to animal welfare...

173

The first video from Giving What We Can's new channel is out now!

JustinPortela·4d ago·1m read

Hello! I'm Justin Portela. I got hired by GWWC to make YouTube videos after AI in Context did such a kickass job. My channel is using that same cinematic, high-production value beauty to talk about everything in the EA universe that isn't AI. ...

New round of digital minds funding opportunities at Longview

zdgroff, Longview Philanthropy·5d ago·2m read

This is a linkpost for Request for Proposals: Research and Applied Work on Digital Minds. I'm glad to announce a request for proposals for research and applied work on digital minds at Longview Ph...

Recent opportunities to take action

A huge way you can help pigs in 5-20 minutes (in the US)

ElliotTep·2d ago·1m read

173

Possible mistake EAs are making and shout out to Pause AI UK

Michelle_Hutchinson·2w ago·4m read

PauseCon London '26: Applications now open

Jonathan@PauseAI·1d ago·1m read

Dawn Drescher

But when it comes to the credits, the instructions to the evaluators are probably a good guide:

“Imagine that you’re given a budget of 1,000 impact credits to allocate across projects (not artifacts). (a) Please allocate them to the projects in proportion to how impactful you think they were. (b) There’s no transaction cost and no change in marginal utility (the first credit is worth the same as the last to a project). (c) We’ll … average your scores, multiply the averages with the number of evaluators, normalize, and then allocate that product to minimize the impact of recusals.”
And further down: “Note that this is a purely retroactive evaluation. (a) You can ignore the tractability of producing an output since they’ve all been produced. (b) Likewise please ignore the cost at which the output was produced. (c) Do consider neglectedness, though, and consider how likely some equivalent output would’ve been produced anyway had it not been for the given project. (d) Consider the ex ante expected utility. A bullshit project mustn’t get a high score because it somehow got unpredictably lucky. (Fictional examples.)”

Rank	Project	Credits
1	FAR AI	1768
2	AI Safety Events	1457
3	Centre For Enabling EA Learning & Research	842
4	AI Safety Support	695
5	Center for the Study of Existential Risk	607
6	Rational Animations	601
7	Campaign for AI Safety	579
8	AI X-risk Research Podcast	566
9	Simon Institute for Longterm Governance	490
10	Pour Demain	481
11	Alignment Jam	476
12	The Inside View	466
13	EffiSciences	415
14	Center for Reducing Suffering	397
15	Modeling Cooperation	233
16	QACI	158
17	AI Objectives Institute	151
18	Virtual AI Safety Unconference	142
19	Alignment Plans	131
20	AI Safety Ideas	105
21	Global Catastrophic Risk Institute	96

The Top AI Safety Bets for 2023: GiveWiki’s Latest Recommendations

The Top AI Safety Bets for 2023: GiveWiki’s Latest Recommendations

Top Projects

Ties for Tier 1

Ties for Tier 2

Ties for Tier 3

Other projects with > 200 support

Evaluation Process

Preliminaries

Lessons Learned

Wiki Credits Ranking

Qualitative Results

AI Safety Events

The Inside View

AI X-risk Research Podcast

AI Safety Ideas

Orthogonal

Center for Reducing Suffering

FAR AI

Centre For Enabling EA Learning & Research (EA Hotel)

AI Safety Support

Epilogue