Impactful Forecasting Prize Results and Reflections

elifland; Misha_Yagudin

Comments 10

Sorted by

New & upvoted

This was a cool contest, thanks for running it! In my view there's a lot of value in doing this. Doing a deep dive into polygenic selection for IQ was something I had wanted to do for quite a while and your contest motivated me to finally sit down and actually do it and to write it up in a way that would be potentially useful to others.

I think your initial criteria of how much a writeup changed your minds may have played a role in fewer than expected entries as well. Your forecasts on the set of questions seemed very reasonable and my own forecasts were pretty similar on the ones I had forecasted, so I didn't feel that I had much to contribute in terms of the contest criteria for most of them.

Hopefully that's helpful feedback to you or anyone else looking to run a contest like this in the future!

Misha_Yagudin

Yes, thank you; that makes sense and is very helpful!

elifland

Thanks for sharing Ryan, and that makes sense in terms of another unintended consequence of our judging criteria; good to know for future contests.

Ryan Beck

No problem!

Also if you're interested in elaborating about why my scenarios were unintuitive I'd appreciate the feedback, but if not no worries!

elifland

At first I thought the scenarios were separate so they would be combined with an OR to get an overall probability, which then made me confused when you looked at only scenario 1 for determining your probability for technological feasibility.

I was also confused about why you assigned 30% to polygenic scores reaching 80% predictive power in Scenario 2 while assigning 80% to reaching saturation at 40% predictive power in the Scenario 1, because when I read 80% to reach saturation at 40% predictive power I read this as "capping out at around 40%" which would only leave a maximum of 20% for scenarios with much greater than 40%?

Finally, I was a little confused about where the likelihood of iterated embryo selection fit into your scenarios; this seems highly relevant/important and is maybe implicitly accounted for in e.g. "Must be able to generate 100 embryos to select from"? But could be good to make more explicit.

Ryan Beck

There are good points and helpful, thanks! I agree I wasn't clear about viewing the scenarios exclusively in the initial comment, I think I made that a little clearer in the follow up.

when I read 80% to reach saturation at 40% predictive power I read this as "capping out at around 40%" which would only leave a maximum of 20% for scenarios with much greater than 40%?

Ah I think I see how that's confusing. My use of the term saturation probably confuses things too much. My understanding is saturation is the likely maximum that could be explained with current approaches, so my forecast was an 80% chance we get to the 40% "saturation" level, but I think there's a decent chance our technology/understanding advances so that more than the saturation can be explained, and I gave a 30% chance that we reach 80% predictive power.

That's a good point about iterated embryo selection, I totally neglected that. My initial thought is it would probably overlap a lot with the scenarios I used, but I should have given that more thought and discussed it in my comment.

FJehn

Oh wow, did not really enter to win anything. I just participated because I thought the idea is really cool and it gave me a good opportunity diving into a variety of topics. A pleasant surprise :)

I am a bit surprised by how few people participated. If I remember correctly, 4/13 submissions were by me. I talked about this prize with several people and all seemed eager to participate, but apparently they didn't. So, I am not sure if the lack of forecasters is due to too little promotion (though more would probably helped as well). Seems like there is a large gap between "I like the idea of this" and really sit down and participating.

However, I hope you do something like this again, as it helped me a lot to have a selection of meaningful questions to do forecasts on. Just opening Metaculus can be a bit overwhelming.

If this happens again, I'll try to hold the people that like the idea accountable, so that they do a forecast and not only think about it.

qassiov

Just saw this, thank you! I'm honoured to be listed alongside these insightful forecasters — and thank you for running it. I found this competition very fun and motivational, and think this kind of thing works well for fixing some incentive problems.

I can't say why more people didn't submit, but I can say what did help prompt me to submit two rationales:

Being aware of it: I checked the EA Forum when your announcement was near the top, which was fortunate because I often miss valuable posts here when I'm busy.
- More publicity would help, but I also think that having a way for people to 'opt in' to the competition when they see it would be good. I set my own reminders to do it, but I imagine others intended to participate but ended up forgetting.
The sense that it required a relatively small time-investment in proportion to the likelihood of winning a prize
- I think it's good knowing your rationales don't need to be perfect/comprehensive/outstanding in order to receive a prize
- Allowing/expecting people to submit multiple entries helps for this, as it meant I didn't need to worry too much about choosing the ideal question; being allowed just one entry can be off-putting I think
- Similarly, offering $4,000 across a maximum of 15 prizes stopped me from being ambitious and perfectionistic — the timeframe given was also good for that. However, it's quite odd to be motivated by small prizes, and I expect people may have been put off by the small-ish reward

Finally, I expect another reason why the timing may have led to fewer entries was because Future Fund applications were due 10 days later (that seems like a long time, but I know that its approaching deadline was on my mind when I submitted my rationales).

jwithing

I didn’t realize this was going on—I gotta check these forums more often! I love the transparency provided in your reflections here.

Anthony Repetto

There are a few blind-spots with this competition, and I am sitting where they intersect:

The Ethical Demand to Stop Bad Predictions - If I have a prediction that "this plane will crash", then it is wrong of me to make money off of that catastrophe, by allowing it to happen. I would be an accomplice. So, if I predict a negative outcome, I will attempt to prevent that outcome. Necessarily, my success would make my prediction false, yet it would be false for the wrong reason.
Who Creates and Chooses the Topics - Metacalculus is generating the topics, and then your team is curating them. This leaves-out everyone who has a unique prediction. [For example: "Brick-laying robots will cut house construction costs, which will ripple-out quickly due to 'comps', the way real estate is valued. That's when real estate investors stop buying; it's a falling knife. As a result, most Americans will be underwater on their mortgage in a decade." Where do I tell people that prediction? How does it pass your curation?]

It's a troublesome sign when the boss of the company says "Here is THE problem - how do you solve it?" Better when the boss asks "What's the problem?" I don't see forecasting succeeding at its aim, if it fails to be receptive to unique outsider predictions, or those are lost in a pile due to curation-by-tiny-groups. And, waiting for bad things to happen just to make a few bucks sounds like becoming a henchmen to me. Who wants to avert calamity, instead?

Comments

More from the author

357

My take on What We Owe the Future

elifland·3y ago·Curated 3y ago·31m read

216

Reasons I’ve been hesitant about high levels of near-ish AI risk

elifland·3y ago·8m read

183

Prioritizing x-risks may require caring about future people

elifland·3y ago·8m read

Curated and popular this week

Cultivating hope: calibrating the expectations for cultivated meat to end factory farming

PabloAMC 🔸·1w ago·Curated 1d ago·22m read

GWWC's 2025 impact evaluation (executive summary)

Aidan Whitfield🔸, Giving What We Can🔸·3d ago·2m read

This post presents the executive summary from Giving What We Can’s impact evaluation for 2025. At the end of this post we share links to more information, including the full report and...

Maybe do the thing you wish CEA would do

alejoacelas 🔸·1d ago·2m read

I used AI to fix transcription errors, rerrarange the ideas, and suggest tweaks to the title and some sentences. Three of the most exciting projects to come out of EA in recent years are, in a vague sense, CEA spinouts: * Kairos is directly a spinout of CEA and now handles most support for university AI safety groups. Basically everyone I've found who knows them is really excited about what they do * NEST is an opinionated ideas-fi...

Recent opportunities to take action

RP is looking for project founders in neglected animal areas

Rethink Priorities·1d ago·7m read