Some research ideas in forecasting

Jaime Sevilla

Some research ideas in forecasting

Jaime Sevilla

8 min readNov 15, 2022

Comments 5

Sorted by

New & upvoted

Javier Prieto🔸

Your likelihood_pool method is returning Brier scores >1. How is that possible? Also, unless you extremize, it should yield the same aggregates (and scores) as regular geometric mean of odds, no?

Jaime Sevilla

I am so dumb I was mistakenly using odds instead of probs to compute the brier score :facepalm:

And yes, you are right, we should extremize before aggregating. Otherwise, the method is equivalent to geo mean of odds.

It's still not very good though

dschwarz

[I wrote this comment on LW, copying to this post. Shouldn't that happen automatically?]

Nice post! I'll throw another signal boost for the Metaculus hackathon that OP links, since this is the first time Metaculus is sharing their whole 1M db of individual forecasts (not just the db of questions & resolutions which is already available). You have to apply to get access though. I'll link it again even though OP already did: https://metaculus.medium.com/announcing-metaculuss-million-predictions-hackathon-91c2dfa3f39

There are nice cash prizes too.

As the OP writes, I think most the ideas here would be valid entries in the hackathon, though the emphasis is on forecast aggregation & methods for scoring individuals. I'm particularly interested in decay of predictions idea. I don't think we know how well predictions age, and what the right strategy for updating your predictions should be for long-running questions.

Jonas Moss

Thanks for writing this.

I wrote about "decay of predictions" here. I would classify the problem as hard.
Do you have a feeling for how suitable the projects are for academic projects? Such as bachelor theses or master theses, perhaps? It would be great to show a list of projects to students!

Jaime Sevilla

Thanks Jonas!

I'd forgotten about that great article! Linked.
I feel some of these would be good bachelor / MSc theses yeah!

Comments

More from the author

Curated and popular this week

Cultivating hope: calibrating the expectations for cultivated meat to end factory farming

PabloAMC 🔸·1w ago·Curated 6d ago·22m read

137

Maybe do the thing you wish CEA would do

alejoacelas 🔸·5d ago·2m read

I used AI to fix transcription errors, rerrarange the ideas, and suggest tweaks to the title and some sentences. Three of the most exciting projects to come out of EA in recent years are, in a vague sense, CEA spinouts: * Kairos is directly a spinout of CEA and now handles most support for university AI safety groups. Basically everyone I've found who knows them is really excited about what they do * NEST is an opinionated ideas-fi...

The first video from Giving What We Can's new channel is out now!

JustinPortela·23h ago·1m read

Hello! I'm Justin Portela. I got hired by GWWC to make YouTube videos after AI in Context did such a kickass job. My channel is using that same cinematic, high-production value beauty to talk about everything in the EA universe that isn't AI. ...

Recent opportunities to take action

New round of digital minds funding opportunities at Longview

zdgroff, Longview Philanthropy·2d ago·2m read

173

Possible mistake EAs are making and shout out to Pause AI UK

Michelle_Hutchinson·1w ago·4m read

Inspiring colleagues in Luxembourg on Effective Giving + identifying infrastructural gaps

Lorenzo Fong Ponce 🔸·20h ago·12m read

Method

Weighted

Brier

-log

Questions

Neyman aggregate (p=0.36)

Yes

0.106

0.340

899

Extremized mean of logodds (d=1.55)

Yes

0.111

0.350

899

Neyman aggregate (p=0.5)

Yes

0.111

0.351

899

Extremized mean of probabilities (d=1.60)

Yes

0.112

0.355

899

Metaculus prediction

Yes

0.111

0.361

774

Mean of logodds

Yes

0.116

0.370

899

Neyman aggregate (p=0.36)

0.120

0.377

899

Median

Yes

0.121

0.381

899

Extremized mean of logodds (d=1.50)

0.126

0.391

899

Mean of probabilities

Yes

0.122

0.392

899

Neyman aggregate (o=1.00)

0.126

0.393

899

Extremized mean of probabilities (d=1.60)

0.127

0.399

899

Mean of logodds

0.130

0.410

899

Median

0.134

0.418

899

Mean of probabilities

0.138

0.439

899

Baseline (p = 0.36)

N/A

0.230

0.652

899