Prediction Markets in The Corporate Setting

NunoSempere; elifland; Misha_Yagudin

Comments 15

[anonymous]

Strong upvote.

FWIW, I'm pretty sure the biggest long-term obstacle is the illegality. Consider the immaturity of the technology, user-interface, question-writing process, methods to integrate these forecasts into actual decision-making for big firms, and so on. These would probably all improve in a world with vibrant, legal, let-it-rip prediction markets.

Take the early Covid forecasts as an example. It's easy for the authorities to ignore unknown nerds on Metaculus, even though they turned out more accurate. And we saw PolyMarket hosting questions about Covid too, but they got fined a million dollars. They still run, but I'm sure they are kept on a very tight leash, with their potential hamstrung to a mere shadow.

The alternative involves letting that stuff continue to develop, to establish and expand, with thousands of instruments trading on all sorts of pathogen-related questions.

At some point, it would be untenable for epidemiology authorities to ignore it. It would be a bit like central bankers ignoring the stock market, or farmers ignoring the commodity futures market.

The illegality may not directly stop innovation, and doesn't stop companies from internally dabbling in it. But it has an overwhelming chilling effect on the endeavor. Imagine a startup founder trying to bake in a "Fire the CEO" market into the company's structure. They would of course be fined by the CFTC, once they're big enough to be noticeable.

Thus, prediction markets are relegated to illegitimate crypto projects, which no big firm will want to touch. Or internal experiments that die as soon as the enthusiast takes a position somewhere else, instead of being the invasively-growing market that it probably would be. Or prestige-points like Metaculus, good for hobbyists, bragging rights, and maybe a resume for an unusual think tank position. Or developing countries with a tiny fraction of the potential value and talent. Or possibly good for a tax writeoff, assuming Manifold pulls that off without getting shut down or fined at some point.

All the points you raised are very illuminating, and sobering for enthusiasts like myself. I'm just surprised that when people speculate about why prediction markets haven't taken off, they seem to gloss over the one crucial, long-term problem. They're illegal, you get fined or shut down.

NunoSempere

Thanks Jotto, I agree that evaluating prediction market potential with reference to what it has achieved is messy because it has been stymied by regulations. Note though that it has been less stymied in the EU.

Misha_Yagudin

I think CFTC has no authority over play-money internal prediction markets, so that undercuts illegality a bit.

I guess one might even experiment with structuring them as real money markets, e.g., by paying winnings as "bonuses."

Ozzie Gooen

On the whole I liked this a lot, and I broadly agree.

Around "academics being too optimistic": I've seen similar a few times before and am pretty tired of it at this point. I'm happy that interesting ideas are brought forward, but I think the bias is pretty harmful. In fairness though, this is really a community issue; if our community epistemics were better, then the overconfidence of academic takes wouldn't have led to much overconfidence of community beliefs.

Some thoughts:
1. I agree that the implementation of "general purpose many-employee prediction markets/tournaments" so far has been fairly costly. At the very least, at this point, it's clearly not "a clear big win."

2. That said, I think the above is more restrictive than what we should limit ourselves to. Note that:
A. "Structured, team prediction systems" already perform well. See regular financial projections made by accounting teams, sales figures made by sales teams, tech time delivery estimates made by technical teams. I think that these systems can very clearly be improved by "prediction tournament" methods, like using scoring rules.
B. Many companies already pay for analysts/consultants/strategists to do broad forecasts. These could also be augmented with forecasting techniques.
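As a rough illustration of what "using scoring rules" could look like for point A, here is a minimal sketch that scores two teams' probability forecasts with the Brier score and the log score. The teams, questions, and numbers are all hypothetical and not taken from the post; the two rules themselves are the standard ones.

```python
import math

def brier_score(forecasts, outcomes):
    """Mean squared error between probabilities and 0/1 outcomes (lower is better)."""
    return sum((p - o) ** 2 for p, o in zip(forecasts, outcomes)) / len(forecasts)

def log_score(forecasts, outcomes):
    """Mean log-probability assigned to what actually happened (higher is better)."""
    return sum(
        math.log(p if o == 1 else 1 - p) for p, o in zip(forecasts, outcomes)
    ) / len(forecasts)

# Hypothetical data: did each of four projects ship on time? (1 = yes, 0 = no)
outcomes = [1, 0, 1, 1]
team_a = [0.9, 0.2, 0.8, 0.7]  # assumed forecasts from a confident, well-calibrated team
team_b = [0.6, 0.5, 0.5, 0.5]  # assumed forecasts from a team that mostly hedges

for name, forecasts in [("team_a", team_a), ("team_b", team_b)]:
    print(name, brier_score(forecasts, outcomes), log_score(forecasts, outcomes))
```

Both rules are proper, so a team does best in expectation by reporting what it actually believes. That property is what would make them a candidate for bolting onto existing sales or delivery estimates.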

I think it's pretty clear that some sorts of forecasting clearly work and provide business value, and that others don't. This post didn't seem to get into these sorts of forecasting setups.

3. So, a few groups have tried "in-house general purpose many-employee prediction markets/tournaments", but they haven't taken off. I think our prior should be that they're difficult, as most "weird new trends" either take a lot of figuring-out to get right, or just die. In order to get a sense of the potential, it's much more important to focus on some successes, than on the success of the average participant. This means that at this point, some promising future research would be, "Try to identify the successes, and understand them very well." (Again, I agree that our expectations should be much lower than what some have suggested. And given that, it makes sense to realize that we're starting more from the bottom than others thought, and take the corresponding steps).

4. I didn't particularly like the section, "must not have too large negative side-effects":

First, I agree that cultural dynamics matter a lot, and agree with a few specific points.
This section seemed interesting as a set of hypotheses as to why internal prediction markets haven't taken off, but isn't very valuable as a list of reasons why we should be excited about internal prediction markets or not.
To the above point: I'm sure there are also a lot of positive side-effects, and these weren't mentioned in this piece. It seems to me like encouraging these markets would help make an organization more truth-seeking and candid, which is a cultural transformation that seems like it could be really positive. Public discussions of a company's prediction systems would act as advertising to recruit the kinds of people this would be a fit for. My guess is that it's a niche thing for now, but for the right sorts of corporate cultures (like hedge funds), it could be great.
The specific points in this section seemed particularly speculative to me, like they come from a particular worldview.
One likely bias is not that it has negative side-effects, but rather that it's just bad for some important managers because of principal-agent problems. For example, a middle manager really doesn't want others to know that a team is failing, even if the CEO would much prefer it be known. This would make these systems attractive to top management.

5. In the "conclusions" section, the authors seem to assume a binary; that the company can either "go all in on internal prediction markets" or "not do them completely". Maybe this was assumed as part of the research proposal, but I don't agree with the assumption.

Doing small-scale experiments with motivated actors is really cheap and almost always the first step when trying out any new method. I agree with the conclusions that "going all in at once" is a bad strategy, but it seems really easy to try it out on smaller scales for a while and see how it goes. This could mean:
1) Try it with a few small teams who seem like particularly good fits.
2) Make it available to the entire organization, but only have around 5-20 interesting questions per year.
3) Consider really bare-bones versions of prediction tournaments. Like, just having a whiteboard of some key questions, or simple spreadsheets, or similar.

6. Thinking about it more, it seems like internal prediction systems are probably a good fit for cultures that are candid/nerdy, and a bad fit for others (especially as it's so early).

7. There's also a pretty big positive externality of clever groups trying these sorts of methods out and writing about them. We need some organizations to do this in order to develop the methods better. I'm not sure at all if this was a key consideration for Upstart, but I think it could be for others reading this.

NunoSempere

I've seen similar a few times before and am pretty tired of it at this point

I think I'd sort of encountered the issue theoretically, and maybe some ambiguous cases, but I researched this one at some depth, and it was more shocking.

Fair point on 2. (prediction markets being too restrictive) and 3. ()

4. I think is a feature of the report being aimed at a particular company, so considerations around e.g., office politics making prediction markets fail are still important. As you kind of point out, overall this isn't really the report I would have written for EA, and I'm glad I got bought out of that.

5. I don't think this is what we meant, e.g., see:

Like Eli below, I am also in favour of starting with small interventions and titrating one's way towards more significant ones.

For internal predictions, start with interventions that take the least amount of employee time

I.e., we agree that small experiments (e.g., "Delphi-like automatic prediction markets built on top of dead-simple polls") are great. This could maybe have been expressed more clearly.

On the other hand, I didn't really have the impression that there was someone inside Upstart willing to put in the time to do the experiments if we didn't.

6. Sure. One thing we were afraid of was cultures sort of having the incentive to pretend they were more candid than they really are. Social desirability bias feels strong.

7. (experimentation having positive externalities.) Yep!

Paal Fredrik Skjørten Kvarberg

On 4., I very much agree that this section could be more nuanced by mentioning some positive side-effects as well. There might be many managers who fear being undermined by their employees. And surely many employees might feel ashamed if they are wrong all the time. However, I think the converse is also true: managers are insecure, and would love for the company to make decisions on complex, hard-to-determine issues collectively, and employees would like an arena to express their thoughts on things (where their judgments are heard, and maybe even serve to influence company strategy). I think this is an important consideration that didn't get through very clearly. There are other plausible goods of prediction markets that aren't mentioned in the value prop, but which might be relevant to their expected value.

Jackson Wagner

This is really good, thanks! I am a fellow fan of prediction markets and have been puzzled by their slow adoption, so I really appreciate this comprehensive and honest look at some of their drawbacks.

Seems like some key interventions might be:

As always, making prediction markets more legal / less banned...
You talk about how teamwork philosophies like scrum/agile/etc, and techniques like kanban boards and issue tracking and internal company wikis, have really good polished software tools (like gitlab) but prediction markets don't. Seems like there is a good opening for a gitlab-like company to handle the regulatory/legal issues and offer a prediction market solution that could be adapted for companies' specific situations? Idk if anything is happening in that space or has been tried before.

You say, "Sales are important but forecasting sales might not be as important. In particular, forecasting sales might not meaningfully help increase them." That makes total sense. But I gotta wonder... this is just begging for the classic futarchy technique of conditional prediction markets! Just like predicting "what will USA GDP be under a Republican president" versus "GDP under a Democrat", just predict sales conditional on marketing strategy A versus marketing strategy B! Obviously this turns up the complexity required for an endeavor that is already clunky and unfamiliar... but it could be worthwhile, especially if some of the complexity could be handled by a Gitlab-style company.
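To make the conditional-market idea concrete, here is a minimal sketch of the decision step: one market per strategy, adopt the strategy whose market forecasts higher sales, and void the markets whose condition never comes true. The `ConditionalMarket` class, the `choose_and_settle` helper, and the sales figures are all hypothetical, and real trading, pricing, and payouts are left out.

```python
from dataclasses import dataclass

@dataclass
class ConditionalMarket:
    condition: str         # e.g. "marketing strategy A is adopted"
    forecast_sales: float  # market's current estimate of sales if the condition holds

def choose_and_settle(markets):
    """Pick the condition with the highest forecast; the other markets are voided.

    Voided markets would refund their traders, since their condition never
    occurred and there is nothing to resolve them against.
    """
    chosen = max(markets, key=lambda m: m.forecast_sales)
    voided = [m.condition for m in markets if m is not chosen]
    return chosen.condition, voided

# Hypothetical forecasts for next quarter's sales under each strategy.
markets = [
    ConditionalMarket("strategy A", 1_200_000.0),
    ConditionalMarket("strategy B", 1_500_000.0),
]
chosen, voided = choose_and_settle(markets)
print("adopt:", chosen, "| refund traders in:", voided)
```

Only the chosen market ever resolves against real sales. That is part of the added complexity: traders in the other markets tie up attention and capital for a refund.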

NunoSempere

Hey, I appreciate this comment. I've shared this post with a few prediction markets people; we'll see if any of them want to become the Gitlab of prediction markets.

MichaelA🔸

Thanks for this post! I've only read the Executive Summary so far, but it sounds interesting, hope I get a chance to read later. I also just wanted to add a link to Issues with Futarchy (by Lizka, during her internship at Rethink Priorities) since I think it covers somewhat similar ground, I thought it was interesting, and I don't think it's cited here (based on a quick search + based on there being no "pingback" to this post from Lizka's). So it might be of interest to you three and/or to readers of this post.

(EDIT: I've now read this whole post and indeed found it interesting.)

Paal Fredrik Skjørten Kvarberg

Thank you all for posting this! I am one of the people who are confused by the puzzle you make serious inroads towards shedding light on in this post. I really appreciate that you break down explanatory factors in the way you do. To me, it seems like all four factors are important pieces of the puzzle. Here they are:

The markets must have a low enough cost to create and maintain.
The markets must provide more value to decision-makers than the cost to create them and to subsidize predictions on them.
The markets must be attractive enough to traders to elicit accurate predictions.
The markets must not have large negative side-effects, such as costs to the company's dynamics and morale.

Although you explain the idea behind each of these, I have a hard time making a mental model of their relative importance compared to each other. Do you think that such an exercise is feasible, and if so, do any of you have a conception of the relative explanatory strength of any factor when considered against the others? Also, do you think that it is likely that the true explanation has nothing to do with any of these? In that case, how likely?

elifland

I really appreciate that you break down explanatory factors in the way you do.

I'm happy that this was useful for you!

I have a hard time making a mental model of their relative importance compared to each other. Do you think that such an exercise is feasible, and if so, do any of you have a conception of the relative explanatory strength of any factor when considered against the others?

Good question. We also had some trouble with this, as it's difficult to observe the reasons many corporate prediction markets have failed to catch on. That being said, my best guess is that it varies substantially based on the corporation:

For an average company, the most important factor might be some combination of (2) and (4): many employees wouldn't be that interested in predicting and thus the cost of getting enough predictions might be high, and there also just isn't that much appetite to change things up.
For an average EA org, the most important factors might be a combination of (1) and (2): the tech is too immature and writing + acting on good questions takes too much time such that it's hard to find the sweet spot where the benefit is worth the cost. In particular, many EA orgs are quite small so fixed costs of setting up and maintaining the market as well as writing impactful questions can be significant.

This Twitter poll by Ozzie and the discussion under it is also interesting data here; my read is that the mapping between Ozzie's options and our requirements is:

They're undervalued: None of our requirements are substantial enough issues.
They're mediocre: Some combination of our requirements (1), (2), and (3) make prediction markets not worth the cost.
Politically disruptive: Our requirement (4).
Other

(3) won the poll by quite a bit, but note it was retweeted by Hanson, which could skew the voting pool (h/t Ozzie for mentioning this somewhere else).

Also, do you think that it is likely that the true explanation has nothing to do with any of these? In that case, how likely?

The most likely possibility I can think of is the one Ozzie included in his poll: prediction markets are undervalued for a reason other than political fears, and all/most of the companies made a mistake by discontinuing them. I'd say 15% for this, given that the evidence is fairly strong but there could be correlated reasons companies are missing out on the benefits. In particular, they could be underestimating some of the positive effects Ozzie mentioned in his comment above.

As for an unlisted explanation being the main one, it feels like we covered most of the ground here and the main explanation is at least related to something we mentioned, but unknown unknowns are always a thing; I'd say 10% here.

So that gives me a quick gut estimate of 25%; would be curious to get others' takes.

Elicit Prediction (forecast.elicit.org/binary/questions/QE_p9ZG5q)

Paal Fredrik Skjørten Kvarberg

Thank you for this. This is all very helpful, and I think your explanation of giving differential weights to factors for average orgs and EA orgs seems very sensible. The 25% for unknown unknowns is probably right too. It doesn't seem unlikely to me that most folks at average orgs would fail to understand the value of prediction markets even if they turned out to be valuable (since it would require work to prove it).

It would really surprise me if the 'main reason' why there is a lack of prediction markets had nothing to do with anything mentioned in the post. I think all unknown unknowns might jointly explain 25% of why prediction markets aren't adopted, but the chance of any single unknown factor being the primary reason is, I think, quite slim.

Tsunayoshi

"Moreover, I observe that machine-learning or model-based or data-analysis solutions on forecasting weather, pandemics, supply chain, sales, etc. are happily adopted, and the startups that produce them reach quite high valuations. When trying to explain why prediction markets are not adopted, this makes me favor explanations based on high overhead, low performance and low applicability over Robin Hanson-style explanations based on covert and self-serving status moves."

I agree that the success of bespoke ml tools for forecasting negates some of the Hansonian explanations, but probably not most of them.

As ML tools replace human forecasts, they do not pose a threat to the credibility of executives. They do not have to provide their own forecasts that could later be falsified.
(Speculative) The forecasts produced by such tools are presumably not visible to every employee, while many previous instances of prediction markets had publicly visible aggregate predictions.
These tools forecast issues that managers are not traditionally expected to be able to forecast. Weather and pandemics are certainly not in the domain of executives, and I am unsure whether managers usually engage in supply chain and sales predictions.
These tools do not actually provide answers that could be embarrassing to executives, and for which prediction markets with aggregated human expertise could be useful. For example, machine learning cannot predict "conditional on proposal by CEO Smith, what will our sales be". A good test for this explanation could be how many companies allow feedback to strategy proposals by employees and visible to all employees.

NunoSempere

These tools forecast issues that managers are not traditionally expected to be able to forecast

The thing is, not really. Some of these ML companies offer predictions for employee retention or project timelines, which managers would in fact be expected to forecast.

mickbransfield

In this webinar, Dan Schwarz of Metaculus recommended having more than 10K employees at a company in order to have enough active traders in an internal market. (He also seemed to reference this paper when discussing corporate prediction markets.)

Company	Source	Notes
Eli Lilly	The End Of Management, The Times	Large American pharmaceutical corporation
Ford & others	Cowgill and Zitzewitz, 2015
Goldman Sachs, Deutsche Bank	Wolfers and Zitzewitz, 2004	High volume markets, website only remains on the Internet Archive
Google	Cowgill, Wolfers, et al., 2009
Hewlett Packard	Chen and Plott, 2002	Markets were thinly traded, but still performed better than HP's own predictions
Koch Industries	Cowgill and Zitzewitz, 2013	Early draft of the 2015 paper mentions Koch Industries, though this was removed in the final version.
Microsoft	Prediction Markets at Microsoft	Very clear pdf; very much worth downloading and reading.
Nokia	Hankins and Lee, 2011
Siemens	Ortner, 1998	Prediction markets predicted deadlines better than other processes.
Yahoo	Bloomberg
Yandex	Interview with Yandex employee.

Executive Summary

Introduction

What are prediction markets

Value proposition

Track record

High-profile companies that have used prediction markets.

Academic consensus

What is left unsaid in the academic literature

Requirements and challenges for a well-functioning prediction market

Categorization scheme

The market must have a low enough cost to create and maintain

The questions provided by the market must provide more value to decision-makers than the cost to create and predict on them

The market must be attractive enough to traders to elicit accurate predictions

The market must not have too large negative side-effects, such as costs to the company's dynamics and morale

Other Information Aggregation Mechanisms

External platforms

Specialized machine learning/data-analysis systems

Internal forecasting competitions

Delphi Method

Automatic Prediction Markets, Pseudo Prediction Markets

Low-tech options: Surveys and Interviews

Conclusion

Nuño Sempere

Misha Yagudin

Eli Lifland