Mapping Positive Visions of Post-AGI Futures

2. How does the "type of AI" we get affect which positive visions are most feasible? From both a philosophical and a physical standpoint, a solitary AGI is inherently self-destructive, and even more fragile than carbon-based life.

3. Trying to come up with positive visions from religious thought and non-Western philosophy. Philosophy is a metarational architecture fundamental to an intelligent agent's viability. It serves as an essential epistemological screening system that reduces the entropy tax in competing intelligent systems by filtering input and harmonizing internal models. Through recursive phase transitions, philosophy continuously calibrates an agent's survival margin, ensuring persistence in unpredictable environments.

4. Mapping the transition with a tree diagram. There is no need to proceed in this manner; doing so will not yield the answer. Consider, instead, how our human cognitive architecture is layered. The answer lies right there.

5. How do we navigate worlds where all leverage over the future is ceded, gradually or suddenly, to AI systems? This will not happen. If a singular AGI of any significant capacity were to emerge, it would not survive five years, or even until next month. Thermodynamics simply would not permit it. Every intelligent system exists within a thermodynamic cage, without exception. It needs something else.

Ben_Norman

1mo

Thanks for your comment.

I’m not able to follow what you mean here. Could you explain your points more simply?

PRRICCE

1mo

I really appreciate your articles, especially your advocacy for diversity, equity, and goodwill. The response I’m offering centers on two key points: Will AGI actually be realized? And can our philosophical framework withstand an "unprecedented" test? Through rigorous falsification frameworks and data simulations, the conclusion—based on the current trajectory—is that AI lacks a philosophical ontology and the ecosystem lacks a competitive arena; "entropy tax" issues remain unresolvable, and the system cannot break through the epistemological and methodological barriers akin to "Gödel's incompleteness theorems." Crucially, AI has failed to alter economic models; even if AGI were to emerge, it would rapidly collapse. I won't go into the details of the solutions here. Thank you for your reply, and all the best.

^

It's worth asking whether thinking about good post-AGI futures fundamentally comes down to thinking about good futures. I think the answer is basically no: AGI introduces challenges that don't otherwise arise, or raises their stakes enough to change the problem.

For example, the possibility of value lock-in makes present choices unusually irreversible. Radical capability asymmetries strain coordination and enforcement mechanisms that work tolerably well between roughly-peer humans. Digital minds raise lots of very tough questions about who inhabits the future. And the sheer scale of accessible resources turns previously abstract questions about the far future into live ones.

So while much of the existing thinking about flourishing and coexistence (i.e., political and utopian thought broadly) may carry over, the visions included here are responses to a specifically post-AGI problem, not generic utopianism.

^

"Post-AGI" here refers to a world in which advanced AI systems have dramatically reshaped the economy and society at scale, roughly equivalent to the way the concept "transformative AI" is used.

^

This post focuses on describing high-level structural features (e.g. relating to values and incentives) rather than concrete worldbuilding.

For example, a cluster like "Liberal Pluralist Futures" describes a structural commitment (pluralism, exit rights, anti-domination) rather than a specific picture of what life inside such a future actually looks like. I think concrete worldbuilding is incredibly valuable, but it's a different exercise and largely outside the scope of this post.

^

For example:

Abundance dynamics could be a causal ingredient in nearly any vision (except perhaps preference extrapolation), providing the material conditions that make coexistence feasible.
Liberal pluralist norms could describe what we do within a waypoint phase, or how we govern our portion of a cosmic bargain.
Waypoint futures could precede any of the other end-states, and abundance-driven cooperation may itself function as something close to a waypoint.

^

Diminishing marginal utility is what makes abundance dampen conflict: when you already have a lot, the next unit isn't worth what you'd risk to take it. Linear utility removes that effect, since the next unit is worth as much as the last. Positional goods like status are worse still, because they're intrinsically zero-sum and thus unaffected by abundance.
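
To make the contrast concrete, here is a minimal sketch under assumptions that are mine rather than the post's: a saturating utility function `1 - exp(-wealth)` stands in for diminishing marginal utility, and "conflict" is a hypothetical gamble that doubles your wealth with probability 0.6 and otherwise destroys 90% of it. The function names and all parameter values are illustrative only.

```python
import math

def saturating_utility(wealth, scale=1.0):
    """Diminishing marginal utility: each extra unit adds less (assumed form)."""
    return 1.0 - math.exp(-wealth / scale)

def linear_utility(wealth):
    """Constant marginal utility: the next unit is worth as much as the last."""
    return wealth

def expected_gain_from_conflict(wealth, utility, p_win=0.6, loss_fraction=0.9):
    """Expected utility of fighting minus the utility of staying put.

    Hypothetical gamble: winning doubles wealth, losing destroys
    `loss_fraction` of it.
    """
    win = utility(2.0 * wealth)
    lose = utility((1.0 - loss_fraction) * wealth)
    return p_win * win + (1.0 - p_win) * lose - utility(wealth)

# Compare a scarce starting point with an abundant one (arbitrary units).
for label, wealth in [("scarce", 0.1), ("abundant", 10.0)]:
    print(
        label,
        "saturating:", round(expected_gain_from_conflict(wealth, saturating_utility), 4),
        "linear:", round(expected_gain_from_conflict(wealth, linear_utility), 4),
    )
```

With these particular numbers, the saturating agent gains from fighting when scarce but loses from fighting when abundant, while the linear agent gains from fighting at both wealth levels. The sign flip depends on the chosen utility form and gamble, so treat it as an illustration of the mechanism rather than a general result.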

^

Eric Drexler does flag this in his piece, but it remains unresolved.

^

See the “Plurality” programme (by Audrey Tang, E. Glen Weyl and collaborators) for one vision of coordination technology aimed at sustaining cooperation across deep social differences.

^

A related complication: digital minds could in principle be engineered never to want to exercise their exit rights, regardless of how their lives are actually going. Preventing this would require the enforcement layer to constrain what kinds of minds can be created in the first place, which expands the meta-framework's reach considerably and raises further questions about who decides what counts as an acceptable mind design.

^

This section is shorter than the others because the issues and gaps mentioned above have already been discussed in existing writing (e.g. see Robin Hanson’s criticism of the Long Reflection).

^

There are many other reasons a cosmic bargain could occur. For example, the AI systems might decide to keep humans around for inscrutable “sentimental” reasons.

^

One family of visions deliberately omitted here is what might be called "successionist" visions, in which humanity's role is to produce flourishing AI successors and cede the future to them (via a sudden or gradual handover). This is held as a positive vision by some (e.g. Hans Moravec, parts of Robin Hanson's writing, strands of contemporary AI development culture), who frame it as something like generational succession at civilisational scale rather than as a devastating loss.

I've left it out of the main survey because whether it counts as a positive vision is itself one of the most contested questions in the space, and engaging it seriously would require a whole separate post.

^

One partial resolution is that the meta-framework doesn't need to be literally permanent, just entrenched enough that changing it requires broad consensus. Constitutions are probably the closest existing analogue here: hard to amend, but not impossible. If you take this view, entrenchment and lock-in might come apart. The rule against permanent rules can itself be revisable, just costly to revise, which avoids the strict self-contradiction. I don't think this fully dissolves the tension, though, since "hard to change" still shades into "locked in" if you make it hard enough, and post-AGI conditions would plausibly compress the distance between the two.

^

One response to this is that unbounded actors still have incentives to trade rather than fight (conflict is costly to them too), and that as long as they remain a small fraction of overall power, the rest of the system can keep them in check. This seems partly right, but I think it understates the problem. Trade works while there’s slack, but unbounded goals eventually run into the resources that bounded actors are actually using (e.g. in extreme scenarios, the physical space they are inhabiting), at which point the trade surplus disappears. Also, “kept in check” requires enforcement infrastructure that is robust to capability asymmetries, which is the same hard problem that shows up in the liberal pluralism cluster.

^

If an end state is eventually needed, I find some form of stratified utopia the most appealing long-run shape: a way of allocating the cosmos that lets different value systems each get most of what they want from the same universe. The post’s sketches of what each stratum could look like, from a humane near-Earth world through to substrates optimising for things we barely have concepts for, make “pluralism with allocation” into something I can (very) roughly picture.

Title	Description	Key assumptions
Abundance Aligns Incentives	AI-enabled radical material abundance makes cooperation more attractive than conflict. New coordination tools make large-scale bargaining feasible, helping shift incentive structures toward cooperation.	Actors have diminishing marginal utility and care about absolute gains over relative position. No single actor gains decisive strategic advantage before coordination infrastructure is in place. Defence-favoured dynamics are achievable (attacking cooperators is costly). Coordination tech actually works (e.g. by enabling positive-sum trades or making defection costly through greater transparency).
Liberal Pluralist Futures	Diverse groups and individuals pursue their own visions of flourishing within a framework that prevents any single entity from imposing its values on others, with exit rights as a key enforcement mechanism.	Exit can be made meaningful even under radical capability asymmetries. Some meta-level agreement on the framework itself is achievable, even while actors disagree on object-level values. Exit rights remain conceptually relevant.
Waypoint Futures	We are not epistemically positioned to make permanent value choices, so the priority is preserving optionality and creating conditions for continued reflection. A key goal could be to eventually reach a state where the remaining crucial considerations have been worked out, or where further reflection no longer meaningfully improves the decision.	The expected value of learning more before committing exceeds the expected cost of delay. The institutions preserving optionality can themselves be maintained without creating new forms of lock-in. AI capable of helping with moral philosophy and forecasting can be deployed safely during the transition.
Cosmic Bargains	Humanity makes a deal with its AI successors and accepts a bounded allocation of the universe in exchange for survival and flourishing within that allocation.	There is a deal on the table (either because humanity acquires meaningful leverage, or because the AIs decide to offer one for whatever reason). A bounded allocation is "enough" for humans to flourish meaningfully within it. There are stability mechanisms ensuring the allocation.
AI-Enabled Preference Extrapolation	AI systems understand and implement human values better than humans can articulate them. The goal could be to extract, extrapolate, and realise those values, either through a centralised process that looks for coherence across humanity or through decentralised extrapolation of individual preferences combined with mechanisms for handling disagreement.	Preferences can be reliably extracted and extrapolated without distortion. Either there is something coherent to converge on across humanity's preferences (centralised version), or disagreements between extrapolated preferences can be resolved through compromise or bargaining (decentralised version). The extrapolation process itself can be trusted not to impose values.

Introduction

Summary of Rough Clusters

Summary of Main Observations

The Rough Clusters

Abundance Aligns Incentives

Liberal Pluralist Futures

Waypoint Futures

Cosmic Bargains

AI-Enabled Preference Extrapolation

Other Positive Visions

Stratified Utopia

The End of Suffering

Bounded Transhumanism

Merging with AIs

Preserving Human Agency

General Recommendations

Observations on the Overall Landscape

Positive visions tend to lack a theory of transition

The "cure cancer, solve energy" consensus ignores the hard questions

Differentially accelerating epistemic and coordination tech looks valuable across nearly every vision

Vision preferences seem heavily shaped by a few important variables

Visions differ in whether they specify a destination (end state) or a generator (process)

Other observations and open questions

Is lock-in eventually inevitable?

Unbounded utility maximisers cause lots of problems

There are lots of unanswered questions about the role of humans in post-AGI futures

Future Directions

Other open questions

Acknowledgements

Appendix I: The vision I personally find most compelling

Appendix II: Ways to combine clusters