Quotes about the long reflection

MichaelA🔸

Quotes about the long reflection

MichaelA🔸

16 min readMar 5, 2020

Comments 14

Sorted by

New & upvoted

turchin

Critics: "‘Long Reflection’ Is Crazy Bad Idea" https://www.overcomingbias.com/2021/10/long-reflection-is-crazy-bad-idea.html

MichaelA🔸

Collection of sources that are highly relevant to the idea of the Long Reflection

The Precipice - Toby Ord, 2020 (particularly chapter 7 and some of its endnotes)
Toby Ord on the 80,000 Hours Podcast - 2020
Will MacAskill on the 80,000 Hours Podcast - 2018
Will MacAskill on the AI Alignment Podcast - 2018 (see also Rohin Shah's summary and commentary)
Cause prioritization for downside-focused value systems - Lukas Gloor, 2018 (I think)
AI Alignment Podcast: An Overview of Technical AI Alignment in 2018 and 2019 with Buck Shlegeris and Rohin Shah - FLI, 2020
Research agenda - Global Priorities Institute, 2019
Toby Ord's interview with the LA Review of Books - 2020 (mostly repeats things from The Precipice)
Crucial questions for longtermists - Michael Aird (me), work-in-progress (this contains a few questions related to the Long Reflection, and links to a doc with some more relevant sources)
This comment exchange between Lukas Gloor and Michael Aird (me) - 2020
This post I'm commenting on (though it largely just quotes some of the above sources)
I'm also working on some other relevant posts, which I could share drafts of on request

(The differences between this comment and the post are that I'll keep this comment up to date, it will just list sources without including quotes, and it won't include some of the less relevant sources because there's now more work on the Long Reflection than there was when I made this post.)

RandomEA

In the new 80,000 Hours interview of Toby Ord, Arden Koehler asks:

Arden Koehler: So I’m curious about this second stage: the long reflection. It felt, in the book, like this was basically sitting around and doing moral philosophy. Maybe lots of science and other things and calmly figuring out, how can we most flourish in the future? I’m wondering whether it’s more likely to just look like politics? So you might think if we come to have this big general conversation about how the world should be, our most big general public conversation right now is a political conversation that has a lot of problems. People become very tribal and it’s just not an ideal discourse, let’s say. How likely is it do you think that the long reflection will end up looking more like that? And is that okay? What do you think about that?

Ord then gives a lengthy answer, with the following portion the most directly responsive:

Toby Ord: . . . I think that the political discourse these days is very poor and definitely doesn’t live up to the kinds of standards that I loftily suggest it would need to live up to, trying to actually track the truth and to reach a consensus that stands the test of time that’s not just a political battle between people based on the current levels of power today, at the point where they’ll stop fighting, but rather the kind of thing that you expect people in a thousand years to agree with. I think there’s a very high standard and I think that we’d have [to] try very hard to have a good public conversation about it.

MichaelA🔸

Initial response: Ooh, there's a new 80k episode?! And it's with Toby Ord?! [visibly excited, rushes to phone]

Secondary response: Thanks for sharing that! Sounds like, as hoped, his book will provide and prompt a more detailed discussion of this idea than there's been so far. I look forward to gobbling that up.

Ben_West🔸

Thanks for collecting these!

RandomEA

The GPI Agenda mentions "Greg Lewis, The not-so-Long Reflection?, 2018" though as of six months ago that piece was in draft form and not publicly available.

mornemorkel443

“Life can only be understood backwards; but it must be lived forwards.” ― Søren Kierkegaard

Source: https://frasimondo.com/frasi-bellissime/

Sanjay

I'm slightly confused about the long reflection.

I understand it involves "maybe <...> 10 billion people, debating and working on these issues for 10,000 years". And *only after that* can people consider actions which may have a long term impact on humanity.

How do we ensure that

(a) everyone gets involved with working on these issues? (presumably some people are just not interested in thinking about this? Getting people to work on things they're unsuited for seems unhelpful and unpleasant)

(b) Actions that could have a long term impact on humanity could be taken unilaterally. How could people be stopped from doing that?

I think a totalitarian worldwide government could achieve this, but I assume that's not what is intended

MichaelA🔸

On (b): The first thing to note is that the Long Reflection doesn't require stopping any actions "that could have a long term impact", and certainly not stopping people considering such actions. (I assume by "consider" you meant "consider doing it this year", or something like that?)

It requires stopping people taking actions that we're not yet confident won't turn out to have been major, irreversible mistakes. So people could still do things we're already very confident are good, or things that are relatively minor.

Some good stuff from The Precipice on this, mainly from footnotes:

The ultimate aim of the Long Reflection would be to achieve a final answer to the question of which is the best kind of future for humanity. [...]

We would not need to fully complete this process before moving forward. What is essential is to be sufficiently confident in the broad shape of what we are aiming at before taking each bold and potentially irreversible action - each action that could plausibly lock in substantial aspects of our future trajectory.

Also:

We might adopt the guiding principle of minimising lock-in. Or to avoid the double negative, of preserving our options.

[Endnote:] Note that even on this view options can be instrumentally bad if they would close off many other options. So there would be instrumental value to closing off such options (for example, the option of deliberately causing our own extinction). One might thus conclude that the only thing we should lock in is the minimisation of lock-in.

This is an elegant and reasonable principle, but could probably be improved upon by simply delaying our ability to choose such options, or making them require a large supermajority (techniques that are often used when setting up binding multiparty agreements such as constitutions and contracts). That way we help avoid going extinct by accident (a clear failing of wisdom in any society), while still allowing for the unlikely possibility that we later come to realise our extinction would be for the best.

Also:

There may yet be ethical questions about our longterm future which demand even more urgency than existential security, so that they can’t be left until later. These would be important to find and should be explored concurrently with achieving existential security.

Somewhat less relevant:

Protecting our potential (and thus existential security more generally) involves locking in a commitment to avoid existential catastrophe. Seen in this light, there is an interesting tension with the idea of minimising lock-in (here [link]). What is happening is that we can best minimise overall lock-in (coming from existential risks) by locking in a small amount of other constraints.

But we should still be extremely careful locking anything in, as we might risk cutting off what would have turned out to be the best option. One option would be to not strictly lock in our commitment to avoid existential risk (e.g. by keeping total risk to a strict budget across all future centuries), but instead to make a slightly softer commitment that is merely very difficult to overturn. Constitutions are a good example, typically allowing for changes at later dates, but setting a very high bar to achieving this.

With this in mind, we can tweak your question to "Some actions that could turn out to be major, irreversible mistakes from a the perspective of the long-term future could be taken unilaterally. How could people be stopped from doing that during the Long Reflection?"

This ends up being roughly equivalent to the question "How could we get existential risk per year low enough that we can be confident of maintaining our potential for the entire duration of the Long Reflection (without having to take actions like locking in our best guess to avoid being preempted by something worse)?"

I don't think anyone has a detailed answer to that. But one sort-of promising thing is that we may have to end up with some decent ideas of answers to that in order to just avoid existential catastrophe in the first place. I.e., conditional on humanity getting to a Long Reflection process, my credence that humanity has good answers to those sorts of problems is higher than my current credence on that matter.

(This is also something I plan to discuss a bit more in those upcoming(ish) drafts.)

MichaelA🔸

I think being left slightly confused about the long reflection after reading these quotes is quite understandable. These quotes don't add up to a sufficiently detailed treatment of the topic.

Luckily, since I posted this, Toby Ord gave a somewhat more detailed treatment in Chapter 7 of The Precipice, as well as in his 80k interview. These sources provide Ord's brief thoughts on roughly the questions you raise. Though I still think more work needs to be done here, including on matters related to your question (b). I've got some drafts coming up which will discuss similar matters, and hopefully MacAskill's book on longtermism will go into more detail on the topic as a whole.

On (a): I don't think everyone should be working on these questions, nor does Ord. I'd guess MacAskill doesn't, though I'm not sure. He might mean something like "the 10 billion people interested and suited to this work, out of the 20+ billion people alive per generation at that point", or "this is one of the major tasks being undertaken by humanity, with 10 billion people per generation thus contributing at least indirectly, e.g. by keeping the economy moving".

I also suspect we should, or at least will, spend under 10,000 years on this (even if we get our act together regarding existential risks).

Ord writes in The Precipice:

It is unclear [exactly how] long such a period of reflection would need to be. My guess is that it would be worth spending centuries (or more) before embarking on major irreversible changes to our future - committing ourselves to one vision or another. This may sound like a long time from our perspective, but life and progress in most areas would not be put on hold. Something like the Renaissance may be a useful example to bear in mind, with intellectual projects spanning several centuries and many fields of endeavour. If one is thinking about extremely longterm projects, such as whether and how we should settle other galaxies (which would take millions of years to reach), then I think we could stand to spend even longer making sure we are reaching the right decision.

Donald Hobson

but just thought that slavery was a pre-condition for some people having good things in life. Therefore, it was justified on those grounds.

Rot13

Gung vf pyrneyl n centzngvp qrpvfvba onfrq ba gur fbpvrgl ur jnf va. Svefgyl, gur fynirel nf cenpgvfrq va napvrag Terrpr jnf bsgra zhpu yrff pehry guna pbybavny fynirel. Tvira gung nyybjvat gur fynir gb znxr gurve bja jnl va gur jbeyq, rneavat zbarl ubjrire gurl fnj svg, naq gura chggvat n cevpr ba gur fynirf serrqbz jnf pbzzba cenpgvpr, gung znxrf fbzr cenpgvprf gung jrer pnyyrq fynirel bs gur gvzr ybbx abg gung qvssrerag sebz qrog.

Frpbaqyl, ur whfgvslf vg ol bgure crbcyr univat avpr guvatf, vs fubja gur cbjre bs zbqrea cebqhpgvba yvarf sbe znxvat avpr guvatf jvgubhg fynirel, ur jbhyq unir cebonoyl nterrq gung gung jnf n orggre fbyhgvba. Rira zber fb vs fubja fbzr NV anabgrpu gung pbhyq zntvp hc nal avpr guvat.

Guveqyl, zbfg fpvragvfgf hagvy gur ynfg srj uhaqerq lrnef jrer snveyl ryvgr fbpvnyyl. Zrzoref bs fbzr hccre pynff jub pbhyq nssbeq gb rkcrevzrag engure guna jbex. Tvira gur ynetr orarsvg gurl unq, guvf ybbxf yvxr n zhpu ynetre fbhepr bs hgvyvgl guna gur qverpg avprarff bs univat avpr guvatf.
V qba'g guvax gur qrpvfvba ur znqr jnf haernfbanoyr, tvira gur fbpvny pbagrkg naq vasbezngvba ninvynoyr gb uvz ng gur gvzr.

EdoArad🔸

Why Rot13? This seems like an interesting discussion to be had

Donald Hobson

The rot13 is to make it harder to search for. I think that this is a discussion that would be easy to misinterpret as saying something offensive.

MichaelA🔸

(Redundant comment)

[This comment is no longer endorsed by its author]

Comments

More from the author

157

Survey on intermediate goals in AI governance

MichaelA🔸, MaxRa·3y ago·1m read

174

List of EA funding opportunities

MichaelA🔸·4y ago·7m read

154

Don’t think, just apply! (usually)

MichaelA🔸·4y ago·9m read

Curated and popular this week

What would an animal-aligned AI be aligned to?

Aidan Kankyoku, Anima International·1w ago·Curated 9h ago·15m read

This is a crosspost from the new Animal Welfare Alignment Newsletter by Anima International. You can subscribe on Substack if you are interested in following these efforts. Audio reading also available on Substack. The goals of this post are to: 1. Raise a question I see as crucially important to the goal of aligning AI to animal welfare...

153

Maybe do the thing you wish CEA would do

alejoacelas 🔸·6d ago·2m read

I used AI to fix transcription errors, rerrarange the ideas, and suggest tweaks to the title and some sentences. Three of the most exciting projects to come out of EA in recent years are, in a vague sense, CEA spinouts: * Kairos is directly a spinout of CEA and now handles most support for university AI safety groups. Basically everyone I've found who knows them is really excited about what they do * NEST is an opinionated ideas-fi...

150

The first video from Giving What We Can's new channel is out now!

JustinPortela·2d ago·1m read

Hello! I'm Justin Portela. I got hired by GWWC to make YouTube videos after AI in Context did such a kickass job. My channel is using that same cinematic, high-production value beauty to talk about everything in the EA universe that isn't AI. ...

Recent opportunities to take action

Find funding, fast

Austin·1d ago·3m read

New round of digital minds funding opportunities at Longview

zdgroff, Longview Philanthropy·3d ago·2m read

173

Possible mistake EAs are making and shout out to Pause AI UK

Michelle_Hutchinson·2w ago·4m read

MichaelA🔸

Some good stuff from The Precipice on this, mainly from footnotes:

The ultimate aim of the Long Reflection would be to achieve a final answer to the question of which is the best kind of future for humanity. [...]

We would not need to fully complete this process before moving forward. What is essential is to be sufficiently confident in the broad shape of what we are aiming at before taking each bold and potentially irreversible action - each action that could plausibly lock in substantial aspects of our future trajectory.

Also:

We might adopt the guiding principle of minimising lock-in. Or to avoid the double negative, of preserving our options.

[Endnote:] Note that even on this view options can be instrumentally bad if they would close off many other options. So there would be instrumental value to closing off such options (for example, the option of deliberately causing our own extinction). One might thus conclude that the only thing we should lock in is the minimisation of lock-in.

This is an elegant and reasonable principle, but could probably be improved upon by simply delaying our ability to choose such options, or making them require a large supermajority (techniques that are often used when setting up binding multiparty agreements such as constitutions and contracts). That way we help avoid going extinct by accident (a clear failing of wisdom in any society), while still allowing for the unlikely possibility that we later come to realise our extinction would be for the best.

Also:

There may yet be ethical questions about our longterm future which demand even more urgency than existential security, so that they can’t be left until later. These would be important to find and should be explored concurrently with achieving existential security.

Somewhat less relevant:

Protecting our potential (and thus existential security more generally) involves locking in a commitment to avoid existential catastrophe. Seen in this light, there is an interesting tension with the idea of minimising lock-in (here [link]). What is happening is that we can best minimise overall lock-in (coming from existential risks) by locking in a small amount of other constraints.

But we should still be extremely careful locking anything in, as we might risk cutting off what would have turned out to be the best option. One option would be to not strictly lock in our commitment to avoid existential risk (e.g. by keeping total risk to a strict budget across all future centuries), but instead to make a slightly softer commitment that is merely very difficult to overturn. Constitutions are a good example, typically allowing for changes at later dates, but setting a very high bar to achieving this.

(This is also something I plan to discuss a bit more in those upcoming(ish) drafts.)

Quotes about the long reflection

Collection of sources that are highly relevant to the idea of the Long Reflection

Collection of sources that are highly relevant to the idea of the Long Reflection

Quotes about the long reflection

80,000 Hours interview with MacAskill

Quote from 80,000 Hours’ summary

Quotes from the interview itself

Quotes from an AI Alignment Podcast interview with MacAskill

Cause prioritization for downside-focused value systems by Lukas Gloor

Quote from the article

My commentary

Other places where the term was used in a relevant way

Some other somewhat relevant concepts