Reviews of "Is power-seeking AI an existential risk?"

Joe_Carlsmith

(Edited 10/14/2022 to include Lifland review. Edited 7/10/23 to include Levinstein review. Edited 10/18/2023 to include superforecaster reviews.)

Open Philanthropy solicited reviews of my draft report “Is power-seeking AI an existential risk?” (Edit: arXiv version here) from various sources. Where the reviewers allowed us to make their comments public in this format, links to these comments are below, along with some responses from me in blue.

Leopold Aschenbrenner
Ben Garfinkel
Daniel Kokotajlo
Ben Levinstein
Eli Lifland
Neel Nanda
Nate Soares
Christian Tarsney
David Thorstad
David Wallace
Anonymous 1 (software engineer at an AI research team)
Anonymous 2 (academic computer scientist)

The table below (spreadsheet link here) summarizes each reviewer’s probabilities and key objections.

Screenshot summary of linked spreadsheet

An academic economist focused on AI also provided a review, but they declined to make it public in this format.

Added 10/18/23: With funding from Open Philanthropy, Good Judgment also solicited reviews and forecasts from 21 superforecasters regarding the report -- see here for a summary of the results. These superforecasters completed a survey very similar to the one completed by the other reviewers, except with an additional question (see footnote) about the "multiple stage fallacy."^[1] Their aggregated medians were:

Good Judgment has also prepared more detailed summaries of superforecaster comments and forecasts here (re: my report) and here (re: the other timelines and X-risk questions). See here for some brief reflections on these results, and here for a public spreadsheet with the individual superforecasters' numbers and reviews (also screenshotted below).

Ben_West🔸, Dec 26 2021

This is really cool. The format makes it easy to see different conclusions and the cruxes people are relying on to reach those different conclusions. Thanks to you and the reviewers for doing this work, and spending the time to present it in such an easily understandable way!

Jonas V, Apr 11 2022

Very cool!

It would be convenient to have the specific questions that people give probabilities for (e.g. I think "timelines" refers to the year 2070?)

ChanaMessinger, Oct 17 2022

Seconding this.