All of Andrew Critch's Comments + Replies

I'm very pleased to see this line of reasoning being promoted. Mutual transparency of agents (information permeability of boundaries) is a really important feature of & input to real-world ethics; thanks for expounding on it!

Thanks for calling this out, whoever you are. This is another reason why, for the last ~5 years or so, I've been pushing for the terminology "AI x-risk" and "AI x-safety" to replace "(long-term) AI risk" and "(long-term) AI safety". For audiences not familiar with the "x" or "existential" terminology, one can say "large-scale risk" to point to the large stakes, rather than always saying "long-term".

(Also, the fact that I don't know who you are is actually fairly heartening :)

JakubK · 2y
"AGI risk" and "AGI safety" also seem good. But they don't immediately convey the high stakes.

Neel, I agree with this sentiment, provided that it does not lead to extremist actions to prevent x-risk (see https://www.lesswrong.com/posts/Jo89KvfAs9z7owoZp/pivotal-act-intentions-negative-consequences-and-fallacious).

Specifically, I agree that we should be explicit about existential safety — and in particular, AI existential safety — as a broadly agreeable and understandable cause area that does not depend on EA, longtermism, or other niche communities/stances. This is the main reason AI Research Considerations for Human Existential Safety (ARCHES; h...

Scott, thanks so much for this post. It's been years in coming, in my opinion. FWIW, my reason for making ARCHES (AI Research Considerations for Human Existential Safety) explicitly about existential risk, and not about "AI safety" or some other glomarization, is that I think x-risk and x-safety are not long-term/far-off concerns that can be procrastinated away.

https://forum.effectivealtruism.org/posts/aYg2ceChLMRbwqkyQ/ai-research-considerations-for-human-existential-safety  (with David Krueger)

Ideally, we need to engage as many resear...

the way I now think about these scenarios is that there's a tradeoff between technical ability and political ability

I also like this, and appreciate you pointing out a tradeoff where the discourse was presenting an either-or decision. I'd actually considered a follow-up post on the Pareto boundary between unilaterally maximizing (altruistic) utility and multilaterally preserving coordination boundaries and consent norms. Relating your ontology to mine, I'd say that in the AGI arena, technical ability contributes more to the former (unilaterally maximizing...) than to the latter (multilaterally preserving...), and political ability contributes more to the latter than to the former.

saying "this test isn't legitimate" feels like a conceptual smudge that tries to blend all those claims together, as if each implied all of the others.

This is my favorite piece of feedback on this post so far, and I agree with it; thanks!

To clarify what I meant, I've changed the text to read "making evidence available to support global regulatory efforts from a broader base of consensual actors (see Part 3)."