Safety isn’t safety without a social model (or: dispelling the myth of per se technical safety)

Andrew Critch

Comments 3

Thanks for the post!

What do you think about Open Philanthropy's grants in AI alignment (e.g., https://www.openphilanthropy.org/grants/funding-for-ai-alignment-projects-working-with-deep-learning-systems/)? Do you think their expected value (EV) is positive?

And what do you think about 80,000 Hours recommending that people join big AI labs?

Davidmanheim

{making humanity more safe VS shortening AGI timelines} is itself a false dichotomy or false spectrum.
Why? Because in some situations, shortening AGI timelines could make humanity more safe, such as by avoiding an overhang of over-abundant computing resources that AGI could abruptly take advantage of if it’s invented too far in the future (the “compute overhang” argument).

I think this also ignores the counterfactual world with less safety research. In that world, the equivalent advances are funded by commercial incentives and come from less generalizable safety research, and we end up with systems that are similarly capable but less well prosaically aligned. (I haven't really laid out this argument before, but I think it generalizes to the counterfactual world in which OpenAI, or even DeepMind, was never inspired by AI safety concerns.)

SummaryBot

Executive summary: Technical AI safety and alignment advances are not intrinsically safe or helpful to humanity, and their impact depends on the social context in which they are applied.

Key points:

All technical AI safety and alignment advances can potentially be misused by humans to cause harm.
AI safety and alignment advances often shorten AI development timelines by boosting capabilities.
Shortening AGI timelines is not always harmful and could improve safety in some scenarios.
Modeling the social landscape and anticipating how ideas will be used is crucial for ensuring positive impact.
Powerful social forces encourage conflating important AI concepts to build alliances, requiring active effort to maintain clarity.

This comment was auto-generated by the EA Forum Team. Feel free to point out issues with this summary by replying to the comment, and contact us if you have feedback.
