Distribution Shifts and The Importance of AI Safety

Leon Lang

Distribution Shifts and The Importance of AI Safety

Leon Lang

11 min readSep 29, 2022

Comments

Sorted by

New & upvoted

No comments on this post yet.

Be the first to respond.

Comments

Curated and popular this week

Cultivating hope: calibrating the expectations for cultivated meat to end factory farming

PabloAMC 🔸·1w ago·Curated 5d ago·22m read

113

Maybe do the thing you wish CEA would do

alejoacelas 🔸·4d ago·2m read

I used AI to fix transcription errors, rerrarange the ideas, and suggest tweaks to the title and some sentences. Three of the most exciting projects to come out of EA in recent years are, in a vague sense, CEA spinouts: * Kairos is directly a spinout of CEA and now handles most support for university AI safety groups. Basically everyone I've found who knows them is really excited about what they do * NEST is an opinionated ideas-fi...

RP is looking for project founders in neglected animal areas

Rethink Priorities·4d ago·7m read

TLDR; To help the effective animal advocacy movement cost-effectively absorb greater amounts of funding in the near future, we are seeking expressions of interest from people who could found a new organization focused on: * Highly neglected animals: insects, wild animals, shrimp, fish, etc, or * AI and animals: AI alignment and governance for animal welfare, strategic actions considering transformative AI, AI for wild animals, etc. * ...

Recent opportunities to take action

New round of digital minds funding opportunities at Longview

zdgroff, Longview Philanthropy·1d ago·2m read

Strategy Roles - Fortify Health

Tony Senanayake·9h ago·1m read

RP is looking for project founders in neglected animal areas

Rethink Priorities·4d ago·7m read

^{^}

However, there have been concerns that a representation of a goal — equal or different from the specified goal — may emerge in the ML system. See the section on inner alignment in the appendix.

^{^}

See also the Chinchilla paper for an updated view on scaling laws.

^{^}

Some would narrow this down further and replace the “AI’s behavior” with what the AI is “trying” to do, see Paul Christiano’s definition of the full alignment problem encompassing both inner and outer alignment.

Distribution Shifts and The Importance of AI Safety

Distribution Shifts and The Importance of AI Safety

Preface

Introduction

The Core Argument

Machine Learning

Distribution Shifts and Alignment Failures

High-Level Machine Intelligence Changes the Picture

A Push Toward AI Autonomy

Increased Rate of Innovation

The Resulting Disempowerment of Humanity

High-Level Machine Intelligence Might Arrive Soon

Conclusion — AI Safety Should be a Key Research Priority of Our Time

Appendix: A Broader Overview of Safety Concerns

Inner Alignment

Outer Alignment

Risks from Superintelligence

AI Governance