Distillation of "How Likely is Deceptive Alignment?"

NickGabs

Distillation of "How Likely is Deceptive Alignment?"

NickGabs

11 min readDec 1, 2022

Comments 1

Sorted by

New & upvoted

DavidW

Thanks for summarizing this! I have a very different perspective on the likelihood of deceptive alignment, and I'd be interested to hear what you think of it!

Comments

More from the author

Lessons from Three Mile Island for AI Warning Shots

NickGabs·3y ago·18m read

Stress Externalities More in AI Safety Pitches

NickGabs·3y ago·3m read

Curated and popular this week

Cultivating hope: calibrating the expectations for cultivated meat to end factory farming

PabloAMC 🔸·1w ago·Curated 5d ago·22m read

116

Maybe do the thing you wish CEA would do

alejoacelas 🔸·5d ago·2m read

I used AI to fix transcription errors, rerrarange the ideas, and suggest tweaks to the title and some sentences. Three of the most exciting projects to come out of EA in recent years are, in a vague sense, CEA spinouts: * Kairos is directly a spinout of CEA and now handles most support for university AI safety groups. Basically everyone I've found who knows them is really excited about what they do * NEST is an opinionated ideas-fi...

RP is looking for project founders in neglected animal areas

Rethink Priorities·5d ago·7m read

TLDR; To help the effective animal advocacy movement cost-effectively absorb greater amounts of funding in the near future, we are seeking expressions of interest from people who could found a new organization focused on: * Highly neglected animals: insects, wild animals, shrimp, fish, etc, or * AI and animals: AI alignment and governance for animal welfare, strategic actions considering transformative AI, AI for wild animals, etc. * ...