What does Bing Chat tell us about AI risk?

Holden Karnofsky

Comments 8

Nathan Young

Thanks for writing this.

The top image is a nice example of good science communication that we could do more of.

It is:

Simple (it's a big squiggly monster with a smiley-face mask)
Conveys an important idea (LLMs might seem friendly, but we don't understand how they work, and if you dig a bit they are utterly alien to us, providing strange and scary answers)
High fidelity (this image has already been remixed 1000s of times and it still always conveys its central idea)
Emotionally resonant ("oh, this is concerning, what should we do about it?")

(And it was initially created by EA twitter's very own Tetraspace)

https://twitter.com/TetraspaceWest/status/1608966939929636864?s=20

Holden and Yudkowsky are both very good at turning complicated ideas into simple, shareable ones, such as paperclips and the King Lear problem.

If you find you have a talent for explaining things (or doodling), you might do well to make memes like this. Who knows where they will end up? (I am assuming that explaining things well is net good, but I guess it has very large variance.)

Yonatan Cale

I agree with everything, and still want to point out that not long afterward, Musk decided to try removing the "woke" part, so maybe he shared this meme for different reasons than you or I would share it.

Fighting ‘Woke AI,’ Musk Recruits Team to Develop OpenAI Rival

Writer

This article is evidence that Elon Musk will focus on the "wokeness" of ChatGPT rather than do something useful about AI alignment. Still, we should keep in mind that news reports are very often incomplete or simply false.

Also, I can't access the article.

Related: I've recently created a prediction market about whether Elon Musk is going to do something positive for AI risk (or at least not do something counterproductive) according to Eliezer Yudkowsky's judgment: https://manifold.markets/Writer/if-elon-musk-does-something-as-a-re?r=V3JpdGVy

Yonatan Cale

+1 for creating that market! :)

Writer

Hard agree, the shoggoth memes are great.

MichaelDello

I strongly agree that current LLMs don't seem to pose a risk of a global catastrophe, but I'm worried about what might happen when LLMs are combined with things like digital virtual assistants that have outputs other than generated text. Even if such a system can only make bookings, send emails, etc., I feel like things could get concerning very fast.

Is there an argument for having AI fail spectacularly in a small way which raises enough global concern to slow progress/increase safety work? I'm envisioning something like an LLM virtual assistant that leads to a lot of lost productivity and some security breaches but nothing too catastrophic, which makes people take AI safety seriously and perhaps slows progress on more advanced AI.

A complete spitball.

titotal

Is there an argument for having AI fail spectacularly in a small way which raises enough global concern to slow progress/increase safety work?

Given that AI is being developed by companies running on a "move fast and break things" philosophy, a spectacular failure of some sort is all but guaranteed.

It'd have to be bigger than mere lost productivity to slow things down, though. Social media algorithms arguably already have a body count (via radicalisation), and those have not been slowed down.

MichaelDello

Very fair response, thanks!
