How Could AI Governance Go Wrong?

HaydnBelfield

Comments 7

Thanks for sharing your talk.

I'm at the UK's Competition and Markets Authority. Very happy to talk to anyone about the intersection of competition policy and AI.

[anonymous]

Good post! I'm curious if you have any thoughts on the potential conflicts or contradictions between the "AI ethics" community, which focuses on narrow AI and harms from current AI systems (members of this community include Gebru and Whittaker), and the AI governance community that has sprung out of the AI safety/alignment community (e.g. GovAI)? In my view, these two groups are quite opposed in priorities and ways of thinking about AI (take a look at Timnit Gebru's Twitter feed for a very stark example), and trying to put them under one banner doesn't really make sense. This contradiction seems to encourage some strange tactics (such as AI governance people proposing different regulations of narrow AI purely to slow down timelines rather than for any of the usual reasons given by the AI ethics community), which could lead to a significant backlash.

HaydnBelfield

Hi, yes, good question, and one that has been much discussed - here are three papers on the topic. I'm personally of the view that there shouldn't really be much conflict or contradiction - we're all pushing for the safe, beneficial and responsible development and deployment of AI, and there's lots of common ground.

Bridging near- and long-term concerns about AI

Bridging the Gap: the case for an Incompletely Theorized Agreement on AI policy

Reconciliation between Factions Focused on Near-Term and Long-Term Artificial Intelligence

tamgent

Agreed. One book that made it really clear for me was The Alignment Problem by Brian Christian. I think that book does a really good job of showing how it's all part of the same overarching problem area.

Peter Slattery 🔸

Thanks for taking the time to share this, Haydn. It was very useful.

To what extent do behavioural science and systems thinking/change matter for AI governance?

To give you my view: I think that nearly all outcomes that EA cares about are mediated by individual and group behaviours and decisions: who thinks what and who does what (e.g., with respect to careers, donations, and advocacy). All of this occurs in a broader context of social norms, laws, and so on.

Based on all this, I think that it is important to understand what people think and do, why they think and do what they do, and how to change that. It is also important to understand how contextual factors such as social norms and laws affect what people think and do, and how those factors can be changed.

I notice related work in areas such as climate change, and I expect that similar work will be needed in AI governance. However, I don't know the extent to which people working on AI governance share that view, or what work, if any, has been done. I'd be interested to hear any thoughts that you have time to share.

Also, I'd really appreciate it if you could suggest any good literature or people to engage with.

tamgent

I'm not Haydn, but I think behavioural science is a useful area for thinking about AI governance, in particular about the design of human-computer interfaces. One example with current widely deployed AI systems is recommender engines (though this is not an HCI example). I'm trying to understand the tendencies of recommenders towards biases like concentration, or contamination problems, and how they impact user behaviour and choice. I'm also interested in how well what they optimise for captures users' values, and whether any failure to do so comes from a misalignment of values between the user and the company or simply from human preferences being complex and hard to learn. In doing this, it's really tricky to distinguish in the wild between the choice architecture (the behavioural part) and the algorithm when attributing users' actions.

HaydnBelfield

Hi both,

Yes behavioural science isn't a topic I'm super familiar with, but it seems very important!

I think most of the focus so far has been on shifting norms/behaviour at top AI labs, for example nudging Publication and Release Norms for Responsible AI.

Recommender systems are a great example of a broader concern. Another is lethal autonomous weapons, where a big focus is "meaningful human control". Automation bias is an issue even up to the nuclear level - the concern is that people will trust ML systems more blindly, and won't disbelieve them as people did in several Cold War close calls (e.g. Petrov not believing his computer's warning of an attack). See Autonomy and machine learning at the interface of nuclear weapons, computers and people.

Jess Whittlestone's PhD was in Behavioural Science, and she's now Head of AI Policy at the Centre for Long-Term Resilience.

	Accident	Misuse	Structure
Near term	Concrete Problems	Malicious Use of AI	Flash Crash -> Flash War
Long term	Human Compatible	Superintelligence	‘Thinking About Risks From AI’

	Corporate	State
Racing	Race to the bottom, conflict
Dominance	Illegitimate, unsafe, misuse

	Corporate	State
Racing	Collaboration & cooperation	Arms control
Dominance	Antitrust & regulation	International constraints

Talk Transcript

My Background & CSER

Why care about existential risks?

What's AI governance?

How could AI governance go wrong? General argument: this could be a big deal

How could AI governance go wrong? Specific arguments: accident, misuse and structure

Racing vs dominance

What is to be done?