What's in a Pause?

Davidmanheim

What's in a Pause?

Comments

More from the author

$1 billion is not enough; OpenAI Foundation must start spending tens of billions each year

Davidmanheim·3mo ago·5m read

You Aren't in Charge of the Overton Window; Politics Is Not Interior Design

Davidmanheim·2mo ago·14m read

Who sets my org's agenda?

Davidmanheim·6mo ago·7m read

Curated and popular this week

What would an animal-aligned AI be aligned to?

Aidan Kankyoku, Anima International·1w ago·Curated 5d ago·15m read

This is a crosspost from the new Animal Welfare Alignment Newsletter by Anima International. You can subscribe on Substack if you are interested in following these efforts. Audio reading also available on Substack. The goals of this post are to: 1. Raise a question I see as crucially important to the goal of aligning AI to animal welfare...

201

The first video from Giving What We Can's new channel is out now!

JustinPortela·6d ago·1m read

Hello! I'm Justin Portela. I got hired by GWWC to make YouTube videos after AI in Context did such a kickass job. My channel is using that same cinematic, high-production value beauty to talk about everything in the EA universe that isn't AI. ...

115

Let's taboo the V-word

lincolnq·1d ago·8m read

“How long have you been v*g*n?” This is one of the most common icebreakers at animal protection events. It’s a baseline assumption, and it mostly holds true: if you’re out advocating for animals not to be tortured or abused, realistically these days you are v**n, or close. And it makes for good conversation. It seems fairly safe to assume when you meet strangers. But this assumption is hurting the movement in a way which we don’t always notice: someone new comes into the sp...

Recent opportunities to take action

Lukas_Gloor

Moving beyond current needs, as both a way to ensure that domestic policy doesn’t get stuck dealing with immediate economic, equity, and political issues, I think we should push for an ambitious intermediate goal to promote the adoption of international standards regarding high-risk future models. To that end, I would call for every country to pass laws today that will trigger a full ban on deploying or training AI systems larger than GPT-4 which have not been reviewed by an international regulatory body with authority to reject applications, starting in 2025, pending international governance regimes with mandatory review provisions for potentially dangerous applications and models. This isn’t helpful for the most obvious immediate risks and economic impacts of AI - and for exactly that reason, it’s critical as a way to ensure the tremendous future risks aren’t ignored.

I strongly agree with that.

You don't talk much about compute caps as a lever elsewhere in the text, so I'm going to paste some passages I wrote on why I'm excited about compute-related interventions to slow down AI. (My summary on slowing AI is available on the database for AI governance researchers – if anyone is planning to work on this topic but doesn't have access to that database, feel free to email me and I can give you access to a copy.)

Compute seems particularly suited for governance measures: it’s quantifiable, can’t be used by multiple actors at once, and we can restrict access to it. None of these three factors apply to software (so it’s unfortunate that software progress plays a more significant role for AI timelines than compute increases). Monitoring compute access is currently difficult because compute is easy to transport, and we don’t know where much of it is. Still, we could help set up a database, demand reporting from sellers, and shift compute use from physical access to cloud computing or data center access (centralizing access helps with monitoring). The ideal target state for compute governance might be some kind of “moving bright line” of maximum compute allowances for training runs. (A static cap might be too difficult to enforce because compute costs to circumvent the cap will fall over time.) The regulation could be flexible so labs with a proven safety mindset can receive authorization to go beyond the cap. More ambitiously, there’s the idea of
hardware-enabled governance mechanisms (previous terminology: “on-chip measures”). These are tamper-proof mechanisms on chips (or on the larger hardware components of compute clusters) that would allow for actions like communicating information about a chip’s location or its past activity, remote shutdown, or restricting the chip’s communication with other chips (limiting the size of a training run it could be involved in). Hardware-enabled mechanisms don’t yet exist in a tamper-proof way, but NVIDIA has chips that illustrate the concept. I’m particularly excited about hardware-enabled governance mechanisms because they’re the only idea related to slowing AI progress that could (combined with an ambitious regulatory framework) address the problem as a whole, instead of just giving us a small degree of temporary slowdown. (Hardware-enabled mechanisms would also continue to be helpful after the first aligned TAI is developed – it’s not like coordination challenges will automatically go away at the point when an aligned AI is first developed.) Widespread implementation of such mechanisms is several years away even in a best-case scenario, so it seems crucial to get started.
Onni Arne and Lennart Heim have been looking into hardware-enabled governance mechanisms. (My sense from talking to them is that when it comes to monitoring and auditing of compute, they see the most promise in measures that show a chip's past activity, "proof of non-training.") Yonadav Shavit also works on compute governance and seems like a great person to talk to about this.

And here's an unfortunate caveat about how compute governance may not be sufficient to avoid an AI catastrophe:

Software progress vs. compute: I’m mostly writing my piece based on the assumption that software progress and compute growth are both important levers (with software progress being the stronger one). However, there’s a view on which algorithmic improvements are a lot jumpier than Ajeya Cotra assumes in her “2020 compute training requirements” framework. If so, and if we’re already in a compute overhang (in the sense that it’s realistic to assume that new discoveries could get us to TAI with current levels of compute), it could be tremendously important to prevent algorithmic exploration by creative ML researchers, even at lower-than-cutting-edge levels of compute. (Also, the scaling hypothesis would likely be false or at least incomplete in that particular world, and compute restrictions would matter less since building TAI would mainly require software breakthroughs.) In short, if the road to TAI is mostly through algorithmic breakthroughs, we might be in a pretty bad situation in terms of not having available promising interventions to slow down progress.

But there might still be some things to do to slow progress a little bit, such as improving information security to prevent leakage of insights from leading labs, and export controls on model weights.

^{^}

Including OpenAI’s Sutskever and Altman, Gary Marcus, Anthropic, and even LeCun, who publicly says that no-one should build dangerous AI - but excluding Andreeson, who is opposed to any restrictions, and instead suggests we fight misuse of AI with more AI.

^{^}

That said, we shouldn’t ignore the harms AI is doing now - and restricting already illegal or harmful uses is certainly more than justified, and I applaud work being done by AI ethicists and other groups in that direction. The people in the basement should be saved, which is sufficient justification for many of the proposed policies - but I think milquetoast policy papers don’t help that cause either, and these abuses need more drastic responses.

^{^}

The obligation for the Biological Weapons Convention is clear that countries are in violation if they allow bioweapons to be developed, even if the state party itself was not involved. The requirements for the chemical weapons convention are less broad, so AI used for chemical weapons development is not directly banned by the treaty - though it certainly seems worth considering how to prevent it. And as noted, it is currently technically and bureaucratically impossible to monitor or ban such uses.

^{^}

In the interim, joint-and-several liability for developers, application providers, and users for misuse, copyright violation, and illegal discrimination would be a useful initial band-aid; among other things, this provides motive for companies to help craft regulation to provide clear rules about what is needed to ensure on each party’s behalf that they will not be financially liable for a given use, or misuse.

What's in a Pause?

What's in a Pause?

What Does a Moratorium Include?

When and How Do We Stop?

What are concrete steps forward?

Monitoring AI Systems Now

Enforcing Extant Laws

Plan for Future Governance and Policy

Conclusion

Notes