Staged release

This is a crosspost from the new Animal Welfare Alignment Newsletter by Anima International. You can subscribe on Substack if you are interested in following these efforts. Audio reading also available on Substack. The goals of this post are to: 1. Raise a question I see as crucially important to the goal of aligning AI to animal welfare...

190

The first video from Giving What We Can's new channel is out now!

JustinPortela·5d ago·1m read

Hello! I'm Justin Portela. I got hired by GWWC to make YouTube videos after AI in Context did such a kickass job. My channel is using that same cinematic, high-production value beauty to talk about everything in the EA universe that isn't AI. ...

Let's taboo the V-word

lincolnq·22h ago·8m read

“How long have you been v*g*n?” This is one of the most common icebreakers at animal protection events. It’s a baseline assumption, and it mostly holds true: if you’re out advocating for animals not to be tortured or abused, realistically these days you are v**n, or close. And it makes for good conversation. It seems fairly safe to assume when you meet strangers. But this assumption is hurting the movement in a way which we don’t always notice: someone new comes into the sp...

Recent opportunities to take action

EA Organisation Updates thread: July 2026

Dane Valerie·4h ago·1m read

The EA Opportunities Board now has full-time roles

Agnes Hasselblad 🔸·6h ago·3m read

Hiring: Grants & Operations Associate - Giving What We Can

Giving What We Can🔸, Zou Xinyi 🔸·1h ago·2m read

^{^}

Toby Shevlane's dissertation. I don't recommend reading it.

^{^}

From the GPT-2 staged release OpenAI report:

In February 2019, we released the 124 million parameter GPT-2 language model. In May 2019, we released the 355 million parameter model and a dataset of outputs from all four models (124 million, 355 million, 774 million, and 1.5 billion parameters) to aid in training humans and classifiers to detect synthetic text, and assessing biases encoded in GPT-2 generated outputs. In August, we released our 774 million parameter model along with the first version of this report and additional release documentation on GitHub. We are now [in November] releasing our 1.5 billion parameter version of GPT-2 with this updated report and updated documentation.

^{^}

This post mostly-arbitrarily uses "release" and not "deploy." (I believe "deployment" includes use exclusively within the lab while "release" requires external use; in this post we're basically concerned with misuse by actors outside the lab.)

^{^}

Or rather, models that are plausibly nontrivially better for some misuse-related tasks than any other released model.