DeepMind: Model evaluation for extreme risks

Zach Stein-Perlman

DeepMind: Model evaluation for extreme risks

Zach Stein-Perlman

1 min readMay 25, 2023

Comments 3

Sorted by

New & upvoted

Broderick McDonald

Useful model evals but more focus is needed on near-term risks from malicious actors

Michael_Wiebe

Side note: what's up with "model evals"? Seems like a jargony term that excludes outsiders.

Zeusfyi

-1

This Is where I depart from most others:

1. If you cannot define intelligence generalization scientifically in a complete and measurable way then this is a complete waste of time; you cannot assess risk usefully for something you cannot measure usefully. This is science 101

Here’s our definition at Zeusfyi

We define generalization in the context of intelligence, as the ability to generate learned differentiation of subsystem components, then manipulate, and build relationships towards greater systems level understanding of the universal construct that governs the reality. This is not possible if physics weren’t universal for feedback to be derived. Zeusfyi, Inc is the only institution that has scientifically defined intelligence generalization. The purest test for generalization ability; create a construct with systemic rules that define all possible outcomes allowed; greater ability to predict more actions on first try over time; shows greater generalization; with >1 construct; ability to do same; relative to others.

Comments

More from the author

220

FLI open letter: Pause giant AI experiments

Zach Stein-Perlman·3y ago·3m read

134

Maybe Anthropic's Long-Term Benefit Trust is powerless

Zach Stein-Perlman·2y ago·3m read

128

Introducing AI Lab Watch

Zach Stein-Perlman·2y ago·2m read

Curated and popular this week

Cultivating hope: calibrating the expectations for cultivated meat to end factory farming

PabloAMC 🔸·1w ago·Curated 4d ago·22m read

111

Maybe do the thing you wish CEA would do

alejoacelas 🔸·4d ago·2m read

I used AI to fix transcription errors, rerrarange the ideas, and suggest tweaks to the title and some sentences. Three of the most exciting projects to come out of EA in recent years are, in a vague sense, CEA spinouts: * Kairos is directly a spinout of CEA and now handles most support for university AI safety groups. Basically everyone I've found who knows them is really excited about what they do * NEST is an opinionated ideas-fi...

GWWC's 2025 impact evaluation (executive summary)

Aidan Whitfield🔸, Giving What We Can🔸·6d ago·2m read

This post presents the executive summary from Giving What We Can’s impact evaluation for 2025. At the end of this post we share links to more information, including the full report and...

Recent opportunities to take action

New round of digital minds funding opportunities at Longview

zdgroff, Longview Philanthropy·23h ago·2m read

We've Just Launched Our Grant Readiness Course

Deena Englander·12h ago·1m read

RP is looking for project founders in neglected animal areas

Rethink Priorities·4d ago·7m read