Effective Altruism Forum
Topics
EA Forum

Hide table of contents

AI evaluations and standards

AI evaluations and standards

Further reading

Related entries

Contributors

Parent Topic: AI safety

AI evaluations and standards (or "evals") are processes that check or audit AI models. Evaluations can focus on how powerful models are (“capability evaluations”) and on whether models are exhibiting dangerous behaviors or are misaligned (“alignment evaluations” or "safety evaluations").Working on AI evaluations might involve developing standards and enforcing compliance with the standards.Evaluations can help labs determine whether it's safe to deploy new models, and can help with AI governance and regulation.

...

Posts tagged AI evaluations and standards

Relevance

49

DeepMind: Model evaluation for extreme risks

Zach Stein-Perlman

· 2y ago

9

9

128

How technical safety standards could promote TAI safety

· 3y ago · 9m read

4

4

90

AI Safety Seems Hard to Measure

Holden Karnofsky

· 2y ago · 17m read

4

4

79

Racing through a minefield: the AI deployment problem

Holden Karnofsky

· 2y ago · 16m read

4

4

73

Case studies on social-welfare-based standards in various industries

Holden Karnofsky

· 10mo ago · 1m read

4

4

39

Trendlines in AIxBio evals

· 6mo ago · 14m read

4

4

121

AI Governance Needs Technical Work

· 3y ago · 9m read

3

3

117

12 tentative ideas for US AI policy (Luke Muehlhauser)

· 2y ago · 5m read

3

3

50

AI Risk Management Framework | NIST

𝕮𝖎𝖓𝖊𝖗𝖆

· 2y ago

3

3

20

Announcing ForecastBench, a new benchmark for AI and human forecasting abilities

Forecasting Research Institute

· 7mo ago · 3m read

3

3

7

The case for more ambitious language model evals

· 1y ago · 6m read

3

3

5

[Cause Exploration Prizes] Creating a “regulatory turbocharger” for EA relevant policies

Open Philanthropy

· 3y ago · 14m read

3

3

327

Nobody’s on the ball on AGI alignment

· 2y ago · 11m read

2

2

158

Announcing Apollo Research

· 2y ago

2

2

123

High-level hopes for AI alignment

Holden Karnofsky

· 2y ago · 23m read

2

2