AI evaluations and standards

Parent Topic: AI safety

AI evaluations and standards (or "evals") are processes that check or audit AI models. Evaluations can focus on how powerful models are ("capability evaluations") and on whether models exhibit dangerous behaviors or are misaligned ("alignment evaluations" or "safety evaluations"). Working on AI evaluations might involve developing standards and enforcing compliance with them. Evaluations can help labs determine whether it is safe to deploy new models, and can support AI governance and regulation.

...

Posts tagged AI evaluations and standards

328

Nobody’s on the ball on AGI alignment

· 3y ago · Curated 3y ago · 11m read

2

165

Retrospective and Learnings from AI in Context’s First Two Videos

· 2mo ago · 11m read

1

158

Announcing Apollo Research

· 3y ago

2

141

Road to AnimalHarmBench

Artūrs Kaņepājs

· 8mo ago · Curated 8mo ago · 8m read

1

134

Why I am Still Skeptical about AGI by 2030

· 10mo ago · 7m read

1

123

High-level hopes for AI alignment

Holden Karnofsky

· 3y ago · Curated 3y ago · 23m read

2

122

AI Governance Needs Technical Work

· 3y ago · 9m read

3

117

12 tentative ideas for US AI policy (Luke Muehlhauser)

· 3y ago · 5m read

3

117

What is the EU AI Act and why should you care about it?

· 4y ago · 8m read

1

114

Success without dignity: a nearcasting story of avoiding catastrophe by luck

Holden Karnofsky

· 3y ago

2

109

Supplement to "The Brussels Effect and AI: How EU AI regulation will impact the global AI market"

MarkusAnderljung

· 4y ago · 9m read

2

104

What AI companies can do today to help with the most important century

Holden Karnofsky

· 3y ago · 13m read

2

100

Anthropic is Quietly Backpedalling on its Safety Commitments

· 9mo ago · 7m read

1

99

Seeking (Paid) Case Studies on Standards

Holden Karnofsky

· 3y ago

2

90

AI Safety Seems Hard to Measure

Holden Karnofsky

· 3y ago · 17m read

4
