This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
Effective Altruism Forum
EA Forum
Login
Sign up
AI Safety Newsletter
Get notified
38
AI Safety Newsletter #1 [CAIS Linkpost]
[anonymous]
·
2y
ago
0
0
56
AI Safety Newsletter #2: ChaosGPT, Natural Selection, and AI Safety in the Media
2 authors
Oliver Z
,
Dan H
·
2y
ago
· 4m read
1
1
35
AI Safety Newsletter #3: AI policy proposals and a new challenger approaches
2 authors
Oliver Z
,
Dan H
·
2y
ago
· 5m read
1
1
35
AI Safety Newsletter #4: AI and Cybersecurity, Persuasive AIs, Weaponization, and Geoffrey Hinton talks AI risks
2 authors
Center for AI Safety
,
Dan H
·
2y
ago
· 6m read
2
2
60
AI Safety Newsletter #5: Geoffrey Hinton speaks out on AI risk, the White House meets with AI labs, and Trojan attacks on language models
2 authors
Center for AI Safety
,
Dan H
·
2y
ago
· 5m read
0
0
32
AI Safety Newsletter #6: Examples of AI safety progress, Yoshua Bengio proposes a ban on AI agents, and lessons from nuclear arms control
2 authors
Center for AI Safety
,
Dan H
·
2y
ago
· 7m read
1
1
23
AI Safety Newsletter #7: Disinformation, Governance Recommendations for AI labs, and Senate Hearings on AI
2 authors
Center for AI Safety
,
Dan H
·
2y
ago
· 8m read
0
0
16
AI Safety Newsletter #8: Rogue AIs, how to screen for AI risks, and grants for research on democratic governance of AI
2 authors
Center for AI Safety
,
Dan H
·
2y
ago
· 7m read
3
3
12
AISN #9: Statement on Extinction Risks, Competitive Pressures, and When Will AI Reach Human-Level?
2 authors
Center for AI Safety
,
Dan H
·
2y
ago
· 9m read
2
2
30
AISN #12: Policy Proposals from NTIA’s Request for Comment and Reconsidering Instrumental Convergence
2 authors
Center for AI Safety
,
Dan H
·
2y
ago
· 8m read
3
3
25
AISN #13: An interdisciplinary perspective on AI proxy failures, new competitors to ChatGPT, and prompting language models to misbehave
2 authors
Center for AI Safety
,
Dan H
·
2y
ago
· 10m read
0
0
26
AISN#14: OpenAI’s ‘Superalignment’ team, Musk’s xAI launches, and developments in military AI use
2 authors
Center for AI Safety
,
Dan H
·
2y
ago
· 5m read
0
0
7
AISN #16: White House Secures Voluntary Commitments from Leading AI Labs and Lessons from Oppenheimer
3 authors
Center for AI Safety
,
Dan H
,
Corin Katzke
·
2y
ago
· 7m read
0
0
15
AISN #17: Automatically Circumventing LLM Guardrails, the Frontier Model Forum, and Senate Hearing on AI Oversight
2 authors
Center for AI Safety
,
Dan H
·
2y
ago
· 9m read
0
0
12
AISN #18: Challenges of Reinforcement Learning from Human Feedback, Microsoft’s Security Breach, and Conceptual Research on AI Safety
2 authors
Center for AI Safety
,
Dan H
·
2y
ago
· 6m read
0
0
12
AISN #20: LLM Proliferation, AI Deception, and Continuing Drivers of AI Capabilities
2 authors
Center for AI Safety
,
Dan H
·
2y
ago
· 10m read
0
0
13
AISN #21: Google DeepMind’s GPT-4 Competitor, Military Investments in Autonomous Drones, The UK AI Safety Summit, and Case Studies in AI Policy
2 authors
Center for AI Safety
,
Dan H
·
2y
ago
· 5m read
0
0
15
AISN #22: The Landscape of US AI Legislation - Hearings, Frameworks, Bills, and Laws
2 authors
Center for AI Safety
,
Dan H
·
2y
ago
· 6m read
1
1
7
AISN #23: New OpenAI Models, News from Anthropic, and Representation Engineering
2 authors
Center for AI Safety
,
Dan H
·
1y
ago
· 6m read
0
0
16
AISN #24: Kissinger Urges US-China Cooperation on AI, China's New AI Law, US Export Controls, International Institutions, and Open Source AI
3 authors
Center for AI Safety
,
Dan H
,
Corin Katzke
·
1y
ago
· 7m read
1
1
21
AISN #25: White House Executive Order on AI, UK AI Safety Summit, and Progress on Voluntary Evaluations of AI Risks
2 authors
Center for AI Safety
,
Dan H
·
1y
ago
· 7m read
0
0
11
AISN #26: National Institutions for AI Safety, Results From the UK Summit, and New Releases From OpenAI and xAI
4 authors
Center for AI Safety
,
Corin Katzke
,
allisoncyhuang
,
Dan H
·
1y
ago
· 7m read
0
0
10
AISN #27: Defensive Accelerationism, A Retrospective On The OpenAI Board Saga, And A New AI Bill From Senators Thune And Klobuchar
4 authors
Center for AI Safety
,
Dan H
,
Corin Katzke
,
allisoncyhuang
·
1y
ago
· 7m read
0
0
17
AISN #28: Center for AI Safety 2023 Year in Review
2 authors
Center for AI Safety
,
Dan H
·
1y
ago
· 6m read
1
1
5
AISN #29: Progress on the EU AI Act Plus, the NY Times sues OpenAI for Copyright Infringement, and Congressional Questions about Research Standards in AI Safety
3 authors
Center for AI Safety
,
Dan H
,
Corin Katzke
·
1y
ago
· 7m read
0
0
7
AISN #30: Investments in Compute and Military AI Plus, Japan and Singapore’s National AI Safety Institutes
3 authors
Center for AI Safety
,
Dan H
,
Corin Katzke
·
1y
ago
· 7m read
1
1
27
AISN #31: A New AI Policy Bill in California Plus, Precedents for AI Governance and The EU AI Office
2 authors
Center for AI Safety
,
Dan H
·
1y
ago
· 8m read
0
0
15
AISN #32: Measuring and Reducing Hazardous Knowledge in LLMs Plus, Forecasting the Future with LLMs, and Regulatory Markets
3 authors
Center for AI Safety
,
Corin Katzke
,
Dan H
·
1y
ago
· 10m read
2
2
19
AISN #33: Reassessing AI and Biorisk Plus, Consolidation in the Corporate AI Landscape, and National Investments in AI
4 authors
Center for AI Safety
,
Corin Katzke
,
AlexaPanYue
,
Dan H
·
1y
ago
· 11m read
0
0
21
AISN #34: New Military AI Systems Plus, AI Labs Fail to Uphold Voluntary Commitments to UK AI Safety Institute, and New AI Policy Proposals in the US Senate
3 authors
Center for AI Safety
,
Corin Katzke
,
Dan H
·
11mo
ago
· 10m read
5
5
14
AISN #35: Lobbying on AI Regulation Plus, New Models from OpenAI and Google, and Legal Regimes for Training on Copyrighted Data
3 authors
Center for AI Safety
,
Corin Katzke
,
Dan H
·
10mo
ago
· 7m read
0
0
6
AISN #36: Voluntary Commitments are Insufficient Plus, a Senate AI Policy Roadmap, and Chapter 1: An Overview of Catastrophic Risks
4 authors
Center for AI Safety
,
Corin Katzke
,
Julius
,
Dan H
·
10mo
ago
· 6m read
0
0
15
AI Safety Newsletter #37: US Launches Antitrust Investigations Plus, recent criticisms of OpenAI and Anthropic, and a summary of Situational Awareness
5 authors
Center for AI Safety
,
Corin Katzke
,
AlexaPanYue
,
Julius
,
Dan H
·
9mo
ago
· 6m read
0
0
8
AISN #38: Supreme Court Decision Could Limit Federal Ability to Regulate AI Plus, “Circuit Breakers” for AI systems, and updates on China’s AI industry
5 authors
Center for AI Safety
,
Corin Katzke
,
AlexaPanYue
,
Julius
,
Dan H
·
9mo
ago
· 5m read
0
0
17
AI Safety Newsletter #40: California AI Legislation Plus, NVIDIA Delays Chip Production, and Do AI Safety Benchmarks Actually Measure Safety?
5 authors
Center for AI Safety
,
Corin Katzke
,
Julius
,
AlexaPanYue
,
Dan H
·
7mo
ago
· 7m read
0
0
6
AI Safety Newsletter #39: Implications of a Trump Administration for AI Policy Plus, Safety Engineering
5 authors
Center for AI Safety
,
Corin Katzke
,
AlexaPanYue
,
Julius
,
Dan H
·
8mo
ago
· 7m read
0
0
12
AI Safety Newsletter #41: The Next Generation of Compute Scale Plus, Ranking Models by Susceptibility to Jailbreaking, and Machine Ethics
5 authors
Center for AI Safety
,
Corin Katzke
,
Julius
,
andrewz
,
Dan H
·
7mo
ago
· 6m read
0
0
10
AI Safety Newsletter #42: Newsom Vetoes SB 1047 Plus, OpenAI’s o1, and AI Governance Summary
6 authors
Center for AI Safety
,
Corin Katzke
,
Julius
,
AlexaPanYue
,
andrewz
,
Dan H
·
6mo
ago
· 7m read
0
0
6
AI Safety Newsletter #43: White House Issues First National Security Memo on AI Plus, AI and Job Displacement, and AI Takes Over the Nobels
4 authors
Center for AI Safety
,
Corin Katzke
,
AlexaPanYue
,
Dan H
·
5mo
ago
· 7m read
0
0
11
AISN #44: The Trump Circle on AI Safety Plus, Chinese researchers used Llama to create a military tool for the PLA, a Google AI system discovered a zero-day cybersecurity vulnerability, and Complex Systems
5 authors
Center for AI Safety
,
Corin Katzke
,
Julius
,
andrewz
,
Dan H
·
4mo
ago
· 6m read
0
0