This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
Effective Altruism Forum
AI Safety Newsletter
EA Forum
Login
Sign up
AI Safety Newsletter
38
AI Safety Newsletter #1 [CAIS Linkpost]
Akash
Akash
+ 0 more
·
6mo
ago
0
0
56
AI Safety Newsletter #2: ChaosGPT, Natural Selection, and AI Safety in the Media
Oliver Z
Oliver Z
,
Dan H
,
Akash
,
aogara
+ 0 more
·
5mo
ago
· 4m read
1
1
35
AI Safety Newsletter #3: AI policy proposals and a new challenger approaches
Oliver Z
Oliver Z
,
Dan H
,
Akash
,
aogara
+ 0 more
·
5mo
ago
· 5m read
1
1
35
AI Safety Newsletter #4: AI and Cybersecurity, Persuasive AIs, Weaponization, and Geoffrey Hinton talks AI risks
Center for AI Safety
Center for AI Safety
,
Dan H
,
Akash
,
aogara
+ 0 more
·
5mo
ago
· 6m read
2
2
60
AI Safety Newsletter #5: Geoffrey Hinton speaks out on AI risk, the White House meets with AI labs, and Trojan attacks on language models
Center for AI Safety
Center for AI Safety
,
Dan H
,
Akash
,
aogara
+ 0 more
·
5mo
ago
· 5m read
0
0
32
AI Safety Newsletter #6: Examples of AI safety progress, Yoshua Bengio proposes a ban on AI agents, and lessons from nuclear arms control
Center for AI Safety
Center for AI Safety
,
Dan H
,
Akash
,
aogara
+ 0 more
·
4mo
ago
· 7m read
1
1
23
AI Safety Newsletter #7: Disinformation, Governance Recommendations for AI labs, and Senate Hearings on AI
Center for AI Safety
Center for AI Safety
,
Dan H
,
Akash
,
aogara
+ 0 more
·
4mo
ago
· 8m read
0
0
16
AI Safety Newsletter #8: Rogue AIs, how to screen for AI risks, and grants for research on democratic governance of AI
Center for AI Safety
Center for AI Safety
,
Dan H
,
Akash
,
aogara
+ 0 more
·
4mo
ago
· 7m read
3
3
12
AISN #9: Statement on Extinction Risks, Competitive Pressures, and When Will AI Reach Human-Level?
Center for AI Safety
Center for AI Safety
,
Dan H
,
aogara
+ 0 more
·
4mo
ago
· 9m read
2
2
30
AISN #12: Policy Proposals from NTIA’s Request for Comment and Reconsidering Instrumental Convergence
Center for AI Safety
Center for AI Safety
,
Dan H
,
aogara
+ 0 more
·
3mo
ago
· 8m read
3
3
25
AISN #13: An interdisciplinary perspective on AI proxy failures, new competitors to ChatGPT, and prompting language models to misbehave
Center for AI Safety
Center for AI Safety
,
Dan H
,
aogara
+ 0 more
·
3mo
ago
· 10m read
0
0
26
AISN#14: OpenAI’s ‘Superalignment’ team, Musk’s xAI launches, and developments in military AI use
Center for AI Safety
Center for AI Safety
,
Dan H
+ 0 more
·
3mo
ago
· 5m read
0
0
7
AISN #16: White House Secures Voluntary Commitments from Leading AI Labs and Lessons from Oppenheimer
Center for AI Safety
Center for AI Safety
,
Dan H
,
Corin Katzke
,
aogara
+ 0 more
·
2mo
ago
· 7m read
0
0
15
AISN #17: Automatically Circumventing LLM Guardrails, the Frontier Model Forum, and Senate Hearing on AI Oversight
Center for AI Safety
Center for AI Safety
,
Dan H
,
aogara
+ 0 more
·
2mo
ago
· 9m read
0
0
12
AISN #18: Challenges of Reinforcement Learning from Human Feedback, Microsoft’s Security Breach, and Conceptual Research on AI Safety
Center for AI Safety
Center for AI Safety
,
aogara
,
Dan H
+ 0 more
·
2mo
ago
· 7m read
0
0
12
AISN #20: LLM Proliferation, AI Deception, and Continuing Drivers of AI Capabilities
Center for AI Safety
Center for AI Safety
,
aogara
,
Dan H
+ 0 more
·
1mo
ago
· 10m read
0
0
13
AISN #21: Google DeepMind’s GPT-4 Competitor, Military Investments in Autonomous Drones, The UK AI Safety Summit, and Case Studies in AI Policy
Center for AI Safety
Center for AI Safety
,
aogara
,
Dan H
+ 0 more
·
23d
ago
· 5m read
0
0
15
AISN #22: The Landscape of US AI Legislation - Hearings, Frameworks, Bills, and Laws
Center for AI Safety
Center for AI Safety
,
aogara
,
Dan H
+ 0 more
·
9d
ago
· 6m read
1
1