Effective Altruism Forum
EA Forum

AI Safety Newsletter

AI Safety Newsletter #2: ChaosGPT, Natural Selection, and AI Safety in the Media

Oliver Z

· 3y ago · 4m read

AI Safety Newsletter #3: AI policy proposals and a new challenger approaches

Oliver Z

· 3y ago · 5m read

AI Safety Newsletter #4: AI and Cybersecurity, Persuasive AIs, Weaponization, and Geoffrey Hinton talks AI risks

Center for AI Safety

· 3y ago · 6m read

AI Safety Newsletter #5: Geoffrey Hinton speaks out on AI risk, the White House meets with AI labs, and Trojan attacks on language models

Center for AI Safety

· 3y ago · 5m read

AI Safety Newsletter #6: Examples of AI safety progress, Yoshua Bengio proposes a ban on AI agents, and lessons from nuclear arms control

Center for AI Safety

· 3y ago · 7m read

AI Safety Newsletter #7: Disinformation, Governance Recommendations for AI labs, and Senate Hearings on AI

Center for AI Safety

· 3y ago · 8m read

AI Safety Newsletter #8: Rogue AIs, how to screen for AI risks, and grants for research on democratic governance of AI

Center for AI Safety

· 3y ago · 7m read

AISN #9: Statement on Extinction Risks, Competitive Pressures, and When Will AI Reach Human-Level?

Center for AI Safety

· 3y ago · 9m read

AISN #12: Policy Proposals from NTIA’s Request for Comment and Reconsidering Instrumental Convergence

Center for AI Safety

· 3y ago · 8m read

AISN #13: An interdisciplinary perspective on AI proxy failures, new competitors to ChatGPT, and prompting language models to misbehave

Center for AI Safety

· 3y ago · 10m read

AISN#14: OpenAI’s ‘Superalignment’ team, Musk’s xAI launches, and developments in military AI use

Center for AI Safety

· 3y ago · 5m read

AISN #16: White House Secures Voluntary Commitments from Leading AI Labs and Lessons from Oppenheimer

Center for AI Safety

· 3y ago · 7m read

AISN #17: Automatically Circumventing LLM Guardrails, the Frontier Model Forum, and Senate Hearing on AI Oversight

Center for AI Safety

· 3y ago · 9m read

AISN #18: Challenges of Reinforcement Learning from Human Feedback, Microsoft’s Security Breach, and Conceptual Research on AI Safety

Center for AI Safety

· 3y ago · 6m read

AISN #20: LLM Proliferation, AI Deception, and Continuing Drivers of AI Capabilities

Center for AI Safety

· 3y ago · 10m read

AISN #21: Google DeepMind’s GPT-4 Competitor, Military Investments in Autonomous Drones, The UK AI Safety Summit, and Case Studies in AI Policy

Center for AI Safety

· 3y ago · 5m read

AISN #22: The Landscape of US AI Legislation - Hearings, Frameworks, Bills, and Laws

Center for AI Safety

· 3y ago · 6m read

AISN #23: New OpenAI Models, News from Anthropic, and Representation Engineering

Center for AI Safety

· 3y ago · 6m read

AISN #24: Kissinger Urges US-China Cooperation on AI, China's New AI Law, US Export Controls, International Institutions, and Open Source AI

Center for AI Safety

· 3y ago · 7m read

AISN #25: White House Executive Order on AI, UK AI Safety Summit, and Progress on Voluntary Evaluations of AI Risks

Center for AI Safety

· 3y ago · 7m read

AISN #26: National Institutions for AI Safety, Results From the UK Summit, and New Releases From OpenAI and xAI

Center for AI Safety

· 3y ago · 7m read

AISN #27: Defensive Accelerationism, A Retrospective On The OpenAI Board Saga, And A New AI Bill From Senators Thune And Klobuchar

Center for AI Safety

· 2y ago · 7m read

AISN #28: Center for AI Safety 2023 Year in Review

Center for AI Safety

· 2y ago · 6m read

AISN #29: Progress on the EU AI Act Plus, the NY Times sues OpenAI for Copyright Infringement, and Congressional Questions about Research Standards in AI Safety

Center for AI Safety

· 2y ago · 7m read

AISN #30: Investments in Compute and Military AI Plus, Japan and Singapore’s National AI Safety Institutes

Center for AI Safety

· 2y ago · 7m read

AISN #31: A New AI Policy Bill in California Plus, Precedents for AI Governance and The EU AI Office

Center for AI Safety

· 2y ago · 8m read

AISN #32: Measuring and Reducing Hazardous Knowledge in LLMs Plus, Forecasting the Future with LLMs, and Regulatory Markets

Center for AI Safety

· 2y ago · 10m read

AISN #33: Reassessing AI and Biorisk Plus, Consolidation in the Corporate AI Landscape, and National Investments in AI

Center for AI Safety

· 2y ago · 11m read

AISN #34: New Military AI Systems Plus, AI Labs Fail to Uphold Voluntary Commitments to UK AI Safety Institute, and New AI Policy Proposals in the US Senate

Center for AI Safety

· 2y ago · 10m read

AISN #35: Lobbying on AI Regulation Plus, New Models from OpenAI and Google, and Legal Regimes for Training on Copyrighted Data

Center for AI Safety

· 2y ago · 7m read

AISN #36: Voluntary Commitments are Insufficient Plus, a Senate AI Policy Roadmap, and Chapter 1: An Overview of Catastrophic Risks

Center for AI Safety

· 2y ago · 6m read

AI Safety Newsletter #37: US Launches Antitrust Investigations Plus, recent criticisms of OpenAI and Anthropic, and a summary of Situational Awareness

Center for AI Safety

· 2y ago · 6m read

AISN #38: Supreme Court Decision Could Limit Federal Ability to Regulate AI Plus, “Circuit Breakers” for AI systems, and updates on China’s AI industry

Center for AI Safety

· 2y ago · 5m read

AI Safety Newsletter #40: California AI Legislation Plus, NVIDIA Delays Chip Production, and Do AI Safety Benchmarks Actually Measure Safety?

Center for AI Safety

· 2y ago · 7m read

AI Safety Newsletter #39: Implications of a Trump Administration for AI Policy Plus, Safety Engineering

Center for AI Safety

· 2y ago · 7m read

AI Safety Newsletter #41: The Next Generation of Compute Scale Plus, Ranking Models by Susceptibility to Jailbreaking, and Machine Ethics

Center for AI Safety

· 2y ago · 6m read

AI Safety Newsletter #42: Newsom Vetoes SB 1047 Plus, OpenAI’s o1, and AI Governance Summary

Center for AI Safety

· 2y ago · 7m read

AI Safety Newsletter #43: White House Issues First National Security Memo on AI Plus, AI and Job Displacement, and AI Takes Over the Nobels

Center for AI Safety

· 2y ago · 7m read

AISN #44: The Trump Circle on AI Safety Plus, Chinese researchers used Llama to create a military tool for the PLA, a Google AI system discovered a zero-day cybersecurity vulnerability, and Complex Systems

Center for AI Safety

· 1y ago · 6m read