Academic AI Safety/Alignment Reading List

Zak_H

Effective Altruism Forum
EA Forum

GIVING SEASON 2024

Hide table of contents

[ Question ]

Academic AI Safety/Alignment Reading List

by Zak_H

Nov 21 20231 min read1 answer 0

6

AI safetyResearch

Frontpage

Academic AI Safety/Alignment Reading List

Answers

aogara

Hey, I've found this list really helpful, and the course that comes wi...

No comments

Hi,

Is anyone aware of a reading list of mostly peer-reviewed journal articles and pre-prints for AI safety/alignment? I would like to start reading and citing more papers from this literature in my own papers.

Thanks in advance for any help :)

Zak

6 Reactions

New Answer

New Comment

1 Answers sorted by
Top

aogara

Nov 21, 2023

Hey, I've found this list really helpful, and the course that comes with it is great too. I'd suggest watching the course lecture video for a particular topic, then reading a few of the papers. Adversarial robustness and Trojans are the ones I found most interesting. https://course.mlsafety.org/readings/

[ Question ]

Academic AI Safety/Alignment Reading List

6

6

Reactions

1 Answers sorted by Top

Nov 21, 2023

1 Answers sorted by
Top