Hi,
Is anyone aware of a reading list of mostly peer-reviewed journal articles and pre-prints for AI safety/alignment? I would like to start reading and citing more papers from this literature in my own papers.
Thanks in advance for any help :)
Zak