I've built The Guardrail, a website that aggregates and curates AI safety research, and I'd value feedback from this community on whether it's useful and how to improve it.
The site pulls new papers from arXiv daily and uses an LLM (Gemini 3 Flash) to:
I've also processed papers from NeurIPS and ICLR (2025 only for now) with the same tagging system.
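For anyone curious what a daily arXiv pull plus a relevance filter could look like, here is a minimal sketch. It is not The Guardrail's actual code. It assumes the public arXiv Atom API at `export.arxiv.org`, and the helper names (`build_query_url`, `parse_feed`, `filter_relevant`, `keyword_judge`) are hypothetical. The keyword judge is only a stand-in for the LLM call, which would replace it in a real pipeline.

```python
import urllib.parse
import xml.etree.ElementTree as ET

ARXIV_API = "http://export.arxiv.org/api/query"
ATOM_NS = {"atom": "http://www.w3.org/2005/Atom"}


def build_query_url(categories, max_results=100):
    """Build an arXiv API URL for the newest submissions in the given categories."""
    search = " OR ".join(f"cat:{c}" for c in categories)
    params = {
        "search_query": search,
        "sortBy": "submittedDate",
        "sortOrder": "descending",
        "max_results": max_results,
    }
    return f"{ARXIV_API}?{urllib.parse.urlencode(params)}"


def _text(entry, tag):
    """Return the element text with runs of whitespace collapsed."""
    raw = entry.findtext(f"atom:{tag}", default="", namespaces=ATOM_NS)
    return " ".join(raw.split())


def parse_feed(atom_xml):
    """Turn an Atom feed string into a list of {id, title, abstract} dicts."""
    root = ET.fromstring(atom_xml)
    return [
        {
            "id": _text(entry, "id"),
            "title": _text(entry, "title"),
            "abstract": _text(entry, "summary"),
        }
        for entry in root.findall("atom:entry", ATOM_NS)
    ]


def keyword_judge(paper):
    """Stand-in for the LLM judge (assumption): a crude keyword match."""
    keywords = ("alignment", "reward hacking", "jailbreak", "interpretability")
    text = f"{paper['title']} {paper['abstract']}".lower()
    return any(k in text for k in keywords)


def filter_relevant(papers, judge):
    """Keep only the papers the judge marks as safety-relevant."""
    return [p for p in papers if judge(p)]
```

A daily job would fetch `build_query_url(["cs.AI", "cs.LG"])`, pass the response body to `parse_feed`, and hand the result to `filter_relevant` with an LLM-backed judge in place of `keyword_judge`. Passing the judge in as a function keeps the fetching and parsing code testable without any model calls.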
There's a weekly Editor's Choice that ranks the top 10 papers by significance and novelty, available as an email digest for those who want it.
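As an illustration of how a top-10 ranking by significance and novelty could work, here is a small sketch. How the site combines the two scores is not stated in this post, so the equal-weight sum, the `ScoredPaper` type, and the `editors_choice` function are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass
class ScoredPaper:
    title: str
    significance: float  # assumed to come from the LLM judge
    novelty: float       # assumed to come from the LLM judge


def editors_choice(papers, k=10, w_significance=0.5, w_novelty=0.5):
    """Return the k papers with the highest weighted score (weights are assumptions)."""
    def combined(p):
        return w_significance * p.significance + w_novelty * p.novelty

    # sorted() is stable, so ties keep their original (e.g. chronological) order.
    return sorted(papers, key=combined, reverse=True)[:k]
```

Changing the weights shifts the digest toward either more novel or more significant work without touching the rest of the pipeline.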
The volume of potentially safety-relevant research on arXiv is overwhelming. I wanted a way to stay current without manually scanning hundreds of abstracts. The LLM judge isn't perfect, but it catches most relevant papers and dramatically reduces the filtering burden.
The site is open source (GitHub) and funded by a BlueDot Impact rapid grant, so I'm committed to maintaining and improving it.
Happy to answer questions about how the filtering works or take feature requests.