Neel Nanda on Mechanistic Interpretability: Progress, Limits, and Paths to Safer AI — EA Forum