This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
Effective Altruism Forum
EA Forum
Login
Sign up
R
Ratnaditya
Product Manager @ Microsoft
0 karma
Joined
May 2026
Working (15+ years)
Message
Get notified
Posts
2
Sorted by New
1
Probing is not enough; a validity audit for any probe
Ratnaditya
Ratnaditya
·
6d
ago
· 11m read
0
0
1
Eval-related prompt cues predicted refusal shifts across 32k LLM rollouts
Ratnaditya
Ratnaditya
·
2mo
ago
· 1m read
0
0