Reinforcement learning scaling might incentivise hidden reasoning architectures for AI — EA Forum