Anthropic capture is a capability control method in which an advanced artificial intelligence, believing that it might be running inside a simulation, attempts to behave in ways that it expects its simulators to reward.