You are viewing revision 1.5.0, last edited by Pablo

Anthropic capture is a capability control method in which an advanced artificial intelligence assigns some probability to being inside a simulation and therefore tries to behave in ways that its simulators would reward.

...

