Anthropic capture

Anthropic capture is a capability control method in which the AIan advanced artificial intelligence thinks it might be in a simulation and as such attempts to behave in ways that will be rewarded by its simulators.

BibliographyFurther reading

Bibliography

Bostrom, Nick (2014) Superintelligence: paths, dangers, strategies, Oxford: Oxford University Press, pp. 134–135.

Anthropic capture is a capability control method in which the AI thinks it might be in a simulation and as such attempts to behave in ways that will be rewarded by its simulators.

Anthropic capture is a in which the AI thinks it might be in a simulation and as such attempts to behave in ways that will be rewarded by its simulators.

Created by Pablo at 3y