You are viewing revision 1.3.0, last edited by Pablo

Capability control methods are methods to prevent undesirable outcomes from artificial intelligence by restricting what the AI can do. These methods encompass AI boxing, incentive methods (including anthropic capture), stunting, and tripwires (Bostrom 2014: 129-138). Capability control methods may be contrasted with motivation selection methods, which attempt instead to influence what the AI wants to do.


Bostrom, Nick (2014) Superintelligence: Paths, Dangers, Strategies, Oxford: Oxford University Press.

Posts tagged Capability control method
Most Relevant