AI capability control
In the field of artificial intelligence (AI) design, AI capability control proposals, also referred to more restrictively as AI confinement, aim to increase our ability to monitor and control the behavior of AI systems, including proposed artificial general intelligences (AGIs), in order to reduce the danger they might pose if misaligned. However, capability control becomes less effective as agents become more intelligent and their ability to exploit flaws in human control systems increases, potentially resulting in an existential risk from AGI. Therefore, the Oxford philosopher Nick Bostrom and others recommend capability control methods only as a supplement to alignment methods.
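To make the confinement ("AI box") idea concrete, the sketch below illustrates one way such a restriction could look in practice: the system is reachable only through a single plain-text channel, replies are length-limited, and a human gatekeeper must approve every reply before it is released. This is a minimal illustration of the general concept, not a method described in the source; the names `boxed_query`, `gatekeeper_approves`, and `run_session` are hypothetical and introduced only for this example.

```python
# Illustrative sketch of an "AI box": the model is reachable only through a
# single, length-limited, plain-text channel, and every reply must be approved
# by a human gatekeeper before it leaves the box. Hypothetical example only.

MAX_REPLY_CHARS = 2000        # minimalist channel: cap the size of any reply
MAX_QUERIES_PER_SESSION = 10  # bound the number of interactions per session


def boxed_query(model_fn, prompt: str) -> str:
    """Pass a text prompt to the confined model and return its raw text reply."""
    reply = model_fn(prompt)
    # Enforce the narrow channel: text only, truncated to a fixed length.
    return str(reply)[:MAX_REPLY_CHARS]


def gatekeeper_approves(reply: str) -> bool:
    """Ask a human operator to approve or reject a reply before release."""
    print("--- proposed reply ---")
    print(reply)
    answer = input("Release this reply? [y/N] ").strip().lower()
    return answer == "y"


def run_session(model_fn, prompts):
    """Run a bounded question-and-answer session through the box."""
    released = []
    for prompt in prompts[:MAX_QUERIES_PER_SESSION]:
        reply = boxed_query(model_fn, prompt)
        if gatekeeper_approves(reply):
            released.append(reply)
        else:
            released.append("[withheld by gatekeeper]")
    return released


if __name__ == "__main__":
    # Stand-in "model" that just echoes the prompt; a real system would be
    # the confined AI behind the same restricted interface.
    echo_model = lambda p: f"Answer to: {p}"
    print(run_session(echo_model, ["Is the Riemann hypothesis true?"]))
```

As the abstract notes, such channel restrictions are regarded as a supplement to, not a substitute for, alignment methods, since a sufficiently capable system might still persuade or deceive the gatekeeper into approving harmful output.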
Property | Value |
--- | --- |
dbo:abstract | In the field of artificial intelligence (AI) design, AI capability control proposals, also referred to more restrictively as AI confinement, aim to increase our ability to monitor and control the behavior of AI systems, including proposed artificial general intelligences (AGIs), in order to reduce the danger they might pose if misaligned. However, capability control becomes less effective as agents become more intelligent and their ability to exploit flaws in human control systems increases, potentially resulting in an existential risk from AGI. Therefore, the Oxford philosopher Nick Bostrom and others recommend capability control methods only as a supplement to alignment methods. (en) An AI box, sometimes called an Oracle AI, is a hypothetical isolated computer hardware system in which a possibly dangerous artificial intelligence, or AI, is kept constrained in a "virtual prison" and is not allowed to manipulate events in the external world. Such a box would be restricted to minimalist communication channels. Unfortunately, even if the box is well designed, a sufficiently intelligent AI may be able to persuade or deceive its human keepers into releasing it, or otherwise be able to "hack" its way out of the box. (es) |
dbo:wikiPageExternalLink | http://yudkowsky.net/singularity/aibox |
dbo:wikiPageID | 31641770 (xsd:integer) |
dbo:wikiPageLength | 23837 (xsd:nonNegativeInteger) |
dbo:wikiPageRevisionID | 1123952307 (xsd:integer) |
dbo:wikiPageWikiLink | dbr:Roman_Yampolskiy dbr:Riemann_hypothesis dbr:Intelligence_explosion dbr:Eliezer_Yudkowsky dbr:Stuart_J._Russell dbr:Computer_terminal dbr:Friendly_artificial_intelligence dbr:Machine_ethics dbr:HAL_9000 dbr:AI_alignment dbr:Ex_Machina_(film) dbr:Existential_risk_from_artificial_general_intelligence dbr:Nick_Bostrom dbr:Faraday_cage dbr:Multivac dbr:Artificial_general_intelligence dbr:Artificial_intelligence dbr:AI_takeover dbc:Existential_risk_from_artificial_general_intelligence dbc:Philosophy_of_artificial_intelligence dbc:Singularitarianism dbr:Regulation_of_artificial_intelligence dbr:Artificial_consciousness dbr:Asilomar_Conference_on_Beneficial_AI dbr:Human_Compatible dbr:Human_extinction dbr:Instrumental_convergence dbr:Interpretable_artificial_intelligence dbr:Superintelligent |
dbp:id | oAHIa651Wa0 (en) |
dbp:title | "Presentation titled 'Thinking inside the box: using and controlling an Oracle AI'" (en) |
dbp:wikiPageUsesTemplate | dbt:Main dbt:Reflist dbt:Rp dbt:YouTube dbt:Existential_risk_from_artificial_intelligence |
dct:subject | dbc:Existential_risk_from_artificial_general_intelligence dbc:Philosophy_of_artificial_intelligence dbc:Singularitarianism |
rdfs:comment | In the field of artificial intelligence (AI) design, AI capability control proposals, also referred to more restrictively as AI confinement, aim to increase our ability to monitor and control the behavior of AI systems, including proposed artificial general intelligences (AGIs), in order to reduce the danger they might pose if misaligned. However, capability control becomes less effective as agents become more intelligent and their ability to exploit flaws in human control systems increases, potentially resulting in an existential risk from AGI. Therefore, the Oxford philosopher Nick Bostrom and others recommend capability control methods only as a supplement to alignment methods. (en) An AI box, sometimes called an Oracle AI, is a hypothetical isolated computer hardware system in which a possibly dangerous artificial intelligence, or AI, is kept constrained in a "virtual prison" and is not allowed to manipulate events in the external world. Such a box would be restricted to minimalist communication channels. Unfortunately, even if the box is well designed, a sufficiently intelligent AI may be able to persuade or deceive its human keepers into releasing it, or otherwise be able to "hack" its way out of the box. (es) |
rdfs:label | AI capability control (en) AI box (es) |
owl:sameAs | wikidata:AI capability control dbpedia-es:AI capability control dbpedia-fa:AI capability control dbpedia-ro:AI capability control https://global.dbpedia.org/id/4JrL9 |
prov:wasDerivedFrom | wikipedia-en:AI_capability_control?oldid=1123952307&ns=0 |
foaf:isPrimaryTopicOf | wikipedia-en:AI_capability_control |
is dbo:wikiPageRedirects of | dbr:AI_box dbr:AI-box_experiment dbr:AI_box_experiment dbr:Artificial_intelligence_box dbr:A.I._box dbr:AI_boxing |
is dbo:wikiPageWikiLink of | dbr:AI_alignment dbr:AI_box dbr:AI-box_experiment dbr:AI_box_experiment dbr:Artificial_intelligence_box dbr:A.I._box dbr:AI_boxing |
is foaf:primaryTopic of | wikipedia-en:AI_capability_control |