PhilSci Archive

The Impossibility of AI Containment: Logical, Mathematical, and Computational Limits to Control

Haider, Sawsan (2024) The Impossibility of AI Containment: Logical, Mathematical, and Computational Limits to Control. [Preprint]

[img] Text
SHaider_AIContainment.pdf

Download (193kB)

Abstract

This paper explores the artificial intelligence (AI) containment problem, specifically addressing the challenge of creating effective safeguards for artificial general intelligence (AGI) and superintelligence. I argue that complete control—defined as full predictability of AI actions and total adherence to safety requirements—is unattainable. The paper reviews five key constraints: incompleteness, indeterminacy, unverifiability, incomputability, and incorrigibility. These limitations are grounded in logical, philosophical, mathematical, and computational theories, such as Gödel’s incompleteness theorem and the halting problem, which collectively prove the impossibility of AI containment. I argue that instead of pursuing complete AI containment, resources should be allocated to risk management strategies that acknowledge AI’s unpredictability and prioritize adaptive oversight mechanisms.


Export/Citation: EndNote | BibTeX | Dublin Core | ASCII/Text Citation (Chicago) | HTML Citation | OpenURL
Social Networking:
Share |

Item Type: Preprint
Creators:
CreatorsEmailORCID
Haider, Sawsan19seh10@queensu.ca
Subjects: Specific Sciences > Artificial Intelligence
Depositing User: Ms Sawsan Haider
Date Deposited: 16 Nov 2024 13:47
Last Modified: 16 Nov 2024 13:47
Item ID: 24223
Subjects: Specific Sciences > Artificial Intelligence
Date: 15 August 2024
URI: https://philsci-archive.pitt.edu/id/eprint/24223

Monthly Views for the past 3 years

Monthly Downloads for the past 3 years

Plum Analytics

Actions (login required)

View Item View Item