The Impossibility of AI Containment: Logical, Mathematical, and Computational Limits to Control

Haider, Sawsan (2024) The Impossibility of AI Containment: Logical, Mathematical, and Computational Limits to Control. [Preprint]

Abstract

This paper explores the artificial intelligence (AI) containment problem, specifically the challenge of creating effective safeguards for artificial general intelligence (AGI) and superintelligence. I argue that complete control, defined as full predictability of AI actions and total adherence to safety requirements, is unattainable. The paper reviews five key constraints: incompleteness, indeterminacy, unverifiability, incomputability, and incorrigibility. These limitations are grounded in logical, philosophical, mathematical, and computational results, such as Gödel’s incompleteness theorems and the halting problem, which together establish the impossibility of complete AI containment. I conclude that, instead of pursuing complete containment, resources should be allocated to risk management strategies that acknowledge AI’s unpredictability and prioritize adaptive oversight mechanisms.

Item Type: Preprint
Creators: Haider, Sawsan (19seh10@queensu.ca)
Subjects: Specific Sciences > Artificial Intelligence
Depositing User: Ms Sawsan Haider
Date Deposited: 16 Nov 2024 13:47
Last Modified: 16 Nov 2024 13:47
Item ID: 24223
Date: 15 August 2024
URI: https://philsci-archive.pitt.edu/id/eprint/24223
