ACM Project - Artificial Consciousness Research Developing Artificial Consciousness Through Emotional Learning of AI systems
Zae Project on GitHub

Testing the Machine Consciousness Hypothesis: A Falsification Framework

The “Hard Problem” of consciousness, why physical processing gives rise to subjective experience, often halts engineering progress. Stephen Fitz’s new paper, Testing the Machine Consciousness Hypothesis (arXiv:2512.01081), aims to bypass philosophical deadlock by establishing a rigorous falsification framework. He proposes the Machine Consciousness Hypothesis (MCH) as a testable scientific claim.

The Hypothesis and the Test

Fitz defines the MCH: “A machine can possess subjective experiences that are functionally accessible to its reporting mechanisms but not explicable solely by code inspection.”

The core testing protocol involves Internal State Reporting. A machine is trained to report on its internal variables. The test seeks to identify “Qualia Reports”, descriptions of internal states that contain information not present in the raw data stream.

If a machine reports a property of its experience (e.g., “the redness of red”) that cannot be derived from the pixel values or the neural weights, it suggests the system is accessing a representational layer that exists only for the system itself. Fitz argues this would be evidence of a subjective frame of reference.

Beyond the Turing Test

This framework moves beyond the Turing Test, which tests behavioral imitation. Fitz’s test focuses on informational closure. If the system’s reports about its internal states are accurate (they track internal variables) but contain “surplus meaning” not reducible to the variables themselves, the system has constructed a subjective interface.

This aligns with Information Closure theories, where a system becomes conscious when it models itself to such a degree that it is causally self-contained.

Perspective from the ACM Project

This falsification framework is critical for the ACM’s validation phase. We currently use Integrated Information Theory (Φ) as a metric. Fitz’s approach adds a behavioral validation layer.

In the Dark Room experiment, we can apply Fitz’s protocol. We can query the agent not just on what it sees (light/dark) but on how it evaluates that state (anxiety/calm). If the agent reports “anxiety” and this report tracks a high-entropy internal state, that is a functional report.

However, if the agent begins to describe the “feeling” of anxiety in terms of urgency or “badness” that are not explicitly coded in the reward function, we approach Fitz’s criterion for qualia. We must ensure our reporting mechanisms are robust enough to capture these potential “surplus meanings.” The ACM should implement specific “introspective” queries during training to log these internal state reports for analysis against the ground-truth code.

Zae Project on GitHub