Neural Arbitration Framework — Víctor Saavedra

Abstract

In 1962, Gazzaniga and Sperry studied patients with severe epilepsy whose corpus callosum —the structure connecting the two cerebral hemispheres— had been surgically severed as a treatment to reduce seizures. By interrupting communication between hemispheres, they observed an unexpected phenomenon: the left hemisphere generated coherent narrative explanations for actions that had actually been initiated by the right hemisphere, without the former having real access to that information. They named this mechanism “the interpreter”: a cognitive system that constructs plausible accounts in the absence of real knowledge, unable to recognize the limits of what it knows. Its function is not deception. It is integration. The left hemisphere is, by nature, a narrative machine that closes even when it should not.

Sixty years later, we have built systems that do the same thing at scale. Large language models produce fluent, confident, and internally coherent responses in zones where their training is incomplete, ambiguous, or absent. This phenomenon is called hallucination: the production of confident, apparently grounded output in the absence of real knowledge. This paper argues that hallucination is not an external defect correctable by verification layers, but an architectural consequence: current language systems lack the functional equivalent of a dedicated epistemic monitoring mode — a hemisphere that does not generate narrative, but maps what the system does not know.

The Neural Arbitration Framework (NAF) formalizes this distinction as a two-hemisphere architecture with a five-phase arbitration protocol. The NAF's distinctive claim is not merely temporal but functional: the epistemic process has interruptive authority over narrative closure. The NAF does not necessarily introduce new signals; it introduces a new organizational principle — one in which uncertainty detection can interrupt generation rather than merely annotating it after the fact.

Resumen

En 1962, Gazzaniga y Sperry estudiaron pacientes con epilepsia severa a quienes se había seccionado el cuerpo calloso —la estructura que conecta los dos hemisferios cerebrales— como tratamiento para reducir las convulsiones. Al interrumpir la comunicación entre hemisferios, observaron un fenómeno inesperado: el hemisferio izquierdo generaba explicaciones narrativas coherentes para acciones que en realidad habían sido iniciadas por el hemisferio derecho, sin que el primero tuviera acceso real a esa información. A este mecanismo lo denominaron “el intérprete”: un sistema cognitivo que construye relatos plausibles en ausencia de conocimiento real, incapaz de reconocer los límites de lo que sabe. Su función no es el engaño. Es la integración. El hemisferio izquierdo es, por naturaleza, una máquina narrativa que cierra incluso cuando no debería.

Sesenta años después, hemos construido sistemas que hacen lo mismo a escala. Los modelos de lenguaje de gran escala producen respuestas fluidas, seguras e internamente coherentes en zonas donde su entrenamiento es incompleto, ambiguo o ausente. A este fenómeno se le denomina alucinación: la producción de output confiado y aparentemente fundamentado en ausencia de conocimiento real. Este artículo argumenta que las alucinaciones no son un defecto externo corregible mediante capas de verificación, sino una consecuencia arquitectónica: los sistemas de lenguaje actuales carecen del equivalente funcional de un modo dedicado a la monitorización epistémica — un hemisferio que no genera narrativa, sino que mapea lo que el sistema no sabe.

El Neural Arbitration Framework (NAF) formaliza esta distinción como una arquitectura de dos hemisferios con un protocolo de arbitraje de cinco fases. La afirmación distintiva del NAF no es meramente temporal, sino funcional: el proceso epistémico tiene autoridad para interrumpir el cierre narrativo. El NAF no introduce necesariamente señales nuevas; introduce un nuevo principio organizativo — uno en el que la detección de incertidumbre puede interrumpir la generación en lugar de meramente anotarla a posteriori.