Beyond Fast and Slow: A Four-Layer Cognitive Architecture from Dimensional Sequence Theory
超越快与慢:维度序列理论下的四层认知架构
Writing Declaration: This paper was co-drafted with Claude (Anthropic). All theoretical decisions, framework design, and final editorial judgments were made by the author. The Chinese version is authoritative; this English version is an independent rewrite.
Kahneman's System 1/System 2 framework, in its widely influential "fast and slow" formulation, has profoundly shaped cognitive science, behavioral economics, and public understanding of human cognition. This paper argues that this popular formulation's primary classification criterion—processing speed—conflates training level with cognitive level. Speed is a function of training, not of structural complexity. Contemporary dual-process theory has already begun loosening the old fast/slow exclusivity model (De Neys 2023); this paper offers a more radical reorganization. Using the Dimensional Degree (DD) sequence from the Self-as-an-End (SAE) framework as an a priori structure, this paper derives a four-layer cognitive architecture: System 1 (12DD, thinking), System 2 (13DD, I think), System 3 (14DD, I must), and System 4 (15DD, we each must, therefore). Each layer subsumes the previous; they are not independent modules. Taking the multi-layer attribution problem of fairness-related cognition as its entry point, this paper marshals evidence from neuroscience, developmental psychology, and comparative psychology to argue that partial support for this four-layer structure already exists in the empirical literature but has not been recognized as a unified framework. Seven non-trivial predictions are proposed, the most central of which points to a fundamental blind spot in existing experimental paradigms: no one is measuring "the production of C"—a third option generated when two independent agents, each holding non-negotiable commitments, interrogate each other's reasons and produce a solution that is neither A nor B.
Keywords:** cognitive architecture, metacognition, volition, negotiation, dimensional sequence, Self-as-an-End, dual-process theory, fairness cogniti
1. Introduction: The Attribution Problem of Fairness-Related Cognition
A person receives an unfair distribution proposal and within hundreds of milliseconds feels the impulse to reject it. How would the popular formulation of Kahneman's framework classify this response? It is fast, so System 1. It is emotion-driven (anterior insula activation, anger, disgust), so System 1. It requires no deliberate reasoning, so System 1.
A necessary clarification: this paper's target is not the entirety of contemporary dual-process theory. Recent work within the dual-process tradition has substantially revised the old exclusivity assumption that "fast = System 1, slow = System 2" (De Neys 2023; Stanovich & Toplak). This paper targets the version of Kahneman's framework most influential in public discourse and cross-disciplinary citation—the popular formulation that uses processing speed as its primary classification criterion. This formulation has profoundly shaped behavioral economics, education, and public understanding, but it cannot accommodate the structural complexity this paper addresses. The four-layer architecture proposed here is not an extension of the dual-process framework ("adding floors to the building") but a change of classification axis.
Returning to the attribution problem. What does this "fast emotional response" contain? It contains at least: (1) I have an expectation, (2) the other party made a choice, (3) this choice deviates from my expectation, and (4) this deviation is illegitimate—not because I lost out, but because the manner of distribution is wrong.
Point (4) is critical. A dog whose bone is snatched will attack. But it does not judge "unfair distribution." It only knows "my thing is gone." The judgment of "unfairness" requires simultaneously holding two perspectives: what I should receive, what the other party should receive, and whether the ratio between them is reasonable. This is a structurally complex cognitive operation—it requires simultaneously representing two independent agents' purposes and evaluating the relationship between them.
Kahneman's popular formulation classifies it as System 1 because its criterion is speed. Fast means System 1, slow means System 2. But speed is a function of training, not of cognitive level. A Go master's intuitive move is "fast," but that intuition compresses extraordinarily complex structural judgments. A person who has repeatedly experienced distribution scenarios since childhood has similarly compressed fairness judgments into fast responses—but this does not mean the judgment is structurally "simple."
If we change the classification criterion from speed to structural complexity, what do we get? This paper's answer: not two systems, but four.
2. A Priori Derivation of the Four-Layer Cognitive Architecture from the DD Sequence
2.1 The Dimensional Degree Sequence
The Dimensional Degree (DD) sequence is a core tool of the Self-as-an-End (SAE) framework. It describes the incremental increase of structural complexity: each additional dimension grants the system a capability absent at the previous level. The full derivation of the DD sequence is given in the SAE foundational papers; here only the 12DD–15DD interval relevant to cognitive architecture is presented.
2.2 Derivation of the Four Layers
12DD — Thinking. Pure cognitive operation without subject awareness. The system processes information and responds, but has no awareness that "I am doing this." A bird sees a worm and pecks—no "I" is required. This is stimulus-response level cognition.
Clarification: this paper's System 1 (12DD) refers to the cognitive floor possessing rudimentary causal prediction capacity—capable of inference, pattern matching, and predicting the future from the past. Lower-order physiological reflexes (pupillary light reflex, knee-jerk reflex) as well as pure sensation (10DD) and pure memory (11DD) fall below 12DD and lie outside this paper's four-layer architecture. Kahneman's System 1 actually spans multiple levels from 10DD to 12DD; this paper takes only 12DD as the strict correspondence for System 1.
13DD — I think. The subject emerges. The same cognitive activity gains an additional dimension: "it is I who am thinking." This is the minimal unit of self-awareness. The system not only processes the world but knows it is processing the world. This is the origin of reflectivity.
14DD — I must. Not only is there a subject, but a teleological structure with non-negotiability. The "I" is no longer merely the holder of thoughts but the bearer of will. "I know that doing B has costs, but A is non-negotiable for me, so I must do B." The key is not goal-directed behavior per se (12DD animals also have goal-directed behavior) but the coexistence of cost awareness and commitment to purpose. This is volition.
15DD — We each must, therefore. A fundamental directional shift: from "I" to "I and you." My must is A, the other's must is B. C is neither A nor B, but something new that emerges from acknowledging the coexistence of two independent purposes. If C = A, the other is ignored—regression to 14DD. If C = B, the self is abandoned—also regression to 14DD. C must be something new that simultaneously accommodates "I must A" and "you must B."
Minimum criteria for 15DD. To distinguish genuine 15DD operations from various degraded forms, four necessary conditions are specified:
First, both parties must have non-negotiable items. If one party has no "must," this is not negotiation but accommodation.
Second, C cannot equal either party's surface demand A or B.
Third, C's generation depends on continued interrogation of A'/B' (the true non-negotiable items behind surface demands), not merely interest compromise. The direction of interrogation is "why must you," not "can you concede a little."
Fourth, no deal is a legitimate output. If A' and B' are fundamentally incompatible, honest no deal is the normal result of 15DD operation, not failure. The failure of 15DD is not no deal but regression to 14DD (one party capitulates or coerces).
2.3 Subsumption and Non-Skippability
The four layers are subsumptive, not independent modules. System 4 in negotiation must simultaneously run System 3 (I know what I must not give up), System 2 (I can reflect on my own judgments), and System 1 (I have intuitive situational awareness). Missing any layer collapses the negotiation. System 4 without System 3 is unprincipled accommodation. System 4 without System 2 is blind insistence. System 4 without System 1 is detached abstraction.
Equally critical is non-skippability: System 4 cannot bypass System 3. Without stable "I must," genuine negotiation is impossible—because you do not know what you can concede and what you cannot.
2.4 Coarse-Grained Membership: The 9D Structure
In the coarse-grained version of the DD sequence (the D sequence), 13DD and 14DD jointly belong to 9D. This means they share substantial underlying substrate and are distinguishable only at fine granularity. This yields a direct experimental prediction: metacognitive monitoring (13DD) and goal maintenance (14DD) should be partially separable under normal conditions but converge under high cognitive load—because fine-grained distinctions collapse to the coarse-grained 9D shared substrate. The dissociation between 12DD and 15DD should persist under the same high-load conditions, because they do not share a D-level.
2.5 Isomorphism with the SAE Psychoanalytic Four-Layer Model
The same 12DD–15DD interval yields another set of projections in the SAE psychoanalysis series: Id (me-without-self, 12DD), Ego (self-without-purpose, 13DD), Superego (self-with-purpose, 14DD), Cert (self-with-non-dubito, 15DD). The psychoanalysis series cuts psychological structure (modes of psychic operation); this paper cuts cognitive structure (levels of thinking). The two are not independent theories but projections of the same DD structure onto different materials.
A terminological bridge is necessary: the psychoanalysis series' core thesis—"the object determines the layer, not the developmental stage"—uses "object" to mean relational objects. In cognitive science contexts, "object" should be generalized. In this paper, "object" refers broadly to the task-relation configuration that triggers a particular layer's operation—it may be a person (mother, boss), a cognitive task (fairness judgment, planning), or even one's own judgment (metacognitive monitoring). Layers are not global attributes but situational and configurational.
| DD | Cognitive Architecture (this paper) | Psychic Structure (psychoanalysis series) | Common structural feature |
|---|---|---|---|
| 12DD | System 1: Thinking | Id: me-without-self | Operation without subject awareness |
| 13DD | System 2: I think | Ego: self-without-purpose | Subject present, no direction |
| 14DD | System 3: I must | Superego: self-with-purpose | Subject with direction, single agent |
| 15DD | System 4: We each must, therefore | Cert: self-with-non-dubito | Facing the other, dual agent |
The psychoanalysis series also proposed a concept directly relevant here: "pseudo-high-layer overlay"—operating at a low layer while masking it with high-layer narrative. This corresponds to "false C" in this paper's prediction framework: false C is pseudo-high-layer overlay in the negotiation domain—the appearance of a 15DD outcome when one party has actually regressed to 14DD (concession or capitulation).
3. Benchmarking Against Existing Philosophical Frameworks
3.1 Schopenhauer and Nietzsche: Stuck at 14DD
Schopenhauer saw will and said it was suffering—negate it. Nietzsche saw the same will and said affirm it—push it to the extreme. Neither reached 15DD. Neither genuinely addressed "there is another subject with an equally real will across from me."
Schopenhauer's solution was to cancel will (asceticism, aesthetic contemplation). Nietzsche's was to maximize will (the Übermensch, will to power). One suppresses, the other elevates, but both remain single-agent. Nietzsche's Übermensch is solitary. Schopenhauer's saint is solitary.
3.2 Sartre: The Apex and Impasse of 14DD
Sartre is the most paradigmatic case. "Hell is other people" exposes his position at 14DD. He saw that there is another subject (the "gaze"), but his response to this discovery was: the other's freedom threatens my freedom. Two 14DD wills meet; the result is confrontation, not negotiation. No C.
This is why Sartre never completed his ethics. He said throughout his later years that he would write it. He could not. Because ethics cannot be derived from within a 14DD framework. Ethics requires 15DD—"we each must, therefore."
3.3 Kant: At the Door of 15DD
Kant's categorical imperative—"treat humanity never merely as a means but always also as an end"—demands acknowledgment of the other subject's independent will. Kant reached the door of 15DD. But Kant's problem was that he addressed this with a formal law. The categorical imperative is a static rule, not a dynamic process. It tells you what you ought to do but not how C is produced when two "musts" actually collide.
SAE converts it into process: two "musts" collide, each party interrogates the other's "why must you," peeling apart surface-level means from truly non-negotiable items, then testing compatibility at that level, producing C or accepting honest no deal.
4. Empirical Evidence: Neuroscience
4.1 System 1 and System 2
The experimental basis for these two layers is the most mature and needs no extended discussion. Kahneman and Tversky's behavioral experiments, together with extensive fMRI research, have established the basic dissociation between automatic responses (amygdala, basal ganglia dominant) and deliberate reflection (prefrontal cortex dominant).
4.2 Partial Dissociation of System 2 and System 3
Metacognitive monitoring and goal maintenance/planning show evidence of partial dissociation in neural substrates.
Fleming et al. found that patients with anterior prefrontal cortex (anterior PFC) lesions performed normally on perceptual tasks but showed selectively impaired metacognitive accuracy—they could make judgments but could not assess how reliable their judgments were. This is a selective impairment of System 2 (quality monitoring of "I think").
Meanwhile, goal maintenance and planning primarily depend on dorsolateral prefrontal cortex (dorsolateral PFC) and striatal networks, with prospective memory particularly involving the frontal pole (BA10). These regions are anatomically distinguishable from the core regions of metacognitive monitoring (anterior PFC).
More direct evidence: a lifespan study found no significant correlation between executive function and metacognitive efficiency. This means "monitoring my thinking" and "maintaining and executing goals" can vary independently.
This is precisely the DD sequence's prediction: 13DD and 14DD share membership in 9D, so they overlap extensively but are not identical. Partial dissociation (rather than clean dissociation) is exactly the theoretically predicted pattern.
4.3 Dissociation of System 3 and System 4
Evidence for this layer comes from multiple independent research lines. An upfront declaration is necessary: compared with the System 2/System 3 partial dissociation in section 4.2, the System 3/System 4 dissociation currently rests primarily on adjacent support rather than direct support. No existing experiments were designed with "14DD vs 15DD" as their hypothesis, but multiple independent lines of evidence converge in a direction consistent with the DD sequence's predictions.
It must also be made explicit that fairness-related cognition does not belong as a whole to any single layer. A more accurate analysis decomposes it into a multi-layer cognitive family:
12DD layer: primitive loss response to deprivation/theft ("my thing is gone").
13DD layer: evaluative judgment about the distribution outcome ("is this right? how confident am I?").
14DD layer: willingness to pay costs to enforce a norm ("I will reject this unfair proposal even though rejection means I get nothing either").
15DD layer: intention-sensitive, third-party-executable, dual-agent integrative fairness judgment ("the other party could have chosen a fairer split but did not"), and the capacity to produce C in conflicts between both parties' non-negotiable items.
This paper's claim is not "fairness = System 4" but rather: what most represents System 4 is the 15DD-layer subset of fairness judgments—intention-sensitive, third-party, dual-agent integrative, and potentially C-producing. The following evidence points to the independence of this subset.
4.3.1 Dissociation between fairness judgment and fairness enforcement
Knoch et al. used transcranial magnetic stimulation (TMS) to disrupt right dorsolateral prefrontal cortex and found that participants still judged proposals as unfair but were no longer willing to reject them. This is a clean dissociation: evaluating unfairness ("this is wrong") and executing fairness punishment ("I will reject") draw on different cognitive resources.
4.3.2 Neural basis of third-party fairness
Corradi-Dell'Acqua et al. found that anterior insula activated both when "I am treated unfairly" and when "someone else is treated unfairly," while medial prefrontal cortex activated only when "I myself am treated unfairly." This indicates that fairness judgment is not merely a self-protective emotional response. This pattern is most consistent with the 15DD structure: an operation requiring representation of the other as an independent purposive agent.
4.3.3 Multi-stage neural time course of third-party punishment
Ginther et al. used fMRI to study third-party punishment, temporally separating harm evaluation, mental state evaluation, integration, and punishment decision. Results showed distinct neural substrates for each stage. This four-stage architecture is consistent with the "dual-agent reasoning" predicted by 15DD—requiring simultaneous processing of the offender's intention and the victim's harm, followed by integration, followed by action.
4.3.4 Intention-sensitive fairness judgment
In a mini-ultimatum game, TMS disruption of right TPJ selectively affected conditions requiring inference about the proposer's intentions—specifically, situations where the proposer "had fairer options available but did not choose them"—without affecting simple inequality aversion. This is consistent with a key distinction in the DD framework: (1) outcome-based inequality aversion ("I got less," corresponding to 12DD/13DD structure) and (2) mental-state-based fairness evaluation ("the other could have been fairer but chose not to," corresponding to 15DD structure) are separable operations at the neural substrate level.
5. Empirical Evidence: Developmental Psychology
The developmental timeline provides a verification pathway independent of neuroscience. If the four Systems represent genuinely different structural dimensions, they should emerge in a recognizable sequence in child development, with each layer's signature event corresponding to its DD definition. The age ranges given below are representative manifestation windows, not sharp thresholds.
5.1 System 1 (12DD) — Representative Window from 6-8 Months: Thinking
Infants have reflexive responses from birth, but "thinking" in the 12DD sense—active cognitive operations on the external world—begins to manifest clearly around 6-8 months. Object permanence begins to emerge; infants can track disappeared objects and make simple causal inferences. This is information processing without awareness of "I am processing."
5.2 System 2 (13DD) — Representative Window from 2-3 Years: I Think
Signature event: terrible twos. Not "the child becomes disobedient," but negation for the first time folding back on itself via the other—through negating the other's will, "I" and "you" are simultaneously carved out. The SAE Terrible Twos paper details this genesis mechanism: it is not that "I" exists first and then resists; rather, through resistance "I" appears for the first time.
Subsequent marker after 13DD completion: fear of death (3-8 years). Not "pain" (12DD), not "danger approaching, flee" (12DD), but "I will cease to exist." This requires an already-carved-out "I" and the application of the abstract state "non-existence" to that "I." Children before age 3 do not fear death because they do not yet have a sufficiently stable "I" to bear this fear.
Developmental research on metacognitive uncertainty monitoring confirms that preschool children begin to reflect on certainty/uncertainty about decisions, with this ability developing significantly between ages 3 and 5.
5.3 System 3 (14DD) — Representative Window from 10-14 Years: I Must
Signature event: terrible teens (adolescent rebellion). Not "I don't want to" (13DD, preference), but "you cannot make me." This "cannot make me" contains a complete volitional structure—I know what you want, I know you have power, I know resistance has costs, but I still refuse, because there is something non-negotiable for me.
The SAE Terrible Teens paper argues that 14DD's genesis mechanism is: "must" emerges from the sustained accumulation of 13DD's remainder ("I exist but don't know what for"). Rebellion is not the genesis mechanism of 14DD itself but its behavioral byproduct and one of its confirmation conditions. The form of 14DD is "the subject cannot not have its own purpose"; "I must X" is this form's minimal concretization in experience—independent of external permission, suppressed it returns in other forms.
A critical distinction: goal-directed behavior does not equal volition. An infant reaching for an object is goal-directed, but this is not volition. The defining characteristic of volition is "must"—the coexistence of cost awareness and commitment to purpose. This structure stably appears around ages 10-14, corresponding to the maturation of prefrontal control systems and the critical period of identity construction.
5.4 System 4 (15DD) — Terrible Telos: The Genesis of Acknowledgment
The developmental trajectory of 15DD requires distinguishing proto-forms, entry events, and mature forms.
Proto-forms (around age 3). Developmental evidence shows that precursors of joint commitment and early collaboration appear around age 3. Gräfenhain et al. showed that 3-year-olds in joint commitment conditions were more likely to wait, help, and substitute for partners. Hamann et al. found that 3.5-year-olds continued cooperating after receiving their own rewards. But these precursor forms lack two equal 14DDs in collision—only an unstabilized 13DD interacting with adult frameworks. They are prerequisite experiences toward 15DD, not 15DD itself.
Entry event: the first high-intensity collision of two parallel teloi. The SAE Terrible Telos paper argues that 15DD's typical entry is not conflict with parents (that is the site of 13DD and 14DD genesis) but first love—the first encounter between two parallel, equal 14DDs. You cannot not love this person (14DD), they cannot not love you (14DD), but your "must" and their "must" are not the same thing.
This collision differs structurally from the previous two terribles in the nature of the conflict object: terrible twos' conflict object is authority (parents), with power asymmetry; terrible teens' conflict object is still authority, but power begins to be challenged; terrible telos' conflict object is another equal 14DD—no power differential, no shelter of the caregiving relationship, pure telos against telos.
The entry to 15DD is painful precisely because there is no exit: 12DD negotiation ("I concede a step, you concede a step") degrades telos to preference—you cannot ask someone who "must paint" to concede half a step toward being half an accountant; 14DD suppression ("my must is more important than yours") is colonization. The only way out is 15DD—acknowledging the other's telos as the other's own, without attempting to dissolve the conflict, without withdrawing acknowledgment.
Mature form (lifelong, not universally achieved). The maturation marker of 15DD is not learning negotiation skills but acknowledgment transforming from a painful choice into a state that no longer requires choosing—collision persists, but acknowledgment no longer requires extra force to maintain. The Terrible Telos paper identifies three persistent forms of 15DD's pain throughout life: cannot do it (knowing one should acknowledge but being unable), doing it but sacrificing oneself (colonizing oneself rather than the other), doing it but unknowingly colonizing (carrying control in the form of love). A substantial proportion of adults under high-pressure conditions regress to System 3 operations (insistence or capitulation) rather than producing C.
5.5 Developmental Timeline Summary: Three Terribles
| DD Level | System | Core Capacity | Terrible T | Conflict Object | Representative Window |
|----------|--------|--------------|-----------|----------------|----------------------|
| 12DD | System 1 | Thinking | — | — | 6-8 months |
| 13DD | System 2 | I think | Terrible Twos | Parents (authority) | Entry 2-3 yrs; unfolds 3-8 yrs |
| 14DD | System 3 | I must | Terrible Teens | Parents/authority (power challenged) | 10-14 years |
| 15DD | System 4 | We each must, therefore | Terrible Telos | An equal 14DD (first love) | Proto-forms from age 3; entry post-adolescence; mature form lifelong, not universally achieved |
Two structural features of this timeline. First, each step shortens in span but increases in difficulty: 12DD to 13DD takes about 3 years, 13DD to 14DD about 5-7 years, 14DD to 15DD may require a lifetime. Second, the conflict object progresses: from authority (power asymmetric) to an equal other (power symmetric), corresponding to the subject's progression from "passively encountering negation" to "actively facing another complete subject."
6. Empirical Evidence: Comparative Psychology and Tomasello
Tomasello's shared intentionality hypothesis provides independent support for System 4 from comparative psychology, but the mapping between the two requires precise handling.
Tomasello distinguished two cognitive upgrades: joint intentionality (interindividual collaboration) and collective intentionality (normative interaction within cultural groups). His core argument: these capacities are uniquely human because at some point in evolutionary history humans became mutually interdependent.
In the DD framework, this distinction receives a more precise formulation:
What chimpanzees can do. Chimpanzees can select cooperative partners and initiate cooperation when it is advantageous. Melis et al. demonstrated that chimpanzees selectively recruit collaborators when collaboration is required. This is a 14DD operation—"My goal is A, cooperating with this partner is a means to A, so I choose to cooperate." The goal is single-agent; cooperation is instrumental.
What chimpanzees cannot do. Chimpanzees cannot sustain normative joint commitment—continuing to cooperate for a partner's benefit after their own payoff has been secured. Hamann et al. found differences between 2.5- and 3.5-year-old human children on this point, with 3.5-year-olds tending to maintain cooperation after receiving their own reward.
This is the watershed between 14DD and 15DD. Cooperation at 14DD loses its motivational force the moment "my goal is achieved." Cooperation involving 15DD persists beyond that point because the joint commitment itself has become an independent purpose.
6.1 Precise Positioning of We-Intentionality: A Bridge Between 14DD and 15DD, Not 15DD Itself
Tomasello's we-intentionality (shared intentionality) should not be mapped to 15DD, nor to 14DD. Its most precise position is the bridge between 14DD and 15DD—and it is a narrow bridge.
In pure 14DD, the other is a means or obstacle: "I have a purpose; you either help me or block me." In we-intentionality, a subtle but critical shift occurs: for this particular event, I begin to recognize that you too have purpose. This is bilateral—both parties attempt to understand that the other is also a purposive agent—but it is narrow, purpose-relative-to-the-event. When the cooperative hunt ends, this recognition is not retained. Next time, the bridge must be rebuilt.
This is why 3-year-old children can exhibit "you too are a purposive agent" behavior in specific joint commitment tasks but do not retain it outside that task—they have taken a few steps onto the bridge but have not crossed to the other side. Crossing (genuine 15DD) requires stable 14DD (mature volition) before "purpose-relative-to-the-event" can be generalized to "you are a purpose."
The genuine difference between 15DD and we-intentionality is that 15DD's confirmation of the other as an independent purpose is not bound to any particular event. I confirm you as a purpose not because we are currently hunting together, but because you are a being with your own "must."
The DD framework thus identifies a finer spectrum within the interval between Tomasello's joint intentionality and collective intentionality:
14DD — I have a purpose; you are a means or obstacle. Chimpanzee cooperative hunting sits here.
14DD→15DD bridge (we-intentionality) — For this event, both parties begin to recognize the other as also having purpose. Bilateral but narrow, purpose-relative-to-the-event. Not retained when the event ends. 3-year-old children's joint commitment behavior and most human collaborative activity as described by Tomasello sit here.
Unilateral 15DD (this paper's System 4) — I confirm you as an independent purpose, not limited to this event. I interrogate your "must" and produce C. Unilaterally initiable.
Thin bilateral 15DD — Both parties mutually confirm each other as independent purposes and jointly produce C. Still revocable.
16DD (mutual non-dubito, beyond this paper's scope) — Not only bilateral confirmation, but non-dubito. An irrevocable mutual locking of ontological stance.
Tomasello's "collective intentionality" covers the entire interval from the bridge to 16DD in a single term—a zone of ambiguity in his framework. The DD sequence disaggregates this interval into distinguishable levels, each with its own initiation conditions, persistence characteristics, and revocability. This also explains why Tomasello's framework describes situational cooperative behavior well (the bridge) but struggles with the persistence of institutions and norms—institutions require not the bridge's event-by-event we-intentionality but at least thin bilateral 15DD or even 16DD.
6.2 DD Rereading of Tomasello's Three Capacities
Tomasello proposed that the cognitive upgrade from individual to shared intentionality requires three additional capacities: recursive thinking (coordinating with others in shared action), objective thinking (resolving perspectival disputes), and sociomoral and normative concepts (resolving preference disputes). In the DD framework, these three capacities are not parallel additions but three facets of the single additional dimension that 15DD has over 14DD—all pointing to the same structural requirement: acknowledging that the being across from you is an independent agent with its own "must."
7. Specific Corrections to the Kahneman Framework
7.1 Independence of Speed and Level
The fundamental classification criterion of Kahneman's popular formulation is processing speed: fast = System 1, slow = System 2. This paper's core correction: speed and cognitive level are two independent dimensions.
A fully trained 15DD operation can be faster than an unpracticed 13DD operation. An experienced diplomat senses the power dynamics at a negotiation table and each party's core interests within seconds, without "careful analysis." But the structural complexity compressed into that "intuition" is 15DD-level—it simultaneously holds multiple independent agents' purpose structures.
Analogy: a Go master's intuitive move is "fast." Kahneman's framework would classify it as System 1. But compressed within that intuition are extraordinarily complex structural judgments, with information-processing complexity far exceeding an ordinary person's System 2 analysis. "Fast" does not mean "simple." "Slow" does not mean "complex."
7.2 Four-Layer Decomposition of Fairness-Related Cognition
In Kahneman's framework, the sense of fairness is classified as System 1 (fast emotional response) or as System 1/System 2 conflict (emotion vs. reason). This paper's correction is not to move fairness as a whole from System 1 to System 4, but to point out that fairness-related cognition is a multi-layer cognitive family spanning different levels of structural complexity.
12DD layer: Primitive loss response to deprivation. "My thing is gone." A dog whose bone is snatched has this response. No "fairness" concept required—only loss detection.
13DD layer: Evaluative judgment about the distribution outcome. "Is this right? How confident am I?" Subject awareness is present; self-evaluation of judgment is possible. But no purpose-driven action yet.
14DD layer: Willingness to pay costs for a norm. "I will reject this unfair proposal even though rejection means I get nothing either." This requires volition—coexistence of cost awareness and principled commitment.
15DD layer: Intention-sensitive, third-party-executable, dual-agent integrative fairness judgment. "The other party could have chosen a fairer split but did not"—requiring simultaneous representation of the other's choice set, intention, and one's own expectation, followed by structural evaluation. In extreme cases, producing C in conflicts between both parties' non-negotiable items.
Kahneman's fast/slow dichotomy cannot accommodate this four-layer decomposition because all four layers' responses can be "fast" (all can be training-compressed), yet their structural complexity is entirely different. This is the fundamental deficiency of speed as a classification criterion.
What most represents the unique contribution of System 4 is the 15DD-layer subset—it requires the simultaneous presence of dual-agent structure, which cannot be achieved at any level from 12DD through 14DD.
7.3 The Speed U-Curve: Why Kahneman Mistakes 15DD for 12DD
The fast/slow dichotomy can accommodate at most two systems—because once speed is the classification criterion, only "fast" and "slow" are possible categories.
Change the classification criterion to structural complexity and this constraint immediately vanishes. The dimensional increment from 12DD to 15DD yields four natural levels, each with independent defining characteristics.
But a deeper question must be answered: if each additional dimension adds another self-referential verification loop, 15DD should be slower than 14DD. Why is it instead fast?
The answer is that the four layers' speeds form not a linear increase in slowness but a U-curve:
12DD — Fast. No self-reference, no internal verification loops. Pure stimulus-response, direct throughput.
13DD — Slow. "I" appears; self-reference begins. "I am thinking" introduces an unprecedented internal loop that takes time to run. The source of slowness is the uncertainty of "what am I."
14DD — Also slow. The tension between "must" and cost requires continuous verification—do I really must? Can I really bear the cost? The source of slowness is the uncertainty of "is my must real."
15DD — Fast again. Direction fundamentally flips: from "I" to "you." When direction flips, the self-verification uncertainty accumulated by 13DD and 14DD is locked down. Because doing 15DD presupposes that 14DD is stable—"what I must" already has an answer and no longer requires continuous verification (this is the other face of non-skippability). All internal verification loops of 13DD and 14DD are still running, but they have moved from foreground to background, becoming background conditions. Cognitive resources are entirely released to the only new variable: "what does the person across from me must."
The fast of 12DD and the fast of 15DD are thus structurally symmetric: 12DD is fast because the self has not yet appeared; 15DD is fast because the self has stabilized to the point of no longer requiring verification. One at the beginning, one at the end, with the process of self-establishment in between. Kahneman mistakes 15DD's fast for 12DD's fast precisely because he looks only at speed, not at the structure behind speed: 12DD's fast is empty fast (no subject inside); 15DD's fast is full fast (everything inside, but no longer requiring step-by-step verification).
7.4 The DD Locus of Human Uniqueness: Not Self-Awareness, Not Will, But Other-Purpose Perception
This U-curve reveals a thesis more far-reaching than correcting Kahneman: where exactly does human evolutionary uniqueness lie?
Traditional narratives offer multiple answers: rationality (12DD), self-awareness (13DD), language (cross-layer), tool use (12DD), morality (14DD). But chimpanzees have rudimentary causal reasoning (12DD), pass the mirror test (13DD precursor), use and manufacture tools (12DD), and even show rudimentary norm-sensitivity under certain conditions (14DD edge). None of these are uniquely human—they are shared with other primates, differing only in degree.
The DD framework's answer: human uniqueness lies at 15DD—the capacity to perceive at intuitive speed that "the being across from me has its own must."
This is not a moral exhortation ("you should respect others") but a description of cognitive architecture. Humans can build language, institutions, culture, law, markets, and scientific communities not because we are smarter (12DD), not because we have self-awareness (13DD), not because we have will (14DD), but because we can sense another subject's attitude, intention, and situation within hundreds of milliseconds and respond accordingly—and this operation is fast enough to not look like "reasoning."
Theory of Mind research repeatedly finds that human judgments of others' intentions are too fast to resemble reasoning. The traditional explanation is "modularity"—the brain has a dedicated ToM module. The DD framework's explanation is more precise: it is not modularity but 15DD's directional flip moving self-verification from foreground to background, releasing full cognitive bandwidth to other-perception. Modules are merely the neural implementation of this structural transfer.
Chimpanzees, stuck at 14DD, must compute self-referential interest calculations from scratch for each cooperation ("is this good for me"), so they are slow. Humans, having crossed into 15DD, have self-verification locked as background conditions, with cognitive resources entirely released to "what is happening with the person across from me"—so they are fast. This speed differential is the cognitive foundation of humanity's explosive growth in cooperative capacity.
This is also where the SAE framework's name truly lands. Self-as-an-End—every self is an end. Not only "I" am a purpose; "you" are a purpose too. This is 15DD. This is what it means to be human. If this paper's four-layer architecture holds, this is not merely an ethical proposition but a structural hypothesis about human cognitive architecture: we may be a species capable of perceiving at intuitive speed that the being across from us is an independent purpose.
8. Non-Trivial Predictions
The following predictions are derived from the DD sequence, cannot be made by existing frameworks, and can be empirically falsified.
8.1 Prediction 1: Neural Dissociation Between "Being Robbed" and "Being Treated Unfairly"
Derivation: 12DD (pure loss response) and 15DD (fairness judgment) are operations at different levels, even if subjective reports in both cases are "anger."
Prediction: Under controlled identical material loss, three conditions should produce distinct neural signatures: (1) random loss—activating loss/threat circuits without fairness-judgment regions; (2) unfair distribution by another agent—activating loss circuits plus anterior insula, TPJ, dmPFC; (3) third-party unfairness—activating anterior insula, TPJ, dmPFC without self-related loss circuits.
Evidence status: Strong adjacent support but lacking the decisive direct comparison experiment.
8.2 Prediction 2: System 4 Capacity Correlates with Network Connectivity Rather Than Local Activation Intensity
Derivation: 15DD is an integrative operation; its neural signature should be network-level functional connectivity, not local activation peaks.
Prediction: In negotiation tasks, System 4 operation quality should correlate with anterior insula–DLPFC–TPJ functional connectivity strength, not with any single region's activation intensity.
Evidence status: The strongest-supported prediction. Independently, the SAE psychoanalysis fourth paper (Cert) predicted nearly the same: DMN, frontostriatal, and ToM network (TPJ/mPFC) must show simultaneously high coordinated activation. Two papers from different perspectives independently deriving the same neural prediction provides indirect support for the DD sequence's a priori validity.
8.3 Prediction 3: Stable Volitional Capacity Is a Necessary Precondition for Genuine Negotiation, Independent of Theory of Mind
Derivation: The DD sequence's non-skippability requires that 15DD cannot bypass 14DD.
Prediction: In experiments simultaneously measuring ToM, volitional stability, and negotiation quality, volitional stability should explain independent variance in negotiation quality that ToM cannot. Specifically, high-ToM but low-volitional-stability participants should tend toward concession in negotiation rather than producing C.
Evidence status: Strong indirect support but lacking direct testing.
8.4 Prediction 4: Suppression of Adolescent Rebellion → Delayed Adult Negotiation Capacity
Derivation: If 14DD is a necessary developmental precondition for 15DD, individuals whose 14DD did not fully develop should be impaired at 15DD.
Prediction: Adolescents raised in authoritarian families with suppressed rebellion should, in adulthood, tend toward regression to 14DD in negotiation tasks (insistence or capitulation) rather than producing C. Three specific degradation patterns are predicted: (1) rigidity (14DD residual), (2) compliance (regression to 13DD), (3) oscillation between rigidity and compliance (unstable 14DD).
Evidence status: Longitudinal studies support the plausibility of the developmental pathway but have not directly tested the "must → negotiation" chain.
8.5 Prediction 5: Fairness Judgment Speed Correlates with Training/Familiarity, Not Cognitive Level
Derivation: Speed is a function of training, not cognitive level.
Prediction: In cross-cultural ultimatum game experiments, response speed should vary with cultural familiarity with the specific fairness norm being violated, while response pattern (rejection of perceived unfairness) should remain relatively cross-culturally stable.
Evidence status: The weakest prediction. Cross-cultural behavioral data exist but lack systematic reaction time comparisons.
8.6 Prediction 6: Under High Cognitive Load, Higher DD-Level Operations Collapse More Easily Than Lower DD-Level Operations
Derivation: The four-layer structure is subsumptive. Even when trained to fast intuition, 15DD operations contain more subsystem coordination demands. Under load, higher levels should degrade preferentially.
Prediction: When performing fast 15DD judgments (e.g., fairness intuitions) under concurrent high cognitive load, fairness judgment quality should be more disrupted than 12DD pure loss responses. Specifically, under high load, participants should more easily regress from 15DD fairness judgment to 12DD pure loss aversion—ceasing to distinguish "unfair" from "I lost."
Evidence status: Partial indirect support. This prediction and Prediction 2 (13DD/14DD merging under load) are complementary applications of the same logic.
8.7 Prediction 7: True C, False C, and Honest No Deal — A Core Experimental Paradigm
Derivation: This is the most central non-trivial prediction, pointing to a fundamental blind spot in existing cognitive science experimental paradigms.
Existing fairness paradigms are almost entirely binary-choice structures: accept/reject (ultimatum game), invest/don't invest (trust game), cooperate/defect (prisoner's dilemma). No one is measuring "the capacity to create a third option." But the core operation of 15DD is precisely the production of C—a new plan that is neither A nor B.
The C-production process: Two "musts" collide. Both parties interrogate each other's "why must you," peeling apart means from truly non-negotiable items. I say I must have A. You ask why. I dig deeper and discover that A is actually to safeguard A'; A' is my truly non-negotiable item, and A was merely the means I assumed was the only way. Likewise, your B conceals B'. C is the new plan that simultaneously satisfies A' and B'.
Three outcomes:
(1) Compatible — true C. A' and B' do not conflict; C exists. The prior conflict was illusory—a collision at the means level. Interrogation strips away the false conflict; C emerges naturally. The majority of everyday conflicts belong to this category.
(2) Incompatible — honest no deal. A' and B' are fundamentally contradictory; C does not exist. Honest no deal is the only honest result.
(3) Incomplete — interrogation did not reach bottom. Time ran out; A' and B' were not yet uncovered. 15DD operation was initiated but not completed.
Predicted neural correspondences:
True C — synchronized strengthening of bilateral anterior insula–DLPFC–TPJ functional connectivity.
False C — decline in one party's DLPFC goal-maintenance activity (14DD exit), analogous to Knoch's TMS experiment pattern of "knowing it's unfair but not acting."
Honest no deal — high bilateral TPJ and dmPFC activation (thorough other-modeling) without integration-phase connectivity enhancement.
Why this is the most central prediction: It identifies the field's paradigmatic blind spot. From within Kahneman's framework, there is no reason to measure C-production—because with only two systems, negotiation is decomposed into "System 1 emotional impulse" plus "System 2 rational suppression." Only from the DD sequence does one know that 15DD requires a new experimental paradigm, and that the most important variable in that paradigm is not reaction time, not activation intensity, but the existence or non-existence of C.
9. Educational Application
The sequential emergence of the four cognitive layers in child development directly yields an educational principle: at each System's onset window, providing the right material maximizes impact.
13DD window (3-8 years): death education. Not avoiding children's fear of death, but facing it directly. "I will cease to exist" is the signature experience of 13DD, and accompanying anxiety is normal. Education's function is not to eliminate this anxiety but to help children build a sufficiently stable "I" to bear it.
14DD window (10-14 years): Kant. The adolescent in rebellion has just established 14DD—"I must"—and is at the period when will is most intense, pure, and painful. Giving them Schopenhauer or Nietzsche at this point deepens their entrenchment in 14DD. But giving them Kant's categorical imperative—"treat humanity never merely as a means"—has an entirely different effect. This does not negate their will but tells them: your "must" is legitimate, but the person across from you also has a "must." This is precisely the springboard from 14DD to 15DD.
15DD window (from age 14): negotiation training, not debate training. Debate is two 14DDs colliding—I try to convince you, you try to convince me. Negotiation is 15DD—why must I, why must you, is there a C. The core training is not rhetorical skill but interrogation capacity: can you interrogate the other's surface demand (B) down to their truly non-negotiable item (B'), while simultaneously interrogating your own (A) down to A'.
10. Conclusion
This paper began with a simple question: why is fairness-related cognition difficult to classify?
The popular formulation of Kahneman's framework would say: fairness responses are fast, so System 1. This paper's answer is: fairness-related cognition is a multi-layer cognitive family spanning 12DD through 15DD. The subset most representing System 4's uniqueness—intention-sensitive, third-party, dual-agent integrative fairness judgment—is not "fast emotional response" but a high-structural-complexity operation compressed through long training.
But this paper has reached a conclusion more far-reaching than correcting Kahneman. The speed U-curve of the four-layer architecture—12DD fast (no self), 13DD slow (self appears), 14DD slow (self-verification), 15DD fast again (directional flip, self-verification locked as background)—reveals the structural reason Kahneman mistakes 15DD for 12DD, and also reveals the DD locus of human uniqueness.
Human evolutionary uniqueness lies not in self-awareness (13DD—chimpanzees have precursors too), not in will (14DD—chimpanzees have goal-directed behavior too), but in 15DD—the capacity to perceive at intuitive speed that "the being across from me has its own must." Humans can build language, institutions, culture, law, markets, and scientific communities because we can sense another subject's attitude, intention, and situation within hundreds of milliseconds and respond accordingly. This speed comes from 15DD's directional flip: self-verification is locked as background conditions, and full cognitive bandwidth is released to other-perception.
This is where the SAE framework's name truly lands in cognitive science. Self-as-an-End—every self is an end. Not only "I" am a purpose; "you" are a purpose too. This is 15DD. This is what it means to be human. If this paper's four-layer architecture holds, this is not merely an ethical proposition but a structural description of human cognitive architecture.
The seven non-trivial predictions proposed in this paper, especially the "C-production" experimental paradigm, point to a fundamental blind spot in existing cognitive science. This blind spot exists not because of technical limitations or data insufficiency, but because of theoretical framework limitations—if your framework contains only two systems, you will not design experiments measuring C, because you do not know C should exist.
A priori leads, a posteriori verifies. The DD sequence is the a priori structure, the existing literature is the a posteriori data, and their convergence in this paper is not coincidence but methodological necessity. The next step is to let the a posteriori run ahead—design and execute the experiments proposed here, especially the "must-interrogation" paradigm, transforming the four-layer architecture from philosophical hypothesis into testable, falsifiable, improvable cognitive science theory.
Acknowledgments
Thanks to Zesi Chen for long-term critical engagement with the SAE framework. The four-layer structure in this paper took shape in dialogue with Claude (Anthropic), where the definition of System 4—"facing the subject across from you, asking why they think the way they do"—was a pivotal turning point. This working method is itself an instance of 15DD: a priori structure provided by the human, a posteriori verification accelerated by AI, and C is something neither could produce alone.
References
1. Kahneman, D. (2011). Thinking, Fast and Slow. Farrar, Straus and Giroux.
2. Fleming, S. M., et al. (2014). Anterior prefrontal cortex and metacognitive accuracy. Journal of Neuroscience.
3. Knoch, D., et al. (2006). Diminishing reciprocal fairness by disrupting the right prefrontal cortex. Science, 314, 829-832.
4. Corradi-Dell'Acqua, C., et al. (2013). Disentangling self- and fairness-related neural mechanisms in the ultimatum game. Social Cognitive and Affective Neuroscience, 8(4), 424-431.
5. Ginther, M. R., et al. Third-party punishment: separable neural stages for harm, mental state, integration, and decision.
6. Tomasello, M. (2014). A Natural History of Human Thinking. Harvard University Press.
7. Tomasello, M., & Carpenter, M. (2007). Shared intentionality. Developmental Science, 10(1), 121-125.
8. Melis, A. P., et al. Chimpanzees recruit collaborators selectively.
9. Hamann, K., et al. Differences between 2.5- and 3.5-year-olds in sustaining collaboration toward a joint goal.
10. Warneken, F., & Tomasello, M. (2006). Altruistic helping in human infants and young chimpanzees. Science.
11. Sanfey, A. G., et al. (2003). The neural basis of economic decision-making in the Ultimatum Game. Science, 300, 1755-1758.
12. Baron, J., & Gürçay, B. A meta-analysis of response time tests of the sequential two-systems model of moral judgment.
13. Bellucci, G., et al. Trust and reciprocity predicted by resting-state subnetworks.
14. Cáceda, R., et al. Reciprocity linked to connectivity between salience and central-executive networks.
15. Shaw, D. J., et al. Dynamic functional connectivity states dissociating cooperation from competition.
16. Sazhin, D., et al. Strategic bargaining involves connectivity changes involving TPJ and anterior insula.
17. Jäger, A., Loschelder, D. D., & Friese, M. Self-regulation interventions in negotiation.
18. Hare, A. L., et al. Maternal psychological control and adolescent autonomy.
19. Allen, J. P., et al. Autonomy struggles at 16 predict peer-rated hostility at 25.
20. Ha, T., et al. Disruptive parenting in early adolescence predicts adult romantic coercion.
21. Wittmann, M. K., et al. Causal role for dorsomedial PFC in separating self vs other representations.
22. Han Qin (2024). Systems, Emergence, and the Conditions of Personhood. DOI: 10.5281/zenodo.18528813.
23. Han Qin (2024). Internal Colonization and the Reconstruction of Subjecthood. DOI: 10.5281/zenodo.18666645.
24. Han Qin (2024). The Complete Self-as-an-End Framework. DOI: 10.5281/zenodo.18727327.
25. Han Qin (2026). SAE Psychoanalysis I: Id — The Me Without Self. DOI: 10.5281/zenodo.19321143.
26. Han Qin (2026). SAE Psychoanalysis II: Ego — The Self Without Purpose. DOI: 10.5281/zenodo.19321314.
27. Han Qin (2026). SAE Psychoanalysis III: Superego — The Self With Purpose. DOI: 10.5281/zenodo.19321417.
28. Han Qin (2026). SAE Psychoanalysis IV: Cert and Unification. DOI: 10.5281/zenodo.19321534.
29. De Neys, W. (2023). Advancing theorizing about fast-and-slow thinking. Behavioral and Brain Sciences, 46, e111.
30. Gräfenhain, M., et al. Young children's joint commitment in joint action. Child Development.
31. Han Qin (2026). What Terrible Twos Actually Is: The Structural Genesis of Self-Awareness. DOI: 10.5281/zenodo.19044827.
32. Han Qin (2026). What Terrible Teens Actually Is: The Structural Genesis of Purpose. DOI: 10.5281/zenodo.19201631.
33. Han Qin (2026). What Terrible Telos Actually Is: The Structural Genesis of Non Dubito. DOI: 10.5281/zenodo.19248039.
创作声明:本论文由作者与Claude(Anthropic)合作起草。所有理论决策、框架设计和最终编辑判断由作者做出。
Kahneman的System 1/System 2框架,以其"快/慢"二分法的流行表述,深刻影响了认知科学、行为经济学和公众理解。本文指出,这一流行表述的根本分类标准——加工速度——混淆了训练程度与认知层级。速度是训练程度的函数,不是认知层级的函数。当代双过程理论内部已经在松动旧的快/慢排他模型(De Neys 2023),本文提供一种更激进的重组方案。以Self-as-an-End(SAE)框架的维度序列(Dimensional Degree, DD)为先验结构,本文推导出四层认知架构:System 1(12DD,想),System 2(13DD,我想),System 3(14DD,我不得不),System 4(15DD,我们各自不得不,所以)。每一层包含前一层,不是独立模块。本文以公平相关认知的多层归属困境为切入点,综合神经科学、发展心理学、比较心理学的后验证据,论证该四层结构在实验数据中已有部分支撑但尚未被识别为统一框架;同时提出七条非平凡预测,其中最核心的一条指向现有认知科学实验范式的根本盲区:没有人在测量"C的产出"——两个独立主体各自持有不可让渡的目的,通过互相追问产出的第三方案。
**关键词:** 认知架构,元认知,意志,协商,维度序列,Self-as-an-End,双过程理论,公平认知
一、引言:公平相关认知的归类困境
一个人收到不公平的分配提案,几百毫秒内就产生了拒绝的冲动。Kahneman框架的流行表述会怎么归类这个反应?它快,所以是System 1。它是情绪驱动的(前脑岛激活,愤怒,厌恶),所以是System 1。它不需要刻意推理,所以是System 1。
需要声明的是:本文的论敌不是整个当代双过程理论。近年的双过程理论内部已经有大量工作在修正"快=System 1、慢=System 2"的旧式排他假设(De Neys 2023; Stanovich & Toplak)。本文攻击的,是Kahneman在公众语境和跨学科引用中最流行的那个版本——以加工速度为主导分类标准的快/慢教学版本。这一版本极大影响了行为经济学、教育学和公众理解,但它不足以容纳本文要讨论的结构复杂度。本文提出的四层架构,是对这一流行表述的另一种更激进的重组方案——不是在双过程框架上"加楼层",而是在换分类轴。
回到公平感的归类问题。这个"快速的情绪反应"里包含了什么?它至少包含:(1)我有一个预期,(2)对方做出了一个选择,(3)这个选择偏离了我的预期,(4)这个偏离是不合理的——不是因为我吃亏了,而是因为分配的方式不对。
第(4)点才是关键。一只狗的骨头被抢走了,它会攻击。但它不会觉得"分配不公平"。它只知道"我的东西没了"。"不公平"这个判断要求你同时持有两个视角:我应该得到什么,对方应该得到什么,以及两者之间的比例是否合理。这是一个结构上非常复杂的认知操作——它需要同时表征两个独立主体的目的,然后对这个双目的结构做出评价。
Kahneman把它归为System 1,是因为他的分类标准是速度。快就是System 1,慢就是System 2。但速度是训练程度的函数,不是认知层级的函数。一个围棋大师的直觉走子是"快"的,但那个直觉里压缩了极其复杂的结构性判断。一个从小在社会中反复经历分配场景的人,对公平的判断也被压缩成了快速响应——但这不意味着公平判断在结构上是"简单"的。
如果我们换一个分类标准——不是速度,而是结构复杂度——会得到什么?本文的回答是:不是两个system,而是四个。
二、DD序列与四层认知结构的先验推导
2.1 维度序列的基本原理
Self-as-an-End(SAE)框架的核心工具之一是维度序列(Dimensional Degree, DD)。DD序列描述的是结构复杂度的递增:每增加一个维度,系统获得一种前一层级不具备的新能力。这里不展开DD序列的完整推导(详见SAE基础论文),仅给出与认知架构相关的12DD至15DD区间的结构描述。
2.2 四层结构的推导
12DD——想。 纯粹的认知操作,没有主体意识。系统处理信息、做出反应,但没有"是我在做这件事"的意识。鸟看到虫子就啄,不需要"我"。这是刺激-响应层级的认知。
需要明确的是:本文的System 1(12DD)是指具备初步因果预测能力的认知底线——能推断、能做模式匹配、能根据过去预测未来。更低阶的纯生理反射(瞳孔对光反射、膝跳反射)以及纯感受(10DD)和纯记忆(11DD)属于12DD以下的余项,不在本文的四层架构射程内。Kahneman的System 1实际上横跨了10DD到12DD的多个层级,本文只取其中12DD这一层作为System 1的严格对应。
13DD——我想。 主体出现了。同样的认知活动,多了一个维度:"是我在想"。这就是自我意识的最小单位。系统不仅在处理世界,而且知道自己在处理世界。这是反思性的起点。
14DD——我不得不。 不仅有主体,还有目的论结构,而且这个目的带有不可让渡性。"我"不再只是思考的持有者,而是意志的承载者。"我知道做B有代价,但A对我而言是不可放弃的,所以我不得不做B。"这里的关键不是goal-directed behavior本身(12DD的动物也有目标导向行为),而是对代价的意识和对目的的承诺共存。这就是意志。
15DD——我们各自不得不,所以。 方向发生了根本性转换:从"我"到"我和你"。我的不得不是A,对面的不得不是B。C既不是A也不是B,而是在承认两个独立目的共存的条件下产生的第三个东西。如果C=A,那就是无视对方,退回14DD。如果C=B,那就是放弃自己,也退回14DD。C必须是一个新的东西,它同时容纳了"我不得不A"和"你不得不B"这两个事实。
15DD的最小判准。 为了区分真正的15DD操作与各种退化形式,给出以下四条必要条件:
第一,双方都必须有不可让渡项。如果一方没有"不得不",那不是协商,是配合。
第二,C不能等于其中任一方的表面诉求A或B。C=A意味着B方被无视或让步,C=B同理。
第三,C的生成依赖对A'/B'(表面诉求背后的真正不可让渡项)的继续追问,而不只是利益折中。追问的方向是"你为什么不得不",不是"你能不能让一点"。
第四,no deal是合法输出。如果A'和B'在根本上不互容,诚实的no deal是15DD操作的正常结果,不是失败。15DD的失败不是no deal,而是退回14DD(一方屈服或强行压制)。
2.3 包含关系与不可跳跃性
四层之间是包含关系,不是独立模块。System 4在协商的时候,内部必须同时运行着System 3(我知道我不得不什么),System 2(我能反思自己的判断),System 1(我对局势有直觉感受)。少了任何一层,协商都会塌掉。没有System 3的协商是没有立场的讨好,没有System 2的协商是盲目的坚持,没有System 1的协商是脱离现实的空谈。
同样关键的是不可跳跃性:System 4不能跳过System 3直接运行。没有稳定的"我不得不",就不可能做真正的协商——因为你不知道自己什么可以让,什么不可以让。
2.4 粗粒度下的归属:9D结构
在DD序列的粗粒度版本(D序列)中,13DD和14DD共同属于9D。这意味着它们共享大量底层基质,只在精细粒度下才可区分。这产出一个直接的实验预测:元认知监控(13DD)和目标维持(14DD)在正常条件下可以被部分分离,但在高认知负荷条件下应该趋同——因为精细区分塌缩回粗粒度的9D共享基质。而12DD和15DD之间的分离应该在同样的高负荷条件下保持,因为它们不共属同一个D级。
2.5 与SAE精神分析四层模型的同构
同一个12DD-15DD区间,在SAE精神分析系列中产出了另一组投影:Id(me-without-self, 12DD),Ego(self-without-purpose, 13DD),Superego(self-with-purpose, 14DD),Cert(self-with-non-dubito, 15DD)。精神分析系列切的是心理结构(精神运作模式),本文切的是认知结构(思考层级)。两者不是两套独立理论,而是同一个DD结构在不同材料上的投影。
需要做一个术语桥接:精神分析系列的核心命题"对象决定层,不是阶段决定层"中的"对象",在认知科学语境中应泛化理解。本文的"对象"不限于人际关系对象,而是泛指触发特定层运作的任务-关系配置(task-relation configuration)——它可以是一个人(母亲、老板),也可以是一个认知任务(公平判断、计划执行),甚至可以是自己的判断(元认知监控)。层不是全局属性,而是情境性的、配置性的。
| DD | 认知架构(本文) | 心理结构(精神分析系列) | 共同的结构特征 |
|---|---|---|---|
| 12DD | System 1:想 | Id:me-without-self | 有运作,无主体意识 |
| 13DD | System 2:我想 | Ego:self-without-purpose | 主体出现,但无方向 |
| 14DD | System 3:我不得不 | Superego:self-with-purpose | 主体有方向,单主体 |
| 15DD | System 4:我们各自不得不,所以 | Cert:self-with-non-dubito | 面向他者,双主体 |
这个同构关系不是事后拼接的。精神分析系列的核心命题"对象决定层,不是阶段决定层"和本文的核心论点"速度不等于层级"指向的是同一件事:层不是全局属性,而是情境性的、关系性的。一个人可以在某些认知任务中运作在System 4,在另一些任务中运作在System 1——正如他可以面对某些对象时运作在Cert层,面对另一些对象时退回Id层。
精神分析系列还提出了一个与本文直接相关的概念:"伪高层覆盖"——实际运作在低层,但用高层叙事遮蔽。这与本文第八章预测六中的"假C"是同一个结构在不同领域的表现:假C就是协商中的伪高层覆盖——表面上达成了15DD的协商结果,实际上一方退回了14DD(让步或屈服)。
三、与既有哲学框架的对标
3.1 叔本华与尼采:卡在14DD
叔本华看到了意志,然后说这东西是苦的,要否定它。尼采看到了同一个意志,说要肯定它,要把它推到极致。但两个人都没有走到15DD——都没有真正处理"对面还有一个同样有意志的主体"这个问题。
叔本华的解法是取消意志(禁欲,审美静观),尼采的解法是把意志做到最强(超人,权力意志)。一个往下压,一个往上拔,但方向都是单主体的。尼采的超人是孤独的,叔本华的圣人也是孤独的。
3.2 萨特:14DD的极致与困境
萨特更典型。"他人即地狱"这句话暴露了他卡在14DD的位置。他看到了对面有一个主体(那个"注视"),但他对这个发现的反应是:对方的自由威胁了我的自由。两个14DD的意志相遇,结果是对抗,不是协商。没有C。
这也是为什么萨特的伦理学始终没有写出来。他晚年一直说要写,写不出来。因为14DD的框架里根本推不出伦理学。伦理学需要15DD,需要"我们各自不得不,所以"。
3.3 康德:摸到15DD的门
康德的定言令式——"把人当作目的而不仅仅是手段"——这就是在要求你承认对面那个主体的独立意志。康德摸到了15DD的门。但康德的问题是他用形式法则来处理这个问题。定言令式是一条静态的规则,不是一个动态的过程。它告诉你"应该怎样",但不告诉你当两个"不得不"真的撞在一起时,C是怎么产出的。
SAE把它变成了过程:两个"不得不"撞到一起,通过互相追问各自的"为什么不得不",剥离手段层面的冲突,直到触及真正的不可让渡项,然后在这个基础上检验互容性,产出C或者接受诚实的no deal。
四、后验证据:神经科学
4.1 System 1与System 2
这两层的实验基础最成熟,不多展开。Kahneman和Tversky的行为实验,加上大量fMRI研究,已经建立了自动反应(杏仁核、基底神经节主导)与刻意反思(前额叶主导)之间的基本dissociation。
4.2 System 2与System 3的部分分离
元认知监控和目标维持/计划在神经基质上有部分分离的证据。
Fleming等人发现,前额叶前部(anterior PFC)损伤的患者在感知任务上表现正常,但元认知精度选择性受损——他们能做判断,但不知道自己的判断有多可靠。这是System 2("我想"的质量监控)的选择性损伤。
与此同时,目标维持和计划主要依赖背外侧前额叶(dorsolateral PFC)和纹状体网络,前瞻记忆(prospective memory)则特别涉及额极(BA10)。这些区域和元认知监控的核心区域(anterior PFC)在解剖上可区分。
更直接的证据是:一项跨生命周期研究发现,执行功能和元认知效率之间没有显著相关。这意味着"监控自己的思考"和"维持并执行目标"可以独立变化——你可以是一个很好的目标执行者但对自己判断的可靠性毫无觉察,反之亦然。
这正是DD序列的预测:13DD和14DD共属9D,所以它们高度重叠但不完全相同,部分分离(partial dissociation)而非完全分离(clean dissociation)恰恰是理论预测的正确模式。
4.3 System 3与System 4的分离
这一层的证据来自多条独立的研究线索。需要前置声明的是:与4.2节中System 2/System 3的部分分离相比,System 3/System 4的分离目前主要是adjacent support(相邻支持),而非direct support(直接支持)。已有实验并未直接以"14DD vs 15DD"为假设进行设计,但多条独立证据的汇聚方向与DD序列的预测一致。
同时需要明确的是,公平相关认知并非整体属于某一单独层级。更准确的分析是将其拆解为横跨多层的认知家族:
12DD层:被剥夺/被抢走的原始损失反应("我的东西没了")。
13DD层:对分配结果的评价性判断("这对不对,我把握多大")。
14DD层:愿不愿意为某个规范付出代价去执行("我要拒绝这个不公平的提案,即使拒绝意味着我也拿不到钱")。
15DD层:意图敏感的、第三方可执行的、双主体整合性公平判断("对方本可以更公平但选择不这样做"),以及在双方底线冲突中产出C的能力。
本文主张的不是"公平感=System 4",而是:最能代表System 4的,是上述15DD层的公平判断子集——意图敏感、第三方、双主体整合且可能产出C的那一类。以下证据指向这一子集的独立性。
4.3.1 公平判断与公平执行的分离
Knoch等人用经颅磁刺激(TMS)干扰右侧背外侧前额叶,发现被试仍然判断提案是不公平的,但不再愿意拒绝。这一dissociation表明:评价不公平("这不对")和执行公平惩罚("我要拒绝")动用的是可分离的认知资源。在DD框架中,前者可被13DD的评价性判断最好解释,后者涉及14DD/15DD层级的目的导向行动。
4.3.2 第三方公平的神经基础
Corradi-Dell'Acqua等人发现,前脑岛在"我被不公平对待"和"别人被不公平对待"两种情境下都激活,而内侧前额叶只在"我自己被不公平对待"时激活。这表明公平判断不仅仅是自我保护的情绪反应——即使你不是当事人,你也会做出公平评价。这一模式与15DD结构最为一致:需要表征他者作为独立目的主体的操作。
4.3.3 第三方惩罚的多阶段神经时间序列
Ginther等人用fMRI研究第三方惩罚,在实验设计中将伤害评估、心理状态评估、两者的整合、以及惩罚决策四个阶段在时间上分离开来。结果发现:伤害评估激活情感/躯体感觉区域,心理状态评估激活心智化网络,整合阶段涉及内侧前额叶和后扣带回,决策阶段动用右侧DLPFC。这个四阶段架构与15DD所预测的"双主体推理"结构一致——需要同时处理施害者的意图和受害者的损失,然后整合,然后行动。
4.3.4 意图敏感的公平判断
在mini-ultimatum game中,TMS干扰右侧TPJ选择性地影响了需要推断对方意图的条件——具体来说,是那些提案者"有其他更公平选项但没选"的情境——而不影响单纯的不平等厌恶。这与DD框架的一个关键区分一致:(1)基于结果的不平等厌恶("我拿得少了",与12DD/13DD结构对应)和(2)基于心理状态的公平评价("对方本可以更公平但选择不这样做",与15DD结构对应),在神经基质上是可分离的操作。
五、后验证据:发展心理学
发展时间线为四层结构提供了一种独立于神经科学的验证路径。如果四个System确实代表不同的结构维度,它们应该在儿童发展中以可识别的顺序出现,且每一层的标志性事件应该与其DD定义对应。
5.1 System 1 (12DD)——6-8个月:想
婴儿从出生起就有反射性反应,但12DD意义上的"想"——对外部世界进行主动的认知操作——大约在6-8个月开始明确化。物体恒常性开始萌芽,婴儿能追踪消失的物体,做简单的因果推断。这是在处理信息,但没有"我在处理"的意识。
需要特别注意的是:婴儿在更早的阶段就有goal-directed behavior(伸手够取物体等),但这是12DD的操作,不是14DD。猫也会伸爪抓老鼠,这不是意志。
5.2 System 2 (13DD)——2-3岁的代表性显影窗口:我想
标志性事件:terrible twos。不是"孩子变得不听话",而是否定第一次经由他者折回自身——通过否定他者的意志,"我"和"你"同时被凿出来。SAE的Terrible Twos论文详细论证了这一发生机制:不是先有"我"再对抗,而是通过对抗"我"第一次出现。
13DD完成后的后续标志:怕死(3-8岁)。这不是"疼"(12DD),不是"危险来了要跑"(12DD),而是"我会消失"。这需要一个已经被凿出来的"我",然后把"不存在"这个抽象状态应用到"我"身上。3岁之前的孩子不怕死,因为他还没有一个足够稳定的"我"来承载这个恐惧。
元认知不确定性监控的发展心理学研究证实,学龄前儿童开始能够反思自己对决定的确定性或不确定性,且这一能力在3到5岁之间显著发展。
5.3 System 3 (14DD)——10-14岁的代表性显影窗口:我不得不
标志性事件:terrible teens(青春期叛逆)。不是"我不想做"(13DD,偏好),而是"你不能让我做"。这个"不能让我"里面包含了完整的意志结构——我知道你要什么,我知道你有权力,我知道反抗有代价,但我仍然拒绝,因为有一个东西对我而言是不可让渡的。
SAE的Terrible Teens论文论证了14DD的发生机制:"不得不"从13DD余项("我存在但不知道为了什么")的持续积累中涌现。叛逆不是14DD的发生机制本身,而是其行为副产品和确认条件之一。14DD的形式是"主体不能没有自己的目的";"不得不X"是这一形式在经验中的最小具体化——不依赖外部许可,压制了也会以其他形式回来。
这里必须做一个关键区分:goal-directed behavior(目标导向行为)不等于意志(will)。婴儿伸手够取物体是目标导向的,但那不是意志。意志的定义性特征是"不得不"——对代价的意识和对目的的承诺共存。这种结构在10-14岁左右才稳定出现,对应于前额叶控制系统的成熟以及自我同一性建构的关键期。
5.4 System 4 (15DD)——terrible telos:承认的发生学
15DD的发展轨迹需要区分proto-forms(前驱形式)、入口事件和成熟形式。
Proto-forms(3岁前后)。 发展心理学的证据表明,joint commitment和early collaboration的前驱形式在3岁前后就已出现。Gräfenhain等人的研究显示,3岁儿童在joint commitment条件下已经更可能等待、帮助和接替伙伴。Hamann等人发现3.5岁儿童在自己获得奖励后仍倾向于继续与伙伴合作。但这些前驱形式里没有两个平等的14DD在碰撞——只有一个尚未稳定的13DD在和成年人的框架互动。它们是15DD的前提经验,不是15DD本身。
入口事件:第一次两个平行telos的高强度碰撞。 SAE的Terrible Telos论文论证了:15DD的典型入口不是与父母的冲突(那是13DD和14DD的发生场所),而是初恋——第一次两个平行的、平等的14DD相遇。你不能不爱这个人(14DD),他/她也不能不爱你(14DD),但你的"不得不"和他/她的"不得不"不是同一个东西。
这个碰撞和前两个terrible的区别在于对象的结构性变化:terrible twos的冲突对象是权威(父母),权力不对等;terrible teens的冲突对象仍然是权威,但权力开始被挑战;terrible telos的冲突对象是另一个平等的14DD——没有权力差,没有养育关系的庇护,纯粹的telos对telos。
15DD的入口之所以痛苦,正是因为这里没有退路:12DD的谈判("我让一步你让一步")把telos降格为偏好,你不能让一个"不得不画画"的人让半步去当半个会计;14DD的压制("我的不得不比你的重要")就是殖民。唯一的出路是15DD——承认他者的telos是他者自己的,不试图消解冲突,不撤回承认。
成熟形式(贯穿一生,不普遍完成)。 15DD的成熟标志不是学会协商的技巧,而是承认从痛苦的选择变成不需要选择的存在状态——碰撞仍在,但承认不再需要额外的力量来维持。Terrible Telos论文指出,15DD的terrible贯穿一生,有三种持续的痛苦形态:做不到(知道应该承认但做不到),做到但牺牲了自己(殖民了自己而非他者),做到但不自知地殖民(用爱的形式承载控制的内容)。相当比例的成年人在高压条件下退回System 3操作(坚持己见或屈服),而不是产出C。
5.5 发展时间线汇总:三次terrible
| DD层级 | System | 核心能力 | Terrible T | 冲突对象 | 代表性显影窗口 |
|--------|--------|---------|-----------|---------|-------------|
| 12DD | System 1 | 想 | — | — | 6-8个月 |
| 13DD | System 2 | 我想 | Terrible Twos | 父母(权威) | 2-3岁入口;3-8岁展开 |
| 14DD | System 3 | 我不得不 | Terrible Teens | 父母/权威(权力被挑战) | 10-14岁 |
| 15DD | System 4 | 我们各自不得不,所以 | Terrible Telos | 平等的另一个14DD(初恋) | proto-forms 3岁起;入口青春期后;成熟形式贯穿一生,不普遍完成 |
这个时间线有两个结构特征。第一,每一步的跨度在缩短但难度在增加。12DD到13DD大约3年,13DD到14DD大约5-7年,14DD到15DD可能需要一辈子甚至跨不过去。第二,冲突对象在递进:从权威(权力不对等)到平等的他者(权力对等),对应着主体从"被动遭遇否定"到"主动面对另一个完整的主体"。
六、后验证据:比较心理学与Tomasello
Tomasello的shared intentionality hypothesis为System 4提供了来自比较心理学的独立支持,但两者之间的映射需要精确处理。
Tomasello区分了两种认知升级:joint intentionality(个体间协作)和collective intentionality(文化群体内的规范性互动)。他的核心论点是:这些能力是人类独有的,因为人类在演化史上的某个节点变得彼此相互依赖。
在DD框架中,这个区分有更精确的表述:
黑猩猩能做什么? 黑猩猩可以选择合作伙伴,可以在合作有利时发起合作行为。Melis等人的实验表明,黑猩猩能够在需要合作时选择性地招募合作者,甚至能挑选更好的合作者。这是14DD操作——"我的目的是A,与这个伙伴合作是达到A的手段,所以我选择合作"。目的是单主体的,合作是手段。
黑猩猩不能做什么? 黑猩猩不能做normative joint commitment——当自己的收益已经到手后仍然为了伙伴的利益而继续合作。Hamann等人发现2.5岁和3.5岁的人类儿童在这一点上已经出现差异,3.5岁的儿童在自己得到奖励后仍然倾向于维持与伙伴的合作,表现出初步的"我们"意向结构。
这是14DD和15DD的分水岭。14DD的合作在"我的目的达成"的瞬间就失去了动力。15DD的合作在"我的目的达成"之后仍然维持,因为"我们共同的承诺"本身成为了一个独立的目的。
6.1 We-intentionality的精确定位:14DD→15DD的桥,而非15DD本身
Tomasello的we-intentionality(共享意向性)不应被映射为15DD,也不应被映射为14DD。它最精确的位置是14DD到15DD之间的桥——而且是一座窄桥。
在纯粹的14DD中,他者是手段或障碍:"我有目的,你要么帮我,要么挡我的路。"在we-intentionality中,情况发生了微妙但关键的变化:就这件事,我开始意识到你也有目的。这是双向的——双方都在尝试理解对方也是目的——但它是窄的,就事论目的。合作捕猎结束,这个认识就不保留了。下一次需要重新搭桥。
这就是为什么3岁儿童能在具体的joint commitment任务中表现出"你也是目的"的行为,但离开那个任务就不保留——他们在桥上走了几步,但没有走到对岸。走到对岸(真正的15DD)需要14DD先稳定(意志成熟),然后才能把"就事论目的"泛化为"你就是目的"。
真正的15DD和we-intentionality的区别在于:15DD对他者作为独立目的的确认不绑定在某一件具体事情上。我确认你是目的,不是因为我们现在要一起做什么,而是因为你是一个有自己"不得不"的主体。
因此,DD框架在Tomasello的joint intentionality和collective intentionality之间识别出了一个更精细的光谱:
14DD — 我有目的,你是手段或障碍。黑猩猩的合作捕猎在这里。
14DD→15DD的桥(we-intentionality) — 就这件事,双方开始意识到对方也有目的。双向的,但窄的,就事论目的。事件结束不保留。3岁儿童的joint commitment行为,以及Tomasello描述的大部分human collaborative activity在这里。
单向15DD(本文的System 4) — 我确认你是独立目的,不限于这一件事。追问你的不得不,产出C。单方可启动。
薄双向15DD — 双方互相确认对方为独立目的,共同产出C。但仍然可撤回。
16DD(mutual non-dubito,超出本文射程) — 不仅双向确认,而且不疑。不可撤回的本体论姿态的互相锁定。
Tomasello用"collective intentionality"一个词覆盖了从桥到16DD的整个区间,这是他框架中的一个模糊地带。DD序列将这个区间拆成了可区分的层级,每一层有自己的启动条件、持续性特征和撤回条件。这也解释了为什么Tomasello的框架能很好地描述情境性合作行为的发生(桥),但在解释制度和规范的持久性时变得吃力——制度需要的不是每次重新搭桥的we-intentionality,而是至少薄双向15DD甚至16DD的持久支撑。
6.2 Tomasello三能力的DD重读
Tomasello提出,这种从个体意向性到共享意向性的认知升级,需要三种额外能力:递归思维(与他人在共享行动中协调),客观思维(解决视角分歧),以及社会道德和规范性概念(解决偏好分歧)。在DD框架中,这三种能力不是并列的附加项,而是15DD相对于14DD多出的那个维度的三个面向——都指向同一个结构性要求:承认对面是一个独立的、有自己"不得不"的主体。
七、对Kahneman框架的具体修正
7.1 速度与层级的独立性
Kahneman框架的根本分类标准是加工速度:快=System 1,慢=System 2。本文的核心修正是:速度和认知层级是两个独立的维度。
一个被充分训练的15DD操作可以比一个生疏的13DD操作更快。一个经验丰富的外交官在几秒内就能感受到谈判桌上的力量格局和各方的核心利益,不需要"慢慢分析"。但这个"直觉"里包含的结构复杂度是15DD级别的——它同时持有多个独立主体的目的结构。
类比:围棋大师的直觉走子是"快"的。Kahneman会把它归System 1。但那个直觉里压缩了极其复杂的结构性判断,其信息处理复杂度远超普通人的System 2分析。"快"不等于"简单","慢"不等于"复杂"。
7.2 公平相关认知的四层拆分
在Kahneman框架中,公平感被归为System 1(快速情绪反应)或System 1与System 2的冲突(情绪与理性的拉扯)。本文的修正不是把公平感整体从System 1搬到System 4,而是指出:公平相关认知是一个横跨多层的认知家族,不同层的公平反应具有不同的结构复杂度。
12DD层: 被剥夺的原始损失反应。"我的东西没了。"狗的骨头被抢也是这个反应。不需要"公平"概念,只需要损失检测。
13DD层: 对分配结果的评价性判断。"这对不对?我的把握有多大?"有了主体意识,能评估自己的判断,但还没有目的导向的行动。
14DD层: 愿不愿意为规范付出代价。"我要拒绝这个不公平的提案,即使拒绝意味着我也拿不到钱。"这需要意志——对代价的意识和对原则的承诺共存。
15DD层: 意图敏感的、第三方可执行的、双主体整合性公平判断。"对方本可以选择更公平的分法但没有"——这需要同时表征对方的选择集、意图和自己的期待,然后做出结构性评价。在极端情况下,还需要在双方底线冲突中产出C。
Kahneman的快/慢二分法无法容纳这个四层拆分,因为四个层的反应都可以是"快"的(都可以被训练压缩),但结构复杂度完全不同。这正是速度作为分类标准的根本缺陷。
最能代表System 4独特性的,是上述15DD层的公平判断子集——它需要双主体结构的同时在场,这在12DD-14DD的任何一层都做不到。
7.3 速度的U型曲线:为什么Kahneman会把15DD误认为12DD
快/慢二分法最多只能容纳两个system——因为一旦你用速度做分类标准,你最多只能分出"快"和"慢"两类。所有介于两者之间的操作要么被强行归入其中一类,要么被描述为"两个system的冲突"。
换成结构复杂度做分类标准,这个限制立刻消失。12DD到15DD的维度递增给出了四个自然层级,每个层级有独立的定义性特征(无主体的认知,有主体的反思,有代价意识的意志,双主体的协商),而不是仅仅在速度谱上的不同位置。
但这里有一个更深的问题需要回答:如果每多一个维度就多一个自我校验环路,15DD应该比14DD更慢才对。为什么15DD反而可以快?
答案在于四层的速度不是线性递增的慢,而是一个U型曲线:
12DD——快。 没有自我指涉,没有内部校验环路。纯粹的刺激-响应,直通。
13DD——慢。 "我"出现了,自我指涉开始。"我在想"这个反思本身引入了一个前所未有的内部循环,需要时间运行。慢的来源是"我是什么"的不确定性。
14DD——也慢。 "不得不"和代价之间的张力需要持续校验——我真的不得不吗?代价我真的承受得住吗?慢的来源是"我的不得不是不是真的"的不确定性。
15DD——又快。 方向发生了根本性翻转:从"我"转向了"你"。当方向翻转时,13DD和14DD积累的自我校验不确定性反而被锁定了。因为做15DD的前提是14DD已经稳定——"我不得不什么"这个问题已经有了答案,不再需要持续校验(这就是不可跳跃性的另一面)。13DD和14DD的所有内部校验环路仍然在运行,但从前台移到了后台,成为了背景条件。认知资源全部释放给了唯一的新变量:"对面这个人不得不什么。"
所以12DD的快和15DD的快是结构上对称的两种快:12DD快是因为自我还没有出现,15DD快是因为自我已经稳定到不再需要校验。一头一尾,中间是自我的建立过程。Kahneman把15DD的快误认为12DD的快,正是因为他只看速度,不看速度背后的结构:12DD的快是空的快(里面没有主体),15DD的快是满的快(里面什么都有,但不再需要逐步校验)。
7.4 人类特殊性的DD定位:不是自意识,不是意志,是他者目的感知
这个U型曲线揭示了一个比Kahneman修正更深远的论点:人类在演化上的特殊性到底在哪里?
传统叙事给出了多种答案:理性(12DD),自意识(13DD),语言(跨层),工具使用(12DD),道德(14DD)。但黑猩猩有初步的因果推理(12DD),通过了镜子测试(13DD的前驱),使用并制造工具(12DD),甚至在某些条件下展现出初步的规范敏感(14DD边缘)。这些都不是人类独有的——它们是人类和其他灵长类共享的,只是程度不同。
DD框架给出的定位是:人类特殊性最可能在15DD——以直觉级速度感知"对面那个存在有他自己的不得不"的能力。这一能力在现有证据中尚未被直接测量为独立层级,但本文汇集的多条相邻证据——意图敏感的公平判断、第三方惩罚的多阶段整合、shared intentionality的人类独特性——共同指向这一方向。
这不是一个道德劝告("你应该尊重别人"),而是一个关于人类认知架构的结构性假说。人类之所以能够建立语言、制度、文化、法律、市场、科学共同体,不是因为我们更聪明(12DD),不是因为我们有自我意识(13DD),不是因为我们有意志(14DD),而是因为我们能在极短时间内感受到另一个主体的态度、意图和处境,并以此为基础做出响应——而且这个操作快到不像是"推理"的产物。
Theory of Mind研究反复发现人类对他人意图的判断速度快到不像是推理的产物,传统解释是"模块化"——大脑有一个专门的ToM模块。DD框架的解释更精确:不是模块化,是15DD的方向翻转把自我校验从前台移到了后台,释放了全部认知带宽给他者感知。模块只是这个结构性转移的神经实现方式。
黑猩猩卡在14DD,它的每一次合作都要从头做自我指涉的利益计算("这对我有没有好处"),所以慢。人类跨入15DD之后,自我校验被锁定为背景条件,认知资源全部释放给了"对面这个人怎么了",所以快。这个速度差就是人类合作能力爆炸性增长的认知基础。
这也是SAE框架名字的真正含义落地的地方。Self-as-an-End——每一个self都是end。不仅"我"是目的,"你"也是目的。这就是15DD。这就是人之为人。如果本文的四层架构成立,这就不仅是一个伦理命题,而是一个关于人类认知架构的结构性假说:我们可能是一个能以直觉速度把对面的存在当作独立目的来感知的物种。
八、非平凡预测
以下预测从DD序列推出,现有框架无法做出同样的预测,且可以被实验证伪。
8.1 预测一:被抢与被不公平对待的神经分离
推导: 12DD(纯损失反应)和15DD(公平判断)是不同层级的操作,即便主观报告都是"愤怒"。
预测: 在控制物质损失量相同的条件下,三种情境应该产生不同的神经签名:
(1)随机损失(自然灾害,彩票输了)——激活损失/威胁回路(杏仁核,vmPFC),不激活公平判断区域。
(2)被不公平分配(另一个人有选择但选了不公平的分法)——激活损失回路,且额外激活前脑岛、TPJ、dmPFC。
(3)第三方不公平(别人被不公平对待)——激活前脑岛、TPJ、dmPFC,但不激活自我相关的损失回路。
关键区分: 条件(1)和条件(2)在物质结果上可以完全相同(我都损失了同样的金额),在主观情绪报告上可能都是"愤怒",但神经基础不同。Kahneman框架无法做出这个预测,因为两者都是"快速负面情绪反应",都归System 1。
现有证据状态: 强adjacent support但缺乏决定性的直接对比实验。已有研究比较了自我vs第三方不公平、故意vs非故意不公平、人类vs电脑提案者,但没有在同一设计中比较随机损失、被盗和不公平分配。
8.2 预测二:System 4能力与网络连接性(而非局部激活强度)相关
推导: 15DD是整合性操作(同时容纳两个目的结构并产出C),其神经标志应该是网络级的功能连接,而非某个局部区域的激活峰值。
预测: 在协商任务中,System 4操作的质量(是否产出真C,而不是假C或单方面让步)应该与前脑岛-DLPFC-TPJ的功能连接强度相关,而非与任何单一区域的激活强度相关。这区别于System 3(可以由DLPFC的目标维持单独支撑)和System 2(可以由anterior PFC的元认知监控单独支撑)。
现有证据状态: 这是目前六条预测中最有实验支持的一条。Bellucci等人发现信任和互惠由静息态子网络而非单一脑区预测;Cáceda等人将互惠与显著性网络-中央执行网络之间的连接性联系起来;Shaw等人发现动态功能连接状态能区分合作与竞争。但几乎没有研究直接询问连接性是否优于局部激活来预测真正的协商质量。
值得注意的是,SAE精神分析第四篇(Cert)独立地给出了几乎相同的预测:Cert层的候选神经构型是DMN(自我参照)、frontostriatal(目标导向)和ToM网络(TPJ/mPFC)三者的同时高度协调激活——不是三个网络分别激活,而是三者之间的功能连接强度达到显著高于baseline的协调水平。两篇论文从不同切面(认知结构 vs 心理结构)独立推出了同一条神经预测,这本身就是DD序列先验有效性的一个间接支持。
8.3 预测三:稳定的意志能力是协商的必要前提,独立于心智化能力
推导: DD序列的不可跳跃性要求:15DD(协商)不能跳过14DD(意志)。没有稳定的"我不得不",就不可能产出C——因为你不知道什么可以让,什么不可以让。
预测: 在同时测量心智化能力(Theory of Mind)、意志稳定性(目标屏蔽、延迟折扣、停止信号任务等自我控制指标)和协商质量(是否产出真C)的实验中,意志稳定性应该解释协商质量中ToM无法解释的独立方差。具体来说,高ToM但低意志稳定性的被试应该在协商中倾向于让步或讨好,而非产出C。
现有证据状态: 间接支持较强但缺乏直接检验。Jäger等人发现自我调节干预帮助低权力谈判者克服结构性劣势;Winter等人发现攻击性罪犯可以展现完好的ToM但受损的共情;心理变态研究表明读心能力完好但合作受损。但几乎没有研究在同一样本中同时测量ToM、自我控制和协商质量。
8.4 预测四:叛逆期被压制→成年协商能力延迟
推导: 如果14DD是15DD的必要发育前提,那14DD没有充分发育的个体在15DD上应该受阻。
预测: 在权威型家庭中成长、叛逆期被强力压制的青少年,成年后在协商任务中应该更倾向于退回14DD(坚持己见或屈服),而不是产出C。具体预测三种退化模式:(1)固执(14DD的残留,意志有但不会协商),(2)顺从(退回13DD,放弃意志),(3)交替于固执和顺从之间(14DD不稳定)。真正的C产出应该显著减少。
现有证据状态: 纵向研究支持发育路径的合理性但尚未直接检验"不得不→协商"链条。Hare等人发现母亲的心理控制预测青少年与母亲及同伴的自主性降低;Allen等人发现16岁的自主性冲突预测25岁时同伴评定的敌意;Ha等人发现青春期早期的破坏性养育预测15年后成人浪漫关系中的胁迫行为。但没有研究直接将被压制的青少年自主性与成人协商结构(支配、顺从或真正的妥协)联系起来。
8.5 预测五:公平判断速度与训练/熟悉度相关,而非与认知层级相关
推导: 速度是训练程度的函数,不是认知层级的函数。公平判断是15DD操作,但因终身训练而被压缩为快速响应。
预测: 在跨文化ultimatum game实验中,反应速度应该随着文化对特定公平规范的熟悉度变化(熟悉的规范被违反时反应更快),而反应模式(拒绝被感知为不公平的提案)应该跨文化保持相对稳定。
现有证据状态: 最弱的一条预测。跨文化ultimatum game的行为数据存在,但缺乏系统的反应时间比较。现有meta分析显示回应者行为跨地区有所不同,这部分支持也部分挑战了该预测——需要进一步区分反应模式的文化差异和反应速度的文化差异。
8.6 预测六:高认知负荷下,高DD层级操作比低DD层级操作更容易崩溃
推导: 四层结构是包含关系——System 4内部同时运行着System 3、2、1。这意味着15DD操作即使被训练成快速直觉,其底层仍然包含更多的子系统协调需求。如果认知资源被高负荷任务占用,高DD层级的操作应该比低DD层级的操作更容易退化,因为可供协调的资源减少了。
预测: 在执行快速的15DD判断(如公平感直觉)时,如果同时给予被试极高的无关认知负荷(如工作记忆任务),其公平判断的质量应该比12DD的纯损失反应更容易受到干扰。具体来说:高负荷条件下,被试应该更容易从15DD的公平判断退化为12DD的纯损失厌恶——不再区分"不公平"和"我吃亏了",而是把两者混为一谈。
现有证据状态: 有部分间接支持。Harris等人发现认知负荷改变了ultimatum game中对不公平提案的接受率,特别是当提案者身份被操纵时。这至少表明社会认知层面的公平判断不是完全负荷无关的。但现有研究没有在同一设计中比较不同DD层级操作在负荷条件下的差异性退化。
与预测二的关系: 本预测和预测二(高负荷下13DD/14DD合并)是同一个逻辑的不同应用。预测二说的是同一个D级内的精细区分在负荷下塌缩;本预测说的是不同DD层级的操作在负荷下的差异性退化——越高的层级越容易退化。两条预测共同构成了DD序列在认知负荷条件下的完整行为预测。
8.7 预测七:真C,假C,与诚实的no deal——一个核心实验范式
推导: 这是本文最核心的非平凡预测,也指向现有认知科学实验范式的根本盲区。
现有的fairness范式几乎全都是二选一结构:接受/拒绝(ultimatum game),投/不投(trust game),合作/背叛(prisoner's dilemma)。没有人在测"创造第三个选项"的能力。但15DD的核心操作恰恰是C的产出——一个既不是A也不是B的新方案。
C的产出过程: 两个"不得不"撞到一起,双方通过互相追问"为什么不得不",剥离手段与真正的不可让渡项。追问的方向不是"你能不能让一点",而是"你为什么不得不"。我说我不得不要A,你问我为什么,我往下挖,发现A其实是为了保障A',A'才是我真正不可让渡的东西,A只是我以为的唯一手段。同理,你的B背后是B'。C就是同时满足A'和B'的新方案。
三种结果:
(1)互容——真C。 A'和B'之间不矛盾,C存在。之前的冲突是假的,是手段层面的碰撞。追问把假冲突剥掉了,C自然浮现。绝大多数日常冲突属于这一类。
(2)不互容——诚实的no deal。 A'和B'在根本上矛盾,C不存在。这时候no deal是唯一诚实的结果。
(3)未完成——追问没到底。 时间用完了,还没挖到A'和B'。15DD操作启动了但未完成。
可操作化的实验设计: 两个被试各自拿到一个目标(A和B),表面上互斥。但A和B背后各有一个deeper motive(A'和B'),A'和B'之间存在至少一个兼容方案C。被试可以互相提问,限定时间,有no deal选项。
关键测量: 不是"有没有达成协议"(这是现有范式的标准),而是"有没有追问到A'和B'"。一对被试可能达成了协议但其实是一方让步(假C),也可能no deal但双方都挖到了A'和B'(15DD操作正确但发现真的不互容或时间不够)。
非平凡预测的神经对应:
真C——双方前脑岛-DLPFC-TPJ的功能连接同步增强。
假C——一方的DLPFC目标维持活动下降(14DD退出),类似Knoch实验里"知道不公平但不行动"的模式。
诚实的no deal——双方TPJ和dmPFC高激活(充分的他者建模),但没有整合阶段的连接性增强。
为什么这是最核心的预测: 因为它指出了整个研究领域的范式盲区。从Kahneman的框架出发,你没有理由去测C的产出——因为你只有两个system,而协商被分解为"System 1的情绪冲动"加"System 2的理性抑制"。只有从DD序列出发,你才知道15DD要求一个新的实验范式,而这个范式里最重要的变量不是反应时间、不是激活强度,而是C的存在与否。
九、教育学应用
四层认知结构在儿童发展中的依次涌现,直接产出一个教育学原则:在每个System即将上线的窗口期,给对的材料,效果最大。
13DD窗口期(3-8岁):死亡教育。 不是回避孩子对死亡的恐惧,而是正面面对它。"我会消失"是13DD的标志性体验,它伴随着焦虑是正常的。教育的功能不是消除这个焦虑,而是帮助孩子建立一个足够稳定的"我"来承载它。
14DD窗口期(10-14岁):康德。 叛逆期的孩子刚建立了14DD——"我不得不"——他正处在意志最强烈、最纯粹、也最痛苦的时期。如果这时候给他叔本华或尼采,他会在14DD里越陷越深。但如果给他康德的定言令式——"把人当作目的而不仅仅是手段"——效果完全不同。这不是在否定他的意志,而是在告诉他:你的"不得不"是合法的,但对面那个人也有"不得不"。这恰好是从14DD到15DD的跳板。
15DD窗口期(14岁起):协商训练,不是辩论训练。 辩论是两个14DD在对撞——我要说服你,你要说服我。协商是15DD——我为什么不得不,你为什么不得不,有没有C。训练的核心不是修辞技巧,而是追问能力:能不能把对方的表面诉求(B)追问到真正的不可让渡项(B'),同时也把自己的表面诉求(A)追问到A'。
一个反面预测: 如果14DD窗口期被压制(叛逆被强力制止),15DD的发育会延迟甚至受阻。这意味着过度权威型的教育方式可能在短期内获得"听话"的效果,但在长期上损害了孩子产出C的能力——他要么固执(14DD残留),要么顺从(退回13DD),要么在两者之间交替,但很难做真正的协商。
十、结语
本文的出发点是一个简单的问题:公平相关认知为什么难以归类?
Kahneman框架的流行表述会说:公平反应快,所以是System 1。本文的回答是:公平相关认知是一个横跨12DD到15DD的多层认知家族。其中最能代表System 4独特性的子集——意图敏感、第三方、双主体整合性公平判断——不是"快速情绪反应",而是被充分训练的高结构复杂度操作。
但本文到达了一个比修正Kahneman更深远的结论。四层架构的速度U型曲线——12DD快(无自我),13DD慢(自我出现),14DD慢(自我校验),15DD又快(方向翻转,自我校验锁定为背景)——揭示了Kahneman把15DD误认为12DD的结构性原因,也揭示了人类特殊性的DD定位。
人类在演化上的特殊性不在于自意识(13DD,黑猩猩也有前驱),不在于意志(14DD,黑猩猩也有目标导向行为),而在于15DD——以直觉级速度感知"对面那个存在有他自己的不得不"的能力。人类之所以能够建立语言、制度、文化、法律、市场、科学共同体,是因为我们能在极短时间内感受到另一个主体的态度、意图和处境,并以此为基础做出响应。这个速度来自15DD的方向翻转:自我校验被锁定为背景条件,全部认知带宽释放给了他者感知。
这也是SAE框架名字的真正含义在认知科学中落地的地方。Self-as-an-End——每一个self都是end。不仅"我"是目的,"你"也是目的。这就是15DD。这就是人之为人。如果本文的四层架构成立,这就不仅是一个伦理命题,而是一个关于人类认知架构的结构性描述。
本文提出的七条非平凡预测,尤其是关于"C的产出"的实验范式,指向了现有认知科学的一个根本盲区。这个盲区不是因为技术限制或数据不足,而是因为理论框架的限制——如果你的框架里只有两个system,你就不会去设计测量C的实验,因为你不知道C应该存在。
先验引路,后验查证。DD序列是先验结构,既有文献是后验数据,两者在本文中的汇合不是巧合,而是方法论的必然。接下来的工作是:让后验跑在前面——设计并执行本文提出的实验,特别是"不得不追问"范式,让四层架构从哲学假说变成可检验、可证伪、可改进的认知科学理论。
致谢
感谢Zesi Chen对SAE框架长期的批评与共建。本文的四层结构在与Claude(Anthropic)的对话中成形,其中System 4的定义——"面向对面那个主体,为什么会那样思考"——是对话中的关键转折点。这一工作方式本身就是15DD的实例:先验结构由人类提供,后验查证由AI加速,C是两者都独自做不到的东西。
参考文献
1. Kahneman, D. (2011). Thinking, Fast and Slow. Farrar, Straus and Giroux.
2. Fleming, S. M., Weil, R. S., Nagy, Z., Dolan, R. J., & Rees, G. (2010). Relating introspective accuracy to individual differences in brain structure. Science, 329(5998), 1541-1543.
3. Knoch, D., Pascual-Leone, A., Meyer, K., Treyer, V., & Fehr, E. (2006). Diminishing reciprocal fairness by disrupting the right prefrontal cortex. Science, 314(5800), 829-832.
4. Corradi-Dell'Acqua, C., Civai, C., Rumiati, R. I., & Fink, G. R. (2013). Disentangling self- and fairness-related neural mechanisms in the ultimatum game. Social Cognitive and Affective Neuroscience, 8(4), 424-431.
5. Ginther, M. R., Bonnie, R. J., Hoffman, M. B., Shen, F. X., Simons, K. W., Jones, O. D., & Marois, R. (2016). Parsing the behavioral and brain mechanisms of third-party punishment. Journal of Neuroscience, 36(36), 9420-9434.
6. Tomasello, M. (2014). A Natural History of Human Thinking. Harvard University Press.
7. Tomasello, M., & Carpenter, M. (2007). Shared intentionality. Developmental Science, 10(1), 121-125.
8. Melis, A. P., Hare, B., & Tomasello, M. (2006). Chimpanzees recruit the best collaborators. Science, 311(5765), 1297-1300.
9. Hamann, K., Warneken, F., Greenberg, J. R., & Tomasello, M. (2011). Collaboration encourages equal sharing in children but not in chimpanzees. Nature, 476(7360), 328-331.
10. Warneken, F., & Tomasello, M. (2006). Altruistic helping in human infants and young chimpanzees. Science, 311(5765), 1301-1303.
11. Sanfey, A. G., Rilling, J. K., Aronson, J. A., Nystrom, L. E., & Cohen, J. D. (2003). The neural basis of economic decision-making in the Ultimatum Game. Science, 300(5626), 1755-1758.
12. Baron, J., & Gürçay, B. (2017). A meta-analysis of response-time tests of the sequential two-systems model of moral judgment. Memory & Cognition, 45(4), 566-575.
13. Bellucci, G., Chernyak, S. V., Goodyear, K., Eickhoff, S. B., & Krueger, F. (2017). Neural signatures of trust in reciprocity: A coordinate-based meta-analysis. Human Brain Mapping, 38(3), 1233-1248.
14. Cáceda, R., Prendes-Alvarez, S., Hsu, J. J., Tripathi, S. P., Kilts, C. D., & James, G. A. (2015). The neural correlates of reciprocity are sensitive to prior experience of reciprocity. Behavioural Brain Research, 283, 243-254.
15. Shaw, D. J., Czekóová, K., Staněk, R., Mareček, R., Urbánek, T., Špalek, J., Kopečková, L., Řezáč, J., & Brázdil, M. (2018). A dual-fMRI investigation of the iterated Ultimatum Game reveals that reciprocal behaviour is associated with neural alignment. Scientific Reports, 8(1), 10896.
16. Sazhin, D., et al. (2020). Strategic bargaining and connectivity changes involving TPJ and anterior insula. Experimental Brain Research.
17. Jäger, A., Loschelder, D. D., & Friese, M. (2017). How self-regulation helps negotiators to achieve better outcomes. Journal of Economic Psychology, 58, 29-42.
18. Hare, A. L., Szwedo, D. E., Schad, M. M., & Allen, J. P. (2015). Undermining adolescent autonomy with parents and peers. Social Development, 24(1), 177-196.
19. Allen, J. P., Hauser, S. T., O'Connor, T. G., & Bell, K. L. (2002). Prediction of peer-rated adult hostility from autonomy struggles in adolescent-family interactions. Development and Psychopathology, 14(1), 123-137.
20. Ha, T., Otten, R., McGill, S., &"; Dishion, T. J. (2019). The family and peer origins of coercion within adult romantic relationships. Merrill-Palmer Quarterly, 65(3), 286-309.
21. Wittmann, M. K., Kolling, N., Faber, N. S., Scholl, J., Nelissen, N., & Rushworth, M. F. S. (2016). Self-other mergence in the frontal cortex during cooperation and competition. Neuron, 91(2), 482-493.
22. Han Qin (2024). Systems, Emergence, and the Conditions of Personhood. DOI: 10.5281/zenodo.18528813.
23. Han Qin (2024). Internal Colonization and the Reconstruction of Subjecthood. DOI: 10.5281/zenodo.18666645.
24. Han Qin (2024). The Complete Self-as-an-End Framework. DOI: 10.5281/zenodo.18727327.
25. Han Qin (2026). SAE Psychoanalysis I: Id — The Me Without Self. DOI: 10.5281/zenodo.19321143.
26. Han Qin (2026). SAE Psychoanalysis II: Ego — The Self Without Purpose. DOI: 10.5281/zenodo.19321314.
27. Han Qin (2026). SAE Psychoanalysis III: Superego — The Self With Purpose. DOI: 10.5281/zenodo.19321417.
28. Han Qin (2026). SAE Psychoanalysis IV: Cert and Unification. DOI: 10.5281/zenodo.19321534.
29. De Neys, W. (2023). Advancing theorizing about fast-and-slow thinking. Behavioral and Brain Sciences, 46, e111.
30. Gräfenhain, M., Behne, T., Carpenter, M., & Tomasello, M. (2009). Young children's understanding of joint commitments. Developmental Psychology, 45(5), 1430-1443.
31. Han Qin (2026). What Terrible Twos Actually Is: The Structural Genesis of Self-Awareness. DOI: 10.5281/zenodo.19044827.
32. Han Qin (2026). What Terrible Teens Actually Is: The Structural Genesis of Purpose. DOI: 10.5281/zenodo.19201631.
33. Han Qin (2026). What Terrible Telos Actually Is: The Structural Genesis of Non Dubito. DOI: 10.5281/zenodo.19248039.