Self-as-an-End
Temporal Arts Series · Essay IV

Chisel and Construct: The Universal Structure of Temporal Arts
Essay IV: Theatre and Film
凿与构:时间性艺术的通用结构·第四篇:话剧/电影

DOI: 10.5281/zenodo.18989755  ·  CC BY 4.0
Han Qin · 2026
EN
中文

The first three essays tested Arise-Settle-Unfold-Fix in audition (music), cross-channel (opera / Chinese opera), and kinesthetic (dance) domains. Each essay added a new mechanism: Essay I established the basic cycle, Essay II introduced cross-channel chisel, Essay III proved the cycle does not depend on any particular sensory channel.

Now we enter the final testing ground: theatre and film.

Theatre and film are special in two respects. First, narrative becomes the primary channel — not sound, not bodily movement, but "story" itself carries the chisel-construct cycle. Second, film is the temporal art form with the greatest number of simultaneous channels — narrative, dialogue, visual composition, editing rhythm, music, sound design all running at once.

This means film can run multiple chisel-construct cycles simultaneously, each at a different level, operating independently yet mutually influencing one another. This is a degree of complexity untouched in the first three essays.

At the same time, the more channels available, the greater a new risk becomes: pseudo-chisel.

I. How Narrative Runs the Chisel-Construct Cycle

Isolate narrative first, without other channels. Theatre is the purest form — essentially only dialogue and performance; no editing, no score (or very little), the visual channel reduced to stage design.

The narrative chisel-construct cycle is fully isomorphic with that of music; only the material differs:

Arise: The story's opening establishes a "world" and a set of character relationships. Your brain begins predicting — how will these characters interact? Which direction will the conflict take? You form a narrative predictive model.

Settle: Plot development confirms your model. Characters behave consistently with your expectations of their personalities; conflict advances according to the logic you anticipated. You feel: "I understand what this story is doing."

Unfold: At some moment, your narrative predictive model is broken. An unanticipated turn; a character does something you thought they would not do; a conflict you thought resolved reappears in a new form. But as with musical Unfold — this is not random; it follows a logic comprehensible in retrospect but not predictable in advance.

Fix: The story's ending. But having passed through Unfold, the same characters now carry different weight in your psychology. The ending is not a return to the beginning; it is closure carrying the trace of chisel.

Cao Yu, Thunderstorm (雷雨).

A four-act spoken drama with an extremely complete chisel-construct cycle.

Arise: The Zhou mansion; its surface order and respectability establish a "world." You form a predictive model — this is a story about family secrets and class conflict.

Settle: The first two acts continuously confirm this model. Zhou Puyuan's authority, Fanyi's repression, Sifeng's innocence. You grow increasingly certain you understand these characters.

Unfold: The revelation of identity relationships. The character relationships you thought you understood are entirely redefined — but Cao Yu's mastery lies in revelation not being a single event. Each layer of revelation breaks the model you have just rebuilt; remainder accumulates layer upon layer. By Act III you feel: "I do not know how this story will end" — yet you also feel it must end according to some logic you cannot articulate.

Fix: The final destruction. All characters are destroyed, but what you feel is not "the story is over" but "this ending weighs far more than the beginning." Closure carries the trace of every preceding chisel.

Arthur Miller, Death of a Salesman.

In counterpoint with Thunderstorm. Entirely different cultural and theatrical tradition, but isomorphic chisel-construct cycle.

Arise: Willy Loman's daily life establishes a predictive model of the "American Dream."

Settle: His self-deception and family relationships continuously confirm this model — you grow increasingly clear about what this man is doing.

Unfold: Miller's chisel does not rely on plot twist but on temporal structure. Past and present scenes alternate; your predictive model of "reality" is broken by the disorientation of time. You are unsure whether what you see is memory, hallucination, or present event.

Fix: Willy's death. But the weight of this closure comes from all the temporal disorientation that preceded it — what you feel at the end is not merely "a man died" but "an entire narrative model of success and failure has collapsed."

The two plays' encoding systems are entirely different (Chinese family-ethics tragedy vs. American social realism), but their chisel-construct operations are isomorphic.

II. Film: Multi-Level Chisel-Construct

Theatre is essentially single-channel narrative plus performance. Film pushes channel count to the extreme.

A film simultaneously runs: narrative line (the chisel-construct cycle of plot), visual composition (the cycle of the image), editing rhythm (the cycle of time), music/sound design (the auditory cycle), performance (the body-emotion cycle).

Each line can independently run its own Arise-Settle-Unfold-Fix. But more importantly, they can chisel across levels — one line in Settle while another is in Unfold, just as in Essay II where Peking opera's singing and body work ran at different phases. Film simply has more channels available for this operation.

But film's true specificity is not merely "more channels." If it were only quantitative stacking, the difference between film and opera would be of degree, not kind.

The qualitative difference is this: film (and theatre) introduce a channel type the first three essays did not address — the narrative channel. The narrative channel carries not a sensory-level predictive model ("what is the next sound," "what is the next movement") but a conceptual/logical-level predictive model ("what will this character do," "how will this story end," "what are the rules of this world").

When the conceptual model and the sensory model are in severe misalignment — narrative tells you one thing while your eyes and ears tell you another — this is the chisel form unique to film. Essay II's cross-channel chisel occurred primarily between sensory channels (audition vs. vision). Essay IV's multi-level chisel occurs between the conceptual channel and sensory channels. This is an escalation in level, not merely in quantity.

Kubrick, 2001: A Space Odyssey: Multi-level chisel at its extreme.

This film's narrative line is extremely spare — humanity discovers the monolith, goes to Jupiter to investigate, encounters AI failure, the protagonist undergoes some transcendent experience. Looking only at plot, the predictive model stabilizes quickly.

But Kubrick performs extreme chiseling at the visual and musical levels.

The "Star Gate" sequence: narrative is nearly static (you do not know what is happening, but neither does narrative seem to advance); the visual channel completely takes over — color, light, abstract patterns arrive at a rhythm you cannot predict. Music shifts from Ligeti's micropolyphony to the reappearance of Strauss's "Blue Danube," each musical choice breaking your predictive model of "what a space film should sound like."

Narrative is in Settle (or even in suspension); vision is in Unfold; music performs another form of Unfold. Three chisel-construct cycles run at different phases simultaneously; their desynchronization produces enormous remainder — after viewing, you cannot articulate what this film is "about," but you know something is there. This remainder remains inexhaustible after more than fifty years.

Ozu Yasujirō, Tokyo Story: Chisel within extreme restraint.

The precise opposite strategy from Kubrick.

Ozu's images are almost motionless. Fixed camera, low angle, symmetrical composition. Editing is slow; scenes transition through "pillow shots" (empty images — a vase, a corridor, a patch of sky). Music is minimal. The narrative is extremely mundane — an elderly couple visits their children in Tokyo; the children are too busy to spend time with them; the couple returns home; the wife dies.

On the surface, all channels are in Settle. No visual spectacle, no plot twist, no musical climax.

Yet this film still yields something after ten viewings.

Why? Because Ozu's chisel does not occur within any single channel but between channels and your expectation. You expect a story about family estrangement to have an emotional eruption — there is none. You expect the wife's death to be presented tragically — Ozu handles it with a single pillow shot. You expect the ending to contain some reconciliation or insight — the daughter-in-law says something extremely bland, then the image cuts to an empty landscape.

Each time your narrative expectation is broken by "non-occurrence," that is a chisel. Ozu's chisel is not "what he did" but "what he did not do." He carved a hole in your predictive model — and that hole is remainder. You cannot fill it with "having understood," because the hole's shape is precisely "something should have been here but was not."

This technique of producing remainder through absence is structurally equivalent to Pina Bausch's technique of producing remainder through excessive repetition — both use an extreme (extreme absence vs. extreme repetition) to push Settle to the threshold of phase transition.

III. Pseudo-Chisel: Producing Surprise Without Changing the Model

A concept repeatedly intimated in the first three essays can now be formally defined.

Pseudo-chisel = producing surprise without changing the model.

It startles you but does not alter the weight of closure. Your predictive model is briefly perturbed, but after the perturbation the model restores itself unchanged, leaving no trace. On repeated viewing, remainder is rapidly exhausted.

More precisely: pseudo-chisel is a low-dimensional perturbation that can be immediately absorbed by the cognitive system. When genuine chisel occurs, the old model fails and you must construct a new model containing the trace of rupture — this rebuilding process produces residual information exceeding your current cognitive processing capacity, i.e., remainder. When pseudo-chisel occurs, the cognitive system absorbs the "surprise" as a known parameter appended to the original model simultaneously with closure. No rebuilding is needed, only updating a variable. On the second experience, that variable has already been absorbed; information entropy is strictly zero.

Pseudo-chisel is particularly common in film because film has so many channels available for manufacturing "surprise": a sudden cut, a jump scare in the sound design, a plot twist, a visual-effects spectacle. Each can startle you. But startling is not chiseling.

Genuine chisel, after breaking your model, requires you to rebuild — and the rebuilt model is thicker than its predecessor because it incorporates the trace of breaking. This is Fix.

Pseudo-chisel, after breaking your attention, leaves your model restored identically. No rebuilding, no new thickness. The content of Fix equals the content of Settle. Nothing has happened.

This is why many large-budget films "are exciting the first time but uninteresting the second" — they are filled with pseudo-chisel. On first viewing you do not know what the twist is, so every twist startles you. On second viewing you already know, and the portions beyond the twists (narrative construct, character depth, visual layering) provide no repeatedly discoverable remainder. Information reaches zero on the second pass.

IV. Isomorphic Comparison: Thunderstorm and Death of a Salesman

Section I already presented the chisel-construct cycles of both plays. Now the formal isomorphic comparison.

The two works' encoding systems are entirely different. Thunderstorm uses the Chinese family-ethics framework; the complexity of character relationships is built through the interweaving of blood kinship and class. Its Unfold is primarily through identity revelation — you thought you knew who these people were; you were wrong. Death of a Salesman uses the American Dream framework; its Unfold is primarily through temporal-structural disorientation — you thought you knew what was real; you were wrong.

One chisels "who people are"; the other chisels "what is real." The object of chiseling differs, but the operation is isomorphic: break the narrative predictive model, produce remainder that cannot be absorbed in a single pass, then in closure let the weight of remainder sink into the ending.

Both plays cross cycles. Thunderstorm has been continuously performed since its 1934 premiere. Death of a Salesman has been a staple of the American theatrical canon since its 1949 premiere. The reason, at the cognitive level, is the same: their remainder is inexhaustible — each viewing reveals new layers in the Unfold.

V. Heteromorphic Equivalence: 2001: A Space Odyssey and Tokyo Story

If Thunderstorm and Death of a Salesman are "different encodings performing the same operation," then Kubrick and Ozu demonstrate a further level: "entirely opposite chisel methods producing the same remainder effect."

Kubrick relies on extreme sensory overload — the "Star Gate" sequence uses a flood of vision and sound to overwhelm your predictive model. Ozu relies on extreme sensory restraint — using "non-occurrence" to carve holes in your predictive model.

One uses excess to produce remainder; the other uses absence. Methods are opposite; effects are equivalent: both films still yield something after ten viewings, because their remainder is inexhaustible.

This parallels Essay III's "Noh theater vs. Pina Bausch" in structure — Noh relies on micro-deviation; Bausch on excessive repetition; Kubrick on sensory flood; Ozu on sensory vacuum. Four entirely different methods; the same remainder effect.

Heteromorphic equivalence is not coincidence; it is evidence for the universality of the chisel-construct cycle: as long as the operation genuinely breaks the predictive model and produces inexhaustible remainder, the method of breaking does not matter.

5a. Dialogue with Aristotle

The narrative chisel-construct cycle has one predecessor with whom dialogue is necessary: Aristotle's Poetics.

Twenty-three hundred years ago, Aristotle analyzed the structure of Greek tragedy and proposed several core concepts: peripeteia — a sudden reversal of fortune; anagnorisis — a critical recognition or discovery; katharsis — the emotional purgation the audience undergoes at the tragedy's conclusion.

The correspondence between these concepts and Arise-Settle-Unfold-Fix is nearly direct: peripeteia and anagnorisis are Unfold — they break the audience's predictive model of the story's direction. Katharsis is Fix — the emotional closure following the breaking.

But Aristotle's framework has two limitations.

First, he addresses only the narrative channel. The Poetics discusses plot structure; it does not address music, visuals, bodily movement, or other channels. He can therefore explain why theatre is powerful, but cannot explain the multi-level chisel of film — why the "Star Gate" sequence of 2001: A Space Odyssey produces enormous impact even as narrative has nearly halted. In Aristotle's framework, without peripeteia in the plot there is no power. In Arise-Settle-Unfold-Fix, narrative is only one channel; chisel can occur in any channel.

Second, he has no concept of remainder. Katharsis is a one-time event — you watch the tragedy, emotion is released, done. But this cannot explain why some works can be viewed repeatedly without diminished katharsis — if katharsis is a one-time release, the second viewing should have no effect. "Remainder" explains this: good works expose, with each viewing, new unabsorbed material, so the content of Fix differs each time. Katharsis is not a one-time release but, with each closure, an integration of newly exposed remainder.

Aristotle's position: within the narrative channel he precisely described the operations of Unfold and Fix (peripeteia / anagnorisis / katharsis), but stopped at a single channel and a single experience. The present essay's position: extending Aristotle's insight to all channels (chisel is not limited to narrative) and introducing remainder to explain why good works sustain repeated experience.

As with Hanslick/Meyer in Essay I, Wagner in Essay II, and Laban in Essay III: a predecessor grasped one leg of the elephant. The present essay attempts to assemble the whole animal.

VI. Counter-Example: Exciting to Watch, Not Worth Re-Watching

Nolan's Tenet vs. Interstellar.

Same director, comparable production capability, but entirely different depth of remainder.

Interstellar: the narrative chisel-construct cycle is complete — the father-daughter relationship builds a powerful emotional construct (Arise-Settle); the black hole and time dilation break your predictive model of "rescue" (Unfold); the final "bookshelf scene" closes everything at a location you did not expect but in retrospect feel was necessary (Fix). Moreover, the music (Hans Zimmer's organ) performs precise cross-channel chisel with the narrative — musical emotional intensity often exceeds the narrative's current logic, producing phase difference between channels.

Still yields something after ten viewings. Because the emotional remainder of the father-daughter relationship and the cognitive remainder of time/space each reveal new facets with each viewing.

Tenet: technically equally accomplished, the concept of time inversion equally stunning, yet on second viewing you feel something is missing. Why?

Because Tenet's primary Unfold is conceptual — the time-inversion premise is itself the greatest breaking. On first viewing your predictive model is completely overwhelmed; you spend your entire cognitive budget on understanding "what is happening." But on second viewing, you already understand the mechanism; this conceptual breaking is no longer effective. And beneath the concept, the narrative emotional construct (the protagonist's relationship with his partner, the protagonist's motivation) is relatively thin — squeezed out by conceptual complexity.

In the language of this essay: Tenet's Unfold is single-pass — conceptual breaking is fully absorbed on the first viewing. Interstellar's Unfold is multi-level — conceptual breaking (time dilation) and emotional breaking (the price of the father-daughter relationship) operate on different channels, each with its own remainder, and the phase difference between levels produces additional remainder.

This is how pseudo-chisel and genuine chisel manifest in specific works. Tenet is not bad; its chisel is concentrated on a single level, and that level is exhausted after one pass. Interstellar's chisel is distributed across multiple levels, each with remainder that cannot be exhausted in a single pass.

A more extreme example: the degradation curve of the Marvel series.

To be fair, early Marvel had several works with relatively complete chisel-construct cycles — the first Iron Man features Tony Stark's character arc from arms dealer to hero as a genuine construct-to-chisel-to-traced-closure, and Captain America: The Winter Soldier's political-thriller structure performs genuine Unfold at the narrative level. These works bear repeated viewing because character change and narrative breaking produce real remainder.

But as the series expanded, a degradation pattern became increasingly pronounced. Later Marvel films' structure tends toward: establish the hero's initial state (Arise), confirm abilities and mission (Settle), the villain appears and creates crisis (Unfold?), the hero defeats the villain (Fix).

This appears to be a complete four steps. But why are most later Marvel films sufficient after three viewings?

Because increasingly, what occupies the Unfold position is pseudo-chisel. The villain's appearance does create "surprise," but this surprise does not change your model of the entire narrative framework — you know the hero will ultimately win, you know those who are sacrificed may return in the next installment, you know the universe's rules will not truly be broken. Your predictive model is briefly perturbed, then restores itself unchanged.

The visual level is even more so. Each battle is a visual spectacle, but between spectacles there is no chisel-construct cycle — one explosion follows another, your visual predictive model is constantly in Unfold without ever entering Fix. This is the "Unfold-Unfold-Unfold-Unfold" pure-virtuosity structure described in Essay III.

This degradation curve is itself an argument: within the same series, early works had genuine chisel, later works were increasingly dominated by pseudo-chisel, and audience willingness to re-watch declined correspondingly. Production quality did not decline — the chisel-construct cycle's completeness was sacrificed in the process of industrialization. Construct grew increasingly refined (visual effects increasingly impressive), but chisel grew increasingly scarce (your predictive model was increasingly unlikely to be genuinely broken).

VII. The Oral Tradition: Theatre's Ancestor

Before concluding, a supplementary form skipped in previous essays: the oral tradition.

The recitation of Homeric epic, Chinese pingshu storytelling, Japanese rakugo, West African griot oral narrative — these are the oldest forms of "narrative as primary channel." They are the ancestors of theatre and film.

The oral tradition's specificity lies in this: it has only two channels — voice and narrative (no stage design, no editing, no score) — yet its chisel-construct cycle is equally complete.

The Chinese pingshu storyteller's "hook" (扣子, suspense setup) is Arise and Settle — building your prediction of the story's direction. "Flipping the bundle" (翻包袱, surprise reveal) is Unfold — breaking prediction, exposing remainder. The final gathering (收束) is Fix.

Japanese rakugo is even more extreme — a single person sitting, without props, relying only on vocal variation (shifting character voices, alternating fast and slow rhythms) to run the chisel-construct cycle. It proves one thing: even with channels compressed to a minimum (only voice + narrative), the cycle can still run completely.

This is symmetrical with Essay III's argument that "dance holds even in near-silence": dance proved that removing sound leaves the cycle intact; the oral tradition proves that removing visuals leaves the cycle intact. Channels can increase or decrease; the cycle does not change.

VIII. Series Conclusion: The Universal Structure of All Temporal Arts

Four essays are complete. Return to the opening question.

Why do some melodies still have something after a hundred listens? Why do some plays still reveal new things after ten viewings? Why does some dance make you feel "the ending weighs more than the beginning"? Why can some films be watched repeatedly without exhaustion?

The answer is the same:

Enduring works = four steps complete + remainder real and inexhaustible.

This criterion crosses:

Sensory channels — applies to audition (music), kinesthesia (dance), vision + narrative (film)
Cultural traditions — applies to Beethoven and to K-pop, to Peking opera and to street dance
Historical periods — applies to Bach and to Kendrick Lamar, to Noh theater and to Kubrick
Media complexity — applies to single-channel (pure instrumental music) and to full-media (film)

Because it is not the product of any specific encoding system. It is the basic operation of the human cognitive system: model → confirm → model broken → rebuild carrying trace.

Music theory is this operation's encoding in sound. Dance vocabulary is this operation's encoding in the body. Narrative structure is this operation's encoding in story. Each encoding system is sophisticated and unique, but what they encode is the same thing.

Arise-Settle-Unfold-Fix is taken directly from the Life Cycle Table of the Self-as-an-End framework. Music moves you, dance holds your breath, film keeps you motionless long after the credits — all because they replicate, in their respective channels, the rhythm of life itself: selection, construction, being chiseled open, re-closure.

This series does not discuss music theory. It discusses the rationale behind music theory.

It does not discuss dance technique. It discusses the rationale behind technique.

It does not discuss narrative theory. It discusses the rationale behind narrative theory.

The rationale is one.


One final point: a prediction that follows naturally from the framework. Temporal arts have no ceiling, no terminus.

The reason is not the platitude "human creativity is infinite." The reason is that the structure of the chisel-construct cycle itself guarantees it cannot terminate.

Any form, once familiar to an audience, becomes construct. Once construct stabilizes, it can be chiseled. Chiseling exposes remainder. Remainder drives the formation of new construct. New construct is chiseled again.

Sonata form was construct; Schoenberg chiseled it. Twelve-tone technique became new construct; Cage chiseled it. 4'33" became construct; the next person will chisel it. Peking opera's modal framework was construct; Mei Lanfang performed micro-chisel within it. Mei Lanfang's style itself became construct; the next generation will chisel it. Classical ballet was construct; Pina Bausch chiseled it. Bausch's repetition technique became construct; the next generation of choreographers will chisel it.

There is no terminus. Because remainder — the core concept running throughout the Self-as-an-End framework — can never be fully absorbed. Every closure carries inexhaustible residue, and that residue is the starting point of the next chisel.

This is not optimism. It is structure.