Sonic Grading, Part 3: The Temp Track Illusion
This is Part 3 of the Sonic Grading series on temporal and spatial placement. Read Part 1 | Read Part 2
Here's where it gets interesting.
Remember that temp track I mentioned in Part 1? The recording of Orff's Carmina Burana that was establishing this heavy, oppressive, organ-driven liturgical vibe?
Start pulling the thread on that specific recording — the 1974 CBS Masterworks version by Michael Tilson Thomas and the Cleveland Orchestra — and the three-variable model unravels a massive plot twist at every layer.
1. The Composer: Orff's original 1936 score calls for full orchestra, choir, and two pianos as a featured component. The text is medieval Latin and Middle High German — a springtime fertility poem from a 13th-century manuscript. Nothing liturgical. Nothing dark. A reduced version for two pianos and percussion was later authorized by Orff in 1956, to make the work accessible to smaller ensembles. No version calls for organ. Somewhere along the line, however, productions began introducing sustained, pipe-like frequencies and heavy liturgical weight, transforming the emotional identity of the piece entirely.
2. The Performers and the Vision: MTT's interpretation was explicitly not a straight reading. Critics described it as "Orff seen through the lens of Stravinsky's Les Noces" — mechanical, obsessive, rhythmically driven. Unusual and extreme by critical consensus of the time. This was not Cleveland playing Orff. This was MTT's argument about Orff, captured on tape.
3. The Technology — and the Real Revelation: Producer Andrew Kazdin — Glenn Gould's longtime collaborator and a pioneer of multi-channel orchestral recording — designed this session explicitly as a quadraphonic spatial showcase. His stated goal was to present "a different placement of each individual number" across the sound field. The spatial weight, the sense of vast liturgical space, the immersive surround of choir — that wasn't a room. It was an engineered construct, built in a studio and designed for a speaker format most people no longer own.
What I was actually hearing — streaming in stereo in 2026 — was a collapsed, two-channel downmix of a quadraphonic spatial experiment from 1974. I was listening to an orchestra interpreting a 1936 choral cantata set to a 13th-century springtime poem, cut against a violent ritual scene.
And here's the honest admission: I wasn't hearing what I thought I was hearing.
In Part 2, I detailed how I was aggressively restricting my frequencies (cutting high-end air, rolling off sub-bass) to emulate a weathered, narrow 1960s acoustic capture. But the Kazdin recording wasn't weathered or restricted at all. It was the exact opposite: a cutting-edge, high-fidelity 1974 studio production designed to sound impossibly wide.
When that massive four-channel mix was collapsed into stereo decades later, it created a dense, immersive wash that I misinterpreted as the natural, heavy reverb of a massive physical space. Furthermore, there was no "dusty chapel" involved in any of these recordings. That space was an illusion I imposed on the temp track based on the liturgical visuals I was watching, the feedback about period correctness, and my own compositional intent already forming.
The three-variable model wasn't just operating on the director — it was operating on me. What I perceived as the sonic identity of the temp was partly the recording, and partly everything I brought to the listening.
Which is exactly the point.
The Meaning of a Piece Is Not Fixed at Composition — It Accumulates
Orff didn't write a liturgical piece. He didn't write for organ. He didn't write for a stone chapel, a violent scene, or a 21st-century streaming platform.
And yet something in that compound — the springtime text, MTT's mechanical vision, Kazdin's spatial engineering, a music editor's cut, a director's instinct, a composer's interpretation of feedback — landed against a dramatic scene and felt true.
None of those people were wrong. Each transformation added something the original couldn't have contained.
The meaning of a piece of music is not fixed at composition. It accumulates.
The temp track isn't a musical reference. It's a compound artifact — carrying the weight of every decision, accident, and transformation between the composer's first idea and the picture editor's timeline. When a director says "I want it to feel like this," they're responding to all of it at once, and they can't separate what came from where.
That uncertainty isn't a problem to solve. It's the starting point. Asking which of those three variables the director is actually responding to — that's the composer's job. You may need to honor one, translate another, and deliberately replace the third.
The IR chapel, the mic ratio work, the library blending, and the spectral shaping — none of that was recreating the temp. It was finding the sound the picture actually needed, independent of what the temp happened to be. The temp pointed a direction. The score finds the destination.
Orff likely couldn't have imagined how the seed of his idea would transform over time, or how its impact would eventually be shaped by something as unintended as the two-channel compression of a 1974 recording. He planted the seed, but a long line of performers, engineers, and conductors have been its stewards along the way. Even though my writing is a distant branch of that original tree, I've realized that for this film, I own that branch. I'm the current steward of that idea.