Phantom Stabilizers: Designing Wargames That Remove the Assumptions Keeping Your Model From Breaking
E. SokolovMost wargames are secretly load-bearing. Not the rules, the assumptions underneath them. Someone decided early in the design process that oil markets remain liquid, that satellite uplinks hold, that adversary logistics degrade at historical rates. Nobody wrote these down. They became load-bearing walls, invisible until you try to move them.
Photo by Matias Luge on Pexels.
Call them phantom stabilizers: the background conditions a simulation treats as permanent that the real world might revoke without notice.
Here's the problem. A well-designed wargame can surface all manner of tactical surprise, political miscalculation, and intelligence failure, and still miss a catastrophic outcome because phantom stabilizers are holding the game world together. Players experience stress within a system that cannot actually collapse. When the system collapses in reality, no one practiced for it.
Where Phantom Stabilizers Hide
They cluster in three places.
Infrastructure baselines. Power grids, internet routing, GPS availability, port throughput, these usually run in the background as fixed values. RAND's 1964 SAFE wargame assumed commercial communications would absorb military overflow. That assumption was fine until the Cold War scenarios started loading it. Real infrastructure assumptions need a stress budget, not a steady-state value.
Adversary rationality floors. Designers routinely assume adversaries stay above some minimum threshold of coherent decision-making. But regimes under terminal pressure, insurgent cells facing annihilation, or markets in a liquidity crisis can fall below that floor fast. The 2002 Millennium Challenge exposed one version of this: a Red force player behaved in ways Blue's models classified as irrational, which is a polite way of saying the model had a rationality phantom stabilizer and Van Riper found it.
Procedural adhesion. Wargames assume players follow their own doctrinal procedures, reporting up chains of command, waiting for authorization, running down checklists. Remove that assumption and you get fratricide, unauthorized escalation, and cascading command confusion. Most games never test what happens when tired, overwhelmed players start cutting corners on their own procedures.
How to Hunt Them
Before the game runs, run a stabilizer audit. This is a structured pre-mortem aimed specifically at conditions, not events.
For each major system in your simulation, logistics, communications, financial settlement, command authority, ask the design team: what would have to stay true for this system to function as modeled? Write those conditions down. Then ask: has any of these ever been false in history, even briefly?
You'll find them fast. The audit usually surfaces eight to twelve phantom stabilizers in a medium-complexity game within two hours.
graph TD
A[Identify major systems] --> B{Run stabilizer audit}
B --> C[List background conditions per system]
C --> D{Historical falsifiability check}
D --> E[/Confirmed phantom stabilizer/]
D --> F[Defensible assumption, document it]
E --> G((Schedule removal scenario))
Once you have the list, don't remove all stabilizers simultaneously. That produces chaos with no diagnostic value, players can't learn what caused what. Pull one stabilizer per scenario iteration and observe where the game world strains first.
Designing the Removal
The removal has to be diegetic, it needs to feel like something that happened, not like a facilitator arbitrary adjustment. A few techniques that work:
Slow degradation injects. Rather than announcing "GPS is now unavailable," feed players a trickle of anomalous position reports over several turns. Let them rationalize it. Most players will. Watch how long rationalization persists after the signal has become undeniable.
Third-party actor triggers. Have a neutral or background actor, a financial institution, a port authority, a weather service, send an out-of-band message that quietly signals the stabilizer is failing. Players who are watching peripheral channels will catch it; players tunneled on their primary objectives won't. That gap in awareness is itself a finding.
Retrospective reveals. Run the scenario through completion, then show players a timeline of when the stabilizer actually failed. Ask them to mark when they believe they noticed. The distance between those two marks is your institutional lag estimate, and it's almost always larger than anyone guesses.
What You're Actually Measuring
Removing a phantom stabilizer isn't about chaos for its own sake. You're measuring three things: detection lag (how long before anyone notices the background condition has changed), adaptation rate (how fast players revise plans once they do notice), and brittleness concentration (which downstream decisions were load-bearing on the failed assumption).
That last one matters most. Brittleness concentration tells you which of your organization's real-world decisions are secretly depending on conditions you've never explicitly verified. That's not a wargame finding, that's an institutional vulnerability. The game just found it first.
Which is the job.
Get Uncertainty Game in your inbox
New posts delivered directly. No spam.
No spam. Unsubscribe anytime.