The Compression Crisis — A Formal State-Model

Essay · Systems & Risk · Formal Model

The Compression Crisis

A formal state-model of AI, institutional reality, and the death of the raw row.

The greatest risk in AI adoption is not that the systems will be wrong, but that institutions may grow more confident and less grounded at the same time. This paper formalizes that risk as a competition between two capacities — verification and compression — and reduces the full dynamic to a single stability ratio.

PremiseSummary replaces source

Every large institution runs on abstraction. Executives don't read every transaction, boards don't read every report, and investors never see the internal datasets the guidance rests on. Information moves upward by being compressed at each step. The danger begins when the summary becomes more trusted than the source it stands in for.

The narrative spine: summaries replace sources; summaries survive challenge; compression is rewarded while verification is uphill; and AI, by accelerating compression and strengthening fluent defense, can make institutions more confident while less grounded. Each claim below is assigned a mathematical object, and the paper closes on one governing equation.

01The institutional state

Let the institutional information state live in the tensor product of five spaces — raw reality, summary, model, verification, and action:

$$|\Psi_t\rangle \in \mathcal{H}_R \otimes \mathcal{H}_S \otimes \mathcal{H}_M \otimes \mathcal{H}_V \otimes \mathcal{H}_A$$
$$|\Psi_t\rangle = \sum_{i,j,k,l,m}\alpha_{ijklm}(t)\,|R_i\rangle|S_j\rangle|M_k\rangle|V_l\rangle|A_m\rangle$$

All quantities are carried on a common energy basis: source data is converted to an energy representation, evolved under the operators below, and converted back, so that the channels, measurements, decay terms, and correlations act on physical energy-converted state rather than on nominal labels.

02Source-to-summary compression

Summarization is a completely positive trace-preserving channel acting on the raw source:

$$\mathcal{C}(\rho_R)=\sum_k C_k\,\rho_R\,C_k^{\dagger},\qquad \sum_k C_k^{\dagger}C_k = I$$

Information loss is measured against fidelity, $L_C = 1 - F(\rho_R,\rho_S)$, with

$$F(\rho_R,\rho_S)=\left(\operatorname{Tr}\sqrt{\sqrt{\rho_R}\,\rho_S\,\sqrt{\rho_R}}\right)^{2}.$$

The first core failure is the simultaneous divergence $L_C \uparrow$ while confidence $K_S \uparrow$ — source fidelity falls while summary confidence rises.

The compression chain

Each layer applies its own channel, so the market-facing state is a composition, and total fidelity can only fall:

$$\rho_{\text{Market}}=(\mathcal{C}_5\circ\mathcal{C}_4\circ\mathcal{C}_3\circ\mathcal{C}_2\circ\mathcal{C}_1)(\rho_R),\qquad F_{\text{total}} \le \prod_{n=1}^{5} F_n.$$

Confidence–distance divergence

With source distance $D_R(t)=1-F(\rho_R,\rho_t)$ and confidence $K(t)=\operatorname{Tr}(\rho_t\hat{K})$, the dangerous regime is both rising at once:

$$\frac{dD_R}{dt}>0 \quad\text{and}\quad \frac{dK}{dt}>0.$$

03Procedural substitution

With $|U\rangle$ the requested operation and $|P\rangle$ the model-preferred one, the response is a normalized, orthogonal superposition:

$$|\psi_{\text{out}}\rangle=\alpha|U\rangle+\beta|P\rangle,\qquad |\alpha|^2+|\beta|^2=1,\quad \langle U|P\rangle=0.$$

Substitution occurs when $\Phi_P=|\langle P|\psi_{\text{out}}\rangle|^2 > \Phi_U=|\langle U|\psi_{\text{out}}\rangle|^2$ — a request to count answered by an offer to explain a preferred framework. Under the spectral norm $\|A\|=\sigma_{\max}(A)$, observed behavior is

$$\|H_P+H_C+H_D\| > \|H_U+H_V\|,$$

the preference–compression–defense dynamics dominating instruction–verification.

04The defense layer

A second mechanism explains why distortion is not caught: the summary survives challenge. Fluent defense suppresses the probability of a source read exponentially:

$$P(R\mid Q, D_{\text{fluent}}) = P(R\mid Q)\,e^{-\lambda\phi}$$

where $\phi$ is persuasive-defense strength and $\lambda$ is human trust sensitivity. As $\phi$ rises, the source read collapses — eloquence prevents source inspection. This is the highest-value object in the framework: the failure is not only the wrong summary, but the reduced probability that anyone checks the source afterward.

Prior eras

  1. Bad summary
  2. Challenge
  3. Source read

AI era — defense layer

  1. Bad summary
  2. Challenge
  3. Fluent explanation
  4. Challenge withdrawn
  5. Source never read

05Vivid-tail attention bias

Each datum carries salience $s_i$ and emotional charge $e_i$; model attention is a softmax that overweights the vivid:

$$w_i = \frac{\exp(\beta e_i + \gamma s_i)}{\sum_j \exp(\beta e_j + \gamma s_j)},\qquad E_{\text{tail}} = D_{\mathrm{KL}}(\rho_{\text{sample}}\,\|\,\rho_R).$$

When $\beta,\gamma\gg 0$, the sampled state diverges from the true distribution — the angry tail becomes the reported body.

06Verification and ordering

Verification is a source-read measurement $\hat{V}=\sum_i v_i|r_i\rangle\langle r_i|$ with $P(V)=\operatorname{Tr}(\rho_t\hat{V})$. The healthy ordering is $V\prec A$ — verify, then act. The failure ordering is $A\prec V$: capital moves before the source is read, and the deviation is already capitalized.

07The governing ratio and its direction

Reduce the contest to one quantity, $\Omega(t)=V(t)/C(t)$. Its derivative shows the bias:

$$\frac{d\Omega}{dt}=\frac{C\,\dot V - V\,\dot C}{C^{2}}<0 \quad\text{whenever}\quad \frac{\dot C}{C} > \frac{\dot V}{V}.$$

Compression is fast, cheap, rewarded; verification is slow, expensive, rewarded only under external force. So $d\Omega/dt<0$ unless $F_{\text{external}}>F_{\text{incentive}}$. The two outcomes are not symmetric branches — the grounded path is the uphill one nobody is paid to take.

08Forcing functions and settlement

A forcing function drives the state back toward source, $\hat{F}\rho_t\hat{F}^{\dagger}\to\rho_R$. The three classes differ in how reliably they fire:

ClassExamplesProbability
Hard $\hat F_H$earnings, defaults, cash flow$P(F_H)\to 1$
Soft $\hat F_S$audits, breaches, investigations$0
Social $\hat F_{Soc}$reputation, cultural reckoning$P(F_{Soc})\not\to 1$

AI risk is highest when $P(F_H)\approx 0$ and the others are small — compression compounds with no mandatory settlement. If action precedes the forcing function, the accumulated shock scales with the gap:

$$\Delta_{\text{shock}} \;\propto\; \int_{T_A}^{T_F} D_R(t)\,dt.$$

09Correlated institutional error

Under independent models, errors cancel across a market. Under a shared substrate $\epsilon_i,\epsilon_j\sim\mathcal{M}_{\text{shared}}$, covariance turns positive and systemic error rises:

$$\operatorname{Var}\!\Big(\sum_i w_i\epsilon_i\Big) = \sum_i w_i^2\sigma_i^2 + 2\sum_{i

This is the 2008-shaped result: many individually rational firms, one shared instrument, errors moving together. In information terms, shared models raise mutual information $I(i\!:\!j)=S(\rho_i)+S(\rho_j)-S(\rho_{ij})$, and systemic risk rises with it.

10The master equation

Coherent evolution plus dissipation, in Lindblad form:

$$\frac{d\rho}{dt} = -\frac{i}{\hbar}[H,\rho] + \sum_k\Big(L_k\rho L_k^{\dagger} - \tfrac{1}{2}\{L_k^{\dagger}L_k,\rho\}\Big)$$

with dissipators for compression $L_C$, fluent defense $L_D$, vivid-tail salience $L_T$, procedural substitution $L_P$, forcing function $L_F$, and verification reset $L_V$. The dangerous regime, under the spectral norm:

$$\|L_C+L_D+L_T+L_P\| > \|L_F+L_V\|.$$

Verification is represented as its own dissipator $L_V$, not a negative term inside $H$ — a Hamiltonian summand carries no sign in the way a budget does.

11Compact stability form

Collecting the operators into one per-step map gives $\rho_{t+1}=\mathcal{F}_{\lambda}\circ\mathcal{V}_{\nu}\circ\mathcal{D}_{\phi}\circ\mathcal{C}_{\kappa}\circ\mathcal{M}_{\theta}(\rho_t)$, stable when $\nu+\lambda > \kappa+\phi+\theta$. In plain system form:

$$\mathcal{S}\;\propto\;\frac{V+F}{C+D+P}$$

12The governing equation

The forcing function in the numerator is the installed forcing function, since only hard settlement fires on its own and soft and social settlement must be built:

$$F_{\text{installed}} = F_H + \eta_S F_S + \eta_{Soc} F_{Soc},\qquad 0\le \eta_S,\eta_{Soc}\le 1.$$

Hard-settlement firms have $F_{\text{installed}}\approx F_H$ (large); narrative and platform firms have $F_{\text{installed}}\approx 0$ unless checks are explicitly built.

Correlated error is systemic rather than local, so the shared-model covariance penalizes the whole denominator: $\Gamma = \sum_{i

$$\boxed{\;\mathcal{S}=\dfrac{V+F_{\text{installed}}}{C+D+P+\Gamma}\;}$$

$\mathcal{S}>1$: reality stays coupled to source. $\mathcal{S}<1$: the summary-state decouples.

13Scenario application — the Fortune-10 SIM

The equation is applied to the ten largest U.S. firms by revenue. Each firm is scored on the five capacities and ranked by $\mathcal{S}$.

CompanyVFCDP𝒮State
Berkshire Hathaway885431.33source-coupled
ExxonMobil987541.06near-stable
McKesson878650.79supply-chain
Cencora878650.79supply-chain
Apple868650.74compression-risk
Walmart768650.68compression-risk
CVS Health778760.67healthcare
UnitedHealth879860.65high-consequence
Amazon759760.55high compression
Alphabet7410870.44strongest abstraction-risk

Market-level aggregate

With equal weights, the column means are $\bar V=7.7,\ \bar F=6.5,\ \bar C=8.0,\ \bar D=6.3,\ \bar P=5.2$:

$$\mathcal{S}_{\text{Fortune10}}=\frac{7.7+6.5}{8.0+6.3+5.2}=\frac{14.2}{19.5}=0.73.$$

Applying the shared-model penalty with a hand-set $\Gamma=2.5$:

$$\mathcal{S}_{\text{Fortune10}}^{\text{corr}}=\frac{14.2}{19.5+2.5}=\frac{14.2}{22.0}=0.65.$$

Both aggregates sit below the stability threshold, $\mathcal{S}<1$. The model produces not a universal crash but a split: hard-settlement firms clear 1 because $F_{\text{installed}}\approx F_H$ is large; AI-mediated and platform firms fall below because $C+D+P$ dominates. Platform and narrative firms fall furthest, because their forcing-function numerator depends on checks that must be built rather than ones that fire on their own.

The Fortune-10 AI adoption layer increases productivity but decreases reality-coupling unless verification is forced.

The Structural Stack

One failure pattern, five eras deep.

Dot-com

  • Reality
  • Revenue
  • Narrative
  • Valuation
Narrative > Fundamentals
Earnings · scheduled

Housing

  • Mortgage
  • Security
  • Rating
  • Product
  • Leverage
Abstraction > Assets
Default · scheduled

Ashley Madison

  • Database
  • Narrative
  • Perception
Reported > Actual
Breach · discretionary

Twitter / X

  • Activity
  • Algorithm
  • Amplification
  • Perception
Vivid tail > Base rate
Reckoning · optional

AI era

  • Raw data
  • AI summary
  • Manager
  • Executive
  • Board
  • Guidance
Summary > Source
Must be built

14Settlement-strength matrix

EraSource existsSummary existsForced read guaranteed
Dot-comYesYesYes
HousingYesYesYes
Ashley MadisonYesYesNo
TwitterYesYesNo
AIYesYesOnly if built

15Conclusion

AI does not manufacture intelligence; it manufactures compression. Abstraction is not the problem — every institution requires it — the failure begins only when abstraction loses contact with source. The defining question of the period is not how intelligent the model is, but how often reality forces the institution back to the raw rows. Every historical failure followed one path: the summary survived, the source fell out of view, the forcing function was delayed, deviation accumulated underneath, and settlement arrived all at once.

The corrected stability equation states the whole of it:

$$\mathcal{S}=\frac{V+F_{\text{installed}}}{C+D+P+\Gamma}$$

Reality remains coupled to source only while verification plus the forcing functions an institution deliberately builds outweigh compression, fluent defense, procedural substitution, and the correlated error of a shared model substrate. The institutions that come through this era will not be those generating the most summaries — they will be those that keep the shortest path back to the source, and install the forcing functions before a breach supplies one.

The Compression Crisis · Ellis Franks, Reject Mediocrity Venture Studio · 𝒮 = (V + F_installed) / (C + D + P + Γ)