The Thesis Audit: What Survived Independent Scrutiny

The audit divides the thesis into its structural components and assesses each. The result is a clear line between what can be cited with confidence and what must be revised or retired.

#	Claim	Verdict	Original Value	Audit Finding	Revision Required
1	Liberty-Yield β	Confirmed	β=−0.35, R²=0.37	Reproduces exactly	None
2	Great Decoupling	Confirmed	r: 0.79 → 0.57	Confirmed, 39 capable autocracies	None
3	78% Holdout Accuracy	Confirmed	78%	Real, but +5pp over 73% baseline	Add baseline context
4	Extreme Velocity Cataloging	Confirmed	US as fastest decliner	Confirmed even at moderate estimates	None
5	Bistable Dynamics	Refuted	Two deep wells	k ≈ 0, no deep wells	Upgrade to tristable model
6	Markov Property	Refuted	Stage-only transitions	Rejected at Stages 2, 5, 6	Add path dependence
7	Shock σ = 3–8	Refuted	σ = 3–8	σ = 0.45–4.45	Replace with data-driven σ
8	62% Tyranny Probability	Refuted	P = 62%	P ≈ 0% (data-driven)	Reframe as P(L<50) post-break
9	Event Horizon at ~12%	Refuted	~12% recovery	L ≈ 52–55, recovery 3.0%	Recalibrate threshold
10	US Liberty = 48	Partial	L = 48	Mean 76.6 (range 57–84)	Use range L = 57–72
11	US Velocity −18/yr	Partial	−18/yr (2yr)	−4.2/yr (10yr std.)	Cite standardised 10yr rate
12	Treasury Premium 2,080bp	Partial	2,080bp	200–580bp (5–10yr)	Use defensible range
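
The arithmetic behind two rows of the table can be checked directly. The sketch below uses only the figures printed in the table: the holdout accuracy against its baseline (claim 3) and the original Treasury premium against the revised range (claim 12). The variable names are illustrative and do not come from the audit's own code.

```python
from collections import Counter

# Claim 3: holdout accuracy versus the naive baseline, in percent.
holdout_accuracy_pct = 78
baseline_accuracy_pct = 73
lift_pp = holdout_accuracy_pct - baseline_accuracy_pct  # percentage points

# Claim 12: original premium versus the revised 200-580bp range.
original_premium_bp = 2080
revised_low_bp, revised_high_bp = 200, 580
overstatement_low = original_premium_bp / revised_high_bp   # against top of range
overstatement_high = original_premium_bp / revised_low_bp   # against bottom of range

# Verdict column of the table, rows 1 to 12 in order.
verdicts = ["Confirmed"] * 4 + ["Refuted"] * 5 + ["Partial"] * 3
tally = Counter(verdicts)

print(f"Holdout lift over baseline: +{lift_pp}pp")
print(f"Premium overstated by {overstatement_low:.1f}x to {overstatement_high:.1f}x")
print(dict(tally))
```

The ratio works out to roughly 3.6x to 10.4x, so the original premium figure sits well outside the revised range at either end.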

Survives Audit

The dataset: 91 countries, 225 years, 1,656 observations. A genuine contribution to the field. The audit detected no errors in data construction.
Liberty-Yield relationship: β = −0.35, R² = 0.37. Reproduces exactly. Economically meaningful. The core econometric finding is sound.
The Great Decoupling: Correlation breakdown from r = 0.79 to r = 0.57. 39 capable autocracies. An original and robust finding.
Extreme velocity cataloging: Cross-country decline comparisons are valid. The US decline stands out even at conservative estimates.
Credit lag insight: 3–12 year lag between democratic erosion and sovereign credit deterioration. Novel and important for bond market analysis.
Path dependence: Direction of travel matters more than current position. Validated by the Markov rejection tests.
V-Dem alignment: The thesis's directional assessment of US decline was vindicated by V-Dem's September 2025 reclassification.

Does Not Survive

Original bistable dynamics: Superseded by the tristable three-basin model. The data show no evidence of two deep wells.
62% tyranny probability: A phantom generated by inflated shock volatilities. The data-driven estimate is approximately 0%.
L = 48 as established fact: Below the credible range. The multi-index mean is 76.6. Defensible range: 57–84.
Original 2,080bp Treasury mispricing: Overstated by 3.5–10x. Defensible range: 200–580bp over 5–10 years.
Stage-based predictions: AR(1) outperforms the thesis's stage-transition model. Simple models beat complex ones here.
Stipulated shock σ values: Wrong at every stage by 2–7x. Must be replaced with data-driven estimates throughout.
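
To illustrate why inflated shock volatilities can manufacture a large threshold-crossing probability, here is a minimal Monte Carlo sketch. It is not the audit's model: it assumes a driftless random walk (in the spirit of the k ≈ 0 finding), Gaussian annual shocks, a start at the multi-index mean of 76.6, and a hypothetical cut-off of 40 that is not the thesis's definition of tyranny. The function name and its parameters are illustrative.

```python
import random


def crossing_probability(start, threshold, sigma, years, n_paths=5000, seed=0):
    """Share of simulated paths that fall below `threshold` within `years`.

    Assumption: each year adds an independent Gaussian shock with standard
    deviation `sigma` and no drift or mean reversion.
    """
    rng = random.Random(seed)
    crossed = 0
    for _ in range(n_paths):
        level = start
        for _ in range(years):
            level += rng.gauss(0.0, sigma)
            if level < threshold:
                crossed += 1
                break  # count each path at most once
    return crossed / n_paths


START = 76.6       # multi-index mean reported by the audit
THRESHOLD = 40.0   # hypothetical cut-off, chosen only for illustration

# Top of the stipulated range versus bottom of the data-driven range.
p_stipulated = crossing_probability(START, THRESHOLD, sigma=8.0, years=15)
p_data_driven = crossing_probability(START, THRESHOLD, sigma=0.45, years=15)

print(f"sigma = 8.0:  {p_stipulated:.3f}")
print(f"sigma = 0.45: {p_data_driven:.3f}")
```

Under these assumptions the only thing that changes between the two runs is σ, which isolates how strongly the crossing probability depends on the shock calibration.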

The pattern is clear. The architecture of the thesis survives: the dataset, the relationships, the directional findings, and the conceptual frameworks. What does not survive is the specific numerical calibration: the point estimates, the volatilities, and the probabilities. The thesis was more right about the world than about its own parameters.

Liberty (L)	Stage	Velocity (10yr)	Event Horizon?	Historical Reversal %	Predicted Yield Spread	Narrative
48	Stage 5–6	−4.2/yr	Below	3.0%	+1,120bp	Deep erosion. Below event horizon. Recovery extremely unlikely without external intervention.
52	Stage 5	−4.2/yr	At threshold	3.0%	+980bp	Event horizon boundary. Last realistic exit point for self-correction.
57	Stage 4	−4.2/yr	Approaching	12%	+805bp	Serious erosion. Institutional capture underway. Reversal possible but requires sustained effort.
63	Stage 3–4	−4.2/yr	Above	28%	+595bp	Democratic backsliding. Norms eroding, institutions under pressure. Comparable to Hungary 2012.
72	Stage 2–3	−4.2/yr	Above	54%	+280bp	Early-stage erosion. Press freedom declining, judicial independence under strain. Recoverable.
77	Stage 1–2	−4.2/yr	Above	72%	+175bp	Multi-index mean. Norm erosion phase. Still a functioning democracy by most measures.
84	Stage 1	−4.2/yr	Above	89%	+70bp	Top of credible range. Minor democratic stress. Comparable to France or UK.
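
The spread column can be cross-checked against the Liberty-Yield coefficient. The sketch below computes the change in predicted spread per liberty point between adjacent rows of the table. It assumes β = −0.35 is expressed in percentage points of yield per liberty point (that is, −35bp per point); the source does not state the units, so treat that reading as an assumption.

```python
# (liberty score, predicted yield spread in bp), copied from the table above.
rows = [(48, 1120), (52, 980), (57, 805), (63, 595), (72, 280), (77, 175), (84, 70)]

# Slope between each pair of adjacent rows, in bp per liberty point.
slopes = []
for (l0, s0), (l1, s1) in zip(rows, rows[1:]):
    slopes.append(((l0, l1), (s1 - s0) / (l1 - l0)))

for (l0, l1), slope in slopes:
    print(f"L {l0} -> {l1}: {slope:+.0f}bp per point")

# Assumed reading of beta: -0.35 percentage points of yield = -35bp per point.
BETA_BP_PER_POINT = -35.0
linear_segments = [pair for pair, slope in slopes if slope == BETA_BP_PER_POINT]
```

Every segment from L = 48 up to L = 72 has a slope of exactly −35bp per point, while the two segments above 72 are flatter (−21bp and −15bp per point). The table is therefore linear in L only below 72.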

Element	Original	Revised
Dynamics model	Bistable (two wells)	Tristable (three basins: democracy, hybrid, autocracy)
Shock volatility	σ = 3–8 (stipulated)	σ = 0.45–4.45 (data-driven)
US Liberty estimate	L = 48 (point estimate)	L = 57–72 (credible range)
US decline velocity	−18/yr (2-year window)	−4.2/yr (10-year standardised)
Tyranny probability	62% within 15 years	≈0% (tyranny); 69% P(L<50) post-2006 break
Treasury mispricing	2,080bp (implied)	200–580bp over 5–10 years
Event horizon	~12% recovery rate	L ≈ 52–55; recovery 3.0% (CI: 0.7–6.0%)
Transition model	Markov (stage-only)	Path-dependent (direction of travel incorporated)
Prediction baseline	Stage-transition model	AR(1) with structural breaks
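
For readers unfamiliar with the revised prediction baseline, the following is a minimal sketch of fitting an AR(1) model, x_t = c + φ·x_(t−1), by ordinary least squares using only the Python standard library. It omits the structural breaks mentioned in the table, and the function names and the demonstration series are hypothetical, not taken from the audit.

```python
def fit_ar1(series):
    """Least-squares estimates (c, phi) for x_t = c + phi * x_(t-1)."""
    lagged, current = series[:-1], series[1:]
    n = len(lagged)
    mean_lag = sum(lagged) / n
    mean_cur = sum(current) / n
    sxx = sum((x - mean_lag) ** 2 for x in lagged)
    sxy = sum((x - mean_lag) * (y - mean_cur) for x, y in zip(lagged, current))
    phi = sxy / sxx
    c = mean_cur - phi * mean_lag
    return c, phi


def forecast(c, phi, last_value, steps):
    """Iterate the fitted recursion forward `steps` periods."""
    path = []
    value = last_value
    for _ in range(steps):
        value = c + phi * value
        path.append(value)
    return path


# Hypothetical noise-free series generated with c = 2 and phi = 0.5,
# so the fit should recover those values.
demo = [10.0]
for _ in range(6):
    demo.append(2.0 + 0.5 * demo[-1])

c_hat, phi_hat = fit_ar1(demo)
next_three = forecast(c_hat, phi_hat, demo[-1], steps=3)
print(c_hat, phi_hat, next_three)
```

With real liberty scores the residuals would not be zero, and a structural break would call for fitting separate coefficients on each side of the break date.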

Phase	Tasks	Focus	Key Method
Phase 1: Reproduction	5	Reproduce headline statistics from raw data	Exact replication of β, R², correlation coefficients, holdout accuracy
Phase 2: Stress Testing	5	Test assumptions underlying the model	Markov tests, σ estimation, structural break detection, bootstrap CIs
Phase 3: Counter-Arguments	5	Test the strongest objections to the thesis	Sub-sample analysis, GDP conditioning, democratic tenure stratification
Phase 4: Recalibration	5	Produce revised estimates using audit-validated parameters	Data-driven Monte Carlo, multi-index reconciliation, recalibration table
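
Phase 2 lists bootstrap confidence intervals among its methods. As a rough illustration of the technique, the sketch below computes a percentile bootstrap interval for a recovery rate using only the standard library. The counts (6 recoveries in 200 episodes, giving 3.0%) are hypothetical and chosen only to match the reported point estimate. The audit's actual sample size is not given here, so this is not expected to reproduce its interval.

```python
import random


def bootstrap_proportion_ci(successes, n, n_boot=2000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for a proportion.

    Returns (point_estimate, lower, upper).
    """
    rng = random.Random(seed)
    sample = [1] * successes + [0] * (n - successes)
    estimates = []
    for _ in range(n_boot):
        # Resample n outcomes with replacement and record the proportion.
        resample = rng.choices(sample, k=n)
        estimates.append(sum(resample) / n)
    estimates.sort()
    lower = estimates[int((alpha / 2) * n_boot)]
    upper = estimates[int((1 - alpha / 2) * n_boot) - 1]
    return successes / n, lower, upper


# Hypothetical counts: 6 recoveries out of 200 episodes.
point, lower, upper = bootstrap_proportion_ci(successes=6, n=200)
print(f"recovery rate {point:.1%}, 95% CI {lower:.1%} to {upper:.1%}")
```

The width of the interval depends heavily on the number of episodes, which is one reason small-sample claims need wide error bars.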

The Numbers at a Glance

Why We Audited Our Own Work

1. Intellectual Honesty Over Narrative Coherence

2. Pre-Publication Stress Test

3. Commitment to the Data Over the Narrative

The Verdict

The 12 Claims: Confirmed

#1 Liberty-Yield Relationship: β = −0.35, R² = 0.37

#2 The Great Decoupling: Correlation Dropped from r = 0.79 to r = 0.57

#3 78% Holdout Prediction Accuracy

#4 Extreme Velocity Cataloging: US Decline Stands Out

The 12 Claims: Refuted

#5 Bistable Dynamics: Mean-Reversion k ≈ 0, No Deep Wells

#6 Markov Property: Rejected at Stages 2, 5, and 6

#7 Shock Volatility σ = 3–8 by Stage

#8 Tyranny Probability: Original 62% Was a Phantom

#9 Event Horizon at ~12% Recovery

The 12 Claims: Partial

#10 US Liberty Score = 48

#11 US Velocity = −18 Points/Year

#12 Treasury Reserve Currency Premium = 2,080bp

All 12 Claims at a Glance

What Survives

Survives Audit

Does Not Survive

The Recalibration Framework

Counter-Arguments That Landed

CA5: Policy Erosion vs. Structural Erosion

CA6: Mean Reversion in Long-Standing Democracies

CA7: The GDP Threshold

Audit Limitations

1. Python Standard Library Only

2. Thesis's Own Data Only

3. AR(1) Is Also a Simplification

4. Small N for US-Specific Claims

The Strongest Version of the Thesis

What Changes Going Forward

Audit Methodology Summary

Statistical Tests Employed

Data Sources
