Model Map¶
How the project's components fit together — what's built, what's next, what's deferred, and how data flows between them.
1. Full model stack¶
```mermaid
flowchart TD
    H[Historical Baseline<br/>Epoch model data<br/>BUILT] --> C[Comparison / Calibration Target]
    S[Supply Capacity<br/>chips + power + data centers + capex + utilization<br/>BUILT] --> A[Compute Allocation<br/>BUILT]
    C --> A
    A --> E[Effective Compute<br/>FUTURE]
    E --> K[Capability Mapping<br/>FUTURE]
    K --> P[Probabilistic Projections<br/>FUTURE]
    P --> F[AI Economy Feedback<br/>FUTURE]
```
Three boxes are built today: the historical baseline, the supply-capacity model, and the allocation layer (which emits `largest_frontier_run_flop_by_year`, the bridging quantity). The next layer is effective compute, which adjusts raw frontier-run FLOP for algorithmic-efficiency gains. Everything beyond effective compute is deferred.
2. Built vs next vs future¶
| Component | Status | Run command | Output entry point |
|---|---|---|---|
| Historical baseline | ✓ built | `uv run historical` | `outputs/tables/historical_trend_estimates.csv` |
| Supply capacity model | ✓ built | `uv run supply` | `outputs/tables/supply_fundamental_inputs_by_year.csv` |
| Compute allocation | ✓ built | `uv run allocation` | `outputs/tables/allocation_largest_frontier_run.csv` |
| Review layer (DuckDB + Excel) | ✓ built | `uv run database` / `uv run workbook` | `outputs/database/ai_economy.duckdb`, `outputs/workbooks/ai_economy_model_review.xlsx` |
| Scenario explorer (Streamlit) | ✓ built | `uv run demo` | live app at `localhost:8501` |
| Effective compute | ✗ next | — | — |
| Capability mapping | ✗ future | — | — |
| Probabilistic projections | ✗ future | — | — |
| Economy feedback | ✗ future | — | — |
For per-component inputs/outputs/contracts see `component_contracts.md`.
For per-output-file interpretation see `output_guide.md`.
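The status table above can be spot-checked programmatically. A minimal sketch (not project code) that maps each built pipeline's entry-point table to whether it exists on disk:

```python
from pathlib import Path

# Entry-point tables from the status table above, relative to the repo root.
ENTRY_POINTS = [
    "outputs/tables/historical_trend_estimates.csv",
    "outputs/tables/supply_fundamental_inputs_by_year.csv",
    "outputs/tables/allocation_largest_frontier_run.csv",
]

def check_outputs(root: str = ".") -> dict[str, bool]:
    """Map each expected output table to whether it exists under root."""
    return {p: (Path(root) / p).is_file() for p in ENTRY_POINTS}
```

Running `check_outputs()` after `uv run historical`, `uv run supply`, and `uv run allocation` should report all three as present.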
3. Data flow¶
The forward causal chain on its own (excluding the historical-baseline calibration arrow):
```mermaid
flowchart LR
    I[Physical Inputs<br/>chips, power, data centers, capex] --> S[Usable AI Compute]
    S --> A[Compute Allocation]
    A --> T[Largest Frontier Training Run]
    T --> E[Effective Compute]
    E --> C[Capabilities]
    C --> R[Revenue / Reinvestment Feedback]
```
Read this as: physical and financial inputs → total annual usable compute → allocated across uses → the largest single training run → adjusted for algorithmic improvements → mapped to capabilities → economic feedback that flows back into capex on the input side.
The built components sit at the left of this chain: the supply model produces usable AI compute, and the allocation layer carries it through the "largest frontier training run" node. Effective compute is the next box to the right; everything past the largest-frontier-run node is currently a placeholder.
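The chain can be sketched numerically. A toy rendering, assuming the base-scenario 45.7%/yr supply CAGR quoted elsewhere on this page, plus an illustrative 2024 base and a made-up fixed frontier-run share (the real model reads such parameters from `data/assumptions/` YAML):

```python
def usable_compute(year: int, base_2024: float = 4.0e28, growth: float = 1.457) -> float:
    """Total usable AI compute (FLOP/yr): a base value compounded at a CAGR.

    The 4.0e28 base is back-solved from the ~1.65e31 FLOP/yr 2040 figure
    quoted on this page; treat it as illustrative, not a model output.
    """
    return base_2024 * growth ** (year - 2024)

def largest_run(year: int, frontier_share: float = 0.10) -> float:
    """Largest single frontier training run, as a (made-up) fixed share."""
    return usable_compute(year) * frontier_share
```

The real allocation layer replaces the fixed `frontier_share` with a year-by-year split across six buckets, but the chain shape is the same: inputs → total supply → allocation → largest run.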
4. Historical baseline vs forward model¶
The single most important conceptual point in this repo.
| | Historical baseline | Supply capacity model |
|---|---|---|
| Asks | What did the largest single frontier training runs do historically? | How much total AI compute can exist per year going forward? |
| Time horizon | 1950–2026 | 2024–2040 |
| Quantity | FLOP per single training run | FLOP per year, total global usable AI compute |
| Source | Epoch's "Notable AI Models" dataset | Sourced + synthesized assumptions on chips, power, data centers, capex, utilization |
| Headline number | 5.97× per year, Rule A 2018+ frontier-run trend | 45.7%/yr CAGR (base scenario), 1.65e+31 FLOP/yr by 2040 |
| Role in the model | Calibration / comparison target | Forward causal model |
These are different quantities. Comparing them directly — e.g. asking "why is the supply model's growth rate slower than the historical trend?" — is the most common reading mistake. The historical trend tracks one training run's compute, which is a share of total usable compute. The share has grown over time: in 2018 a frontier run was a tiny fraction of global AI compute; in 2024 it's a much larger fraction. So a single trend can outpace total-supply growth for a while without contradiction.
Historical baseline = single frontier training runs. Supply capacity model = all global usable AI compute. They are not directly comparable as forecasts.
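The share arithmetic behind "for a while without contradiction" can be made concrete. A sketch using the two headline rates from the table above (the 5% starting share is invented for illustration):

```python
SUPPLY_GROWTH = 1.457   # 45.7%/yr base-scenario supply CAGR
RUN_GROWTH = 5.97       # Rule A 2018+ frontier-run factor, per year

# For the single-run trend to continue while total supply grows at
# 45.7%/yr, the frontier run's share of supply must itself grow by:
implied_share_growth = RUN_GROWTH / SUPPLY_GROWTH   # roughly 4.1x per year

# Starting from a hypothetical 5% share, the share passes 100% within a
# few years, so the two trends cannot both hold for long.
share, years = 0.05, 0
while share <= 1.0:
    share *= implied_share_growth
    years += 1
```

This is the precise sense in which a single-run trend can temporarily outpace total-supply growth: only until the share saturates at 100%.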
5. The allocation layer (now built)¶
The allocation layer splits annual usable compute across:
- Training (further decomposed into largest single frontier run, other frontier-lab training, and non-frontier training)
- Inference (production serving)
- AI R&D experiments (ablations, scaling-law sweeps, post-training research)
- Post-training (RLHF, fine-tuning passes)
- Safety / evals
- Reserved / idle / fragmented
Run via `uv run allocation`; full memo at `allocation_findings.md`.
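The bucket split itself is proportional arithmetic. A hedged sketch (bucket names follow the list above; the share values are invented placeholders, not the model's calibrated parameters):

```python
def allocate(total_flop: float, shares: dict[str, float]) -> dict[str, float]:
    """Split one year's usable compute across buckets, normalizing shares."""
    norm = sum(shares.values())
    return {bucket: total_flop * s / norm for bucket, s in shares.items()}

# Placeholder shares summing to 1.0 (illustrative only):
SHARES = {
    "training": 0.35,
    "inference": 0.35,
    "rnd_experiments": 0.12,
    "post_training": 0.08,
    "safety_evals": 0.04,
    "reserved_idle": 0.06,
}
split = allocate(1e28, SHARES)
```

The real engine additionally decomposes the training bucket into largest single frontier run, other frontier-lab training, and non-frontier training.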
The headline output is `largest_frontier_run_flop_by_year`, the bridge between the supply model's total-annual-compute trend and the historical baseline's single-frontier-run trend. The allocation pipeline also produces `allocation_vs_historical_trend.csv`, quantifying the year-by-year gap between projections and the historical extrapolation.
Headline finding: under base supply × base allocation, the largest frontier run grows at 27.6%/yr CAGR 2024→2040, reaching ~6.9e+28 FLOP by 2040. The historical Rule A 2018+ extrapolation of 5.97×/yr (~497% CAGR) reaches ~1e+37 FLOP by 2040, a gap of roughly eight orders of magnitude that no combination of supply and allocation assumptions reproduces. Three honest readings all probably contribute: the historical trend was already slowing, the allocation parameters may be conservative, and supply fundamentals may genuinely cap single-run growth. The effective-compute layer (next) may close some of the gap by adjusting for algorithmic-efficiency gains.
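The compounding behind the gap is easy to verify with the stated rates alone (the absolute FLOP endpoints come from the model, not from this sketch):

```python
import math

YEARS = 2040 - 2024              # 16-year horizon

alloc_factor = 1.276 ** YEARS    # 27.6%/yr CAGR: roughly 49x growth
hist_factor = 5.97 ** YEARS      # 5.97x/yr: roughly 2.6e12x growth

# Gap between the two growth factors, in orders of magnitude:
gap_oom = math.log10(hist_factor / alloc_factor)
```

Note this growth-factor gap is larger than the endpoint gap quoted above, because the two trajectories are anchored to different 2024 values (the fitted historical trend line versus the allocation model's 2024 output).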
Appendix: file-level architecture¶
```
pipelines/historical.py     → Builds historical-baseline outputs
pipelines/supply.py         → Builds supply-capacity outputs
pipelines/allocation.py     → Builds allocation outputs (depends on supply)
(next: pipelines/effective_compute.py)

model/                      → Reusable engine code
  runtime.py                (shared paths, colors, attribution)
  data_cleaning.py          (historical: Epoch CSV → processed schema)
  frontier_filters.py       (historical: Rules A/B/C)
  trend_fitting.py          (historical: log-linear fits)
  historical_charts.py      (historical: chart helpers)
  supply_engine.py          (supply: H100-eq stock + 4 limits + utilization)
  allocation_engine.py      (allocation: 6 buckets + training-pool decomp)
  review_database.py        (review: DuckDB build + 6 views)
  workbook_export.py        (review: 11-sheet Excel)
(next: model/effective_compute_engine.py)
```
Inputs live in `data/raw/` (immutable) and `data/assumptions/` (scenario-keyed YAML). Processed datasets land in `data/processed/`. Outputs go to `outputs/charts/` (PNG) and `outputs/tables/` (CSV).
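A hedged sketch of the directory conventions the last paragraph describes, as they might appear in `model/runtime.py` (names here are illustrative, not the module's actual API):

```python
from pathlib import Path

# Repository-root-relative directory conventions (from the docs above).
ROOT = Path(".")
RAW = ROOT / "data" / "raw"                  # immutable source data
ASSUMPTIONS = ROOT / "data" / "assumptions"  # scenario-keyed YAML
PROCESSED = ROOT / "data" / "processed"      # cleaned datasets
CHARTS = ROOT / "outputs" / "charts"         # PNG charts
TABLES = ROOT / "outputs" / "tables"         # CSV tables
```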