The Half-Life of History

A personal experiment in measuring what humanity has forgotten

I watched David Reich's Big Think interview on ancient DNA1 and it rewired how I think about the past. Reich is a geneticist at Harvard who showed that entire populations left no written record at all. The Yamnaya replaced roughly 75% of Central European genetic ancestry around 4,500 years ago2. We only know because of DNA. Ghost populations existed, thrived, and vanished, detectable only as statistical traces in modern genomes3.

That raised a question I couldn't shake. If genetic signal decays exponentially with time (recombination chops ancestral blocks smaller each generation, drift erases rare lineages), does everything else decay the same way? Academic papers, biographical records, surviving sources, how often we mention a century in books. I wanted to measure it. I built this project using Claude Code, Anthropic's AI coding assistant, to collect the data, run the statistical analysis, and render the charts you see below.

The central finding: present-day attention to the past follows an exponential decay curve with a half-life of about 95 years (95% interval: 87 to 107). For every 95 years further back you look, the above-baseline portion of the composite knowledge index halves. An earlier version of this essay reported 84 years; that number came from a misspecified model and a scaling error, and the correction is explained below rather than quietly swapped in.

Five Thousand Years of Data, at a Glance

I assembled five distinct measures to capture the breadth of historical knowledge. Academic works (via OpenAlex, an open scholarly index described by Priem et al.7) counts research papers tagged to each period. Named individuals tallies Wikipedia biographical articles for people born in each century. Source proxy estimates surviving primary sources: cuneiform tablets and inscriptions for antiquity, manuscript production for the medieval period (following Buringh & Van Zanden 20094), and print title counts for the early modern and modern eras. Ngram discourse measures how frequently each century is mentioned in the Google Books English corpus6. And films, a supplementary cultural metric, counts movies set in each period.

The chart below plots all five metrics for every time period, on a logarithmic scale (use the buttons to switch to linear). Toggle individual metrics on or off by clicking the legend. The pattern is unmistakable: every single metric climbs steeply as you approach the present.

How to read this chart: Each line tracks a different type of historical evidence across 50 centuries. The vertical axis uses a logarithmic scale: each gridline represents a tenfold increase. Click any label in the legend to show or hide that metric. Use the Log/Linear buttons to switch scales.
Line chart of five historical knowledge metrics (academic works, Wikipedia biographies, surviving sources, ngram mentions, films) across 33 periods from the 3rd millennium BC to the 21st century, each climbing steeply toward the present on a log scale.
Figure 1. Five metrics of present-day attention to the past, 3rd millennium BC to the 21st century (log scale).

The 20th century dominates every measure: 1.37 million Wikipedia biographies, 156,000 academic works, and an estimated 85 million publications. The 3rd millennium BC, by contrast, has just 385 known individuals and 50,000 surviving inscriptions. Recent history doesn't just have more. It has orders of magnitude more.

Knowledge Decays Like a Radioactive Isotope

Samuel Arbesman, in The Half-Life of Facts5, proposed that knowledge has a measurable decay rate. I tested this directly. After combining four metrics into a single composite index (using z-scores to put them on a common scale), I tested seven candidate models: exponential decay with and without a baseline, a two-phase (biexponential) model, power laws with and without a baseline, logarithmic, and linear. The exponential family won decisively. The two-phase model fits best by AICc, and the single exponential with a baseline is a close second. I use the single exponential as the primary specification, an honest tradeoff: with 32 data points I would rather report the model with two fewer parameters, even though it is technically the second-best fit. Both are reported in the methodology section, and the two-phase fit turns out to be the more interesting result. I will come back to it.

Just as uranium-238 decays at a predictable rate, historical knowledge follows a smooth exponential curve. The best-fit model gives a half-life of about 95 years. That means for every 95 years further back you look, the above-baseline portion of the composite knowledge index roughly halves. The shaded band below shows the uncertainty range: the true half-life likely falls between 87 and 107 years.

Here I owe you a correction. The first version of this essay reported a half-life of 84 years. Two things were wrong with how I got that number. First, the model: I fit an exponential forced to decay all the way to zero, but the composite index is built from z-scores and its ancient tail sits on a floor near minus 0.33. A curve required to reach zero cannot follow data that never gets there, and the residuals said so loudly (the published fit's Durbin-Watson statistic was 0.17, which is textbook misspecification). Second, the scaling: the published pipeline standardized each metric using only the century-length periods, accidentally including the incomplete 21st century. Fixing the model alone moves the estimate to about 109 years. Fixing the scaling alone moves it to 78.5. Fixing both, which is the only defensible choice, gives 95 years, and the corrected notebook reproduces the old 84 exactly when I rerun it under the old specification. R-squared rises from 0.83 to 0.99 and the autocorrelation largely disappears (Durbin-Watson 1.17). One more honesty note: the old point estimate of 84 falls outside the new interval of 87 to 107. That is what fixing a misspecified model looks like; the old uncertainty range (23 to 101) came from that same misspecified model and an unstructured bootstrap dominated by a few high-leverage points, so neither the old point nor the old interval was reliable. I have kept the original number visible here because a result that changes when you fix the model is exactly the kind of thing this essay is about.

The residual plot underneath reveals which centuries defy the trend. Points far from the zero line are centuries with unusual stories to tell. Two stand out: the 1st century CE sits well above the curve (+2.45 standard residuals, driven by its biblical and Roman cultural footprint), while the 18th century sits below it (2.38), receiving less attention than its position in time would predict.

How to read this chart: Each blue dot is one century's combined knowledge score (an average of all four metrics, standardized to the same scale). The red curve is the best-fit exponential model. The shaded band shows statistical uncertainty. Below, the residual plot shows which centuries deviate most from the overall trend.
Exponential decay curve of the composite historical knowledge index versus years before present, with a fitted half-life of roughly 95 years (95% interval 87 to 107 years), a bootstrap confidence band, and a residual plot beneath.
Figure 2. Composite knowledge index with exponential fit: half-life roughly 95 years (95% interval 87 to 107 years), residuals below.

Why Roughly a Century?

A half-life of about 95 years invites an obvious question: is it real, or just an artifact of one particular dataset and method? The strongest test of any empirical claim is independent replication: a different team, using different data, arriving at the same answer.

The most striking corroboration is structural. The model that fits my data best is not the single exponential at all but a biexponential: two decay processes running at once, a fast component with a half-life of 84 years and a slow one of roughly 800 years. That is the same two-regime form Candia et al. found in 2019 for songs, films, and biographies9: a quickly fading communicative layer and a slowly fading cultural one. I did not go looking for their model. It emerged from the model comparison on a dataset spanning five millennia instead of decades.

There is also a consistency check on the fast component. Michel et al., in their 2011 Science paper that launched Google Ngrams as a research tool6, tracked the fame trajectories of individual people by measuring how quickly a person's name rises and falls in book mentions. For people born around 1865, the post-peak half-life of fame was roughly 73 years. My fast component lands at 84, within a few decades of their figure despite an entirely different method: a composite of four metrics, z-scored, spanning five millennia instead of two centuries, measuring eras instead of individuals. A skeptic would note that Google Ngrams appears in both studies, and that is fair. But in my composite, Ngrams is one of four equally weighted metrics. The other three (academic papers, Wikipedia biographies, and surviving sources) have nothing to do with book mentions.

There may be a reason the fast layer fades over decades rather than centuries. The Egyptologist Jan Assmann drew a distinction between what he called communicative memory (what gets transmitted orally across three or four generations, roughly 80 to 100 years) and cultural memory, the kind that survives only when institutions write it down and teach it8. A half-life of roughly a century sits right at that boundary. It is approximately the point where the last person who heard the story firsthand dies, and whatever was not committed to text, stone, or institutional habit begins to vanish.

So three independent anchors bracket the fast layer: Michel's 73 years, my fast component of 84, and Assmann's 80-to-100-year communicative-memory boundary. The headline figure of 95 is the single-exponential compromise between the fast layer and the slow one, which is why I gloss it as roughly a century rather than pretending to more precision than the data supports.

Historians Amplify the Decay, Not Compensate for It

The obvious objection is that of course we know more about recent history. More sources survived. The more interesting question is whether historians compensate for lost sources by studying ancient periods more intensely, or whether they make the imbalance worse.

The chart below answers this by comparing two curves: the green line tracks source availability (surviving tablets, manuscripts, and books), while the blue line tracks academic attention (OpenAlex publication counts). Where academic attention exceeds source availability, the green shading marks an era as overstudied. Where attention falls below sources, the red shading marks it as understudied.

How to read this chart: The green line tracks how many primary sources survive for each era. The blue line tracks how much academic attention that era receives. Green shading marks periods where scholars publish more than the surviving sources would predict (overstudied). Red shading marks periods where sources exceed scholarly attention (understudied).
Comparison of z-scored source availability against z-scored academic attention per century, with green shading where historians overstudy an era and red shading where surviving sources outpace scholarly attention.
Figure 3. Source availability versus academic attention by era; shading marks overstudied (green) and understudied (red) periods.

The pattern is stark. Almost every period before the 16th century is understudied (the classical 1st century, with its Roman and biblical pull, is the lone exception at +0.22). Historians give these periods less attention than their surviving source base would predict; the 12th century sits almost exactly at the crossover. From the 16th century onward, attention runs ahead of sources, and the gap widens steadily: +0.43 for the 16th century, +0.57 for the 17th, +0.93 for the 18th, peaking at +2.20 for the 19th, likely reflecting Anglophone historiographic priorities. Then comes the surprise: the 20th century is understudied (1.27). Source production exploded faster than scholarly attention could keep up. (The 21st century is excluded from this comparison: it is not yet history, and its partial data would distort the standardization.)

One important caveat: the source proxy changes measurement method across eras (inscriptions in antiquity, manuscripts in the medieval period, print titles after Gutenberg). Cross-era comparisons are therefore approximate, not exact. But the overall direction of the bias is clear regardless.

What Drove the Acceleration

When all four metrics (plus the supplementary films count) are overlaid on a single timeline, the overall trajectory is a smooth upward climb, consistent with the exponential model from the previous chapter. But three historical transitions contributed disproportionately to that acceleration. Around the 1450s, the printing press began a slow transformation of source availability; the named-individuals and academic-works lines begin to steepen in the centuries after Gutenberg, though the effect takes generations to appear. Around 1800, mass literacy, state record-keeping, and industrialization pushed most metrics into steeper growth: the 19th century shows the largest single-century jumps across multiple measures. And the 20th century's digital infrastructure produced a final surge in some metrics, though not all: academic works barely grew from the 19th century, while named individuals and source volumes exploded.

Use the Log/Linear buttons below to switch between scales. On a linear scale, the 20th century towers so far above everything else that ancient history becomes a flat line at zero. The log scale reveals the continuous nature of the acceleration: there is no single moment where everything changes at once, but rather a compounding process that these three transitions each amplified.

How to read this chart: All five metrics plotted together on one timeline. Dashed vertical lines mark three historical transitions. Use the Log/Linear buttons to switch scales: linear makes the 20th century dominate; logarithmic reveals the full shape of the curve.
All five historical knowledge metrics overlaid on one timeline with dashed markers at the printing press, the industrial revolution, and the digital era, showing compounding acceleration toward the present.
Figure 4. All metrics on one timeline; dashed lines mark the printing press, industrialization, and the digital era.

A Bird's-Eye View Reveals Dark Zones and Bright Hotspots

The heatmap below offers a compact view of all five metrics across all 33 time periods, with warmer colors indicating higher relative knowledge density (ranked within each metric, so colors reflect relative standing rather than absolute values). The dark zones (the 10th9th centuries BC, the early medieval period) are the deep troughs of human knowledge, eras where very few named individuals survive, minimal inscriptions exist, and academic attention is nearly absent.

In contrast, bright hotspots mark periods with disproportionate cultural footprints: Classical Greece (5th century BC), Imperial Rome (1st2nd century CE), and of course the entire modern era. The rightmost columns glow with intensity.

How to read this chart: Each cell is colored by rank within its row. Warmer colors mean higher relative scores; dark zones are periods where that metric is lowest. Read across a row to see how one metric varies over time. Read down a column to compare all metrics for one period.
Heatmap of rank-transformed values for five knowledge metrics across 33 historical periods, with dark zones in the early medieval period and bright hotspots for Classical Greece, Imperial Rome, and the modern era.
Figure 5. Rank heatmap of all five metrics across all 33 periods; warmer colors mean higher relative knowledge density.

One visible anomaly: the 1st century CE has an inflated ngram score. This is a data quality issue. The phrase first century appears frequently in non-historical contexts (the first century of the Industrial Revolution, the first century of American democracy), artificially boosting the signal. I'm flagging this rather than smoothing it away.

All Four Metrics Independently Agree

If academic papers, Wikipedia articles, surviving sources, and book mentions all point in the same direction, the overall pattern is robust. The scatter plots below show this directly. Each dot is one time period. When two metrics track each other, the dots line up along a diagonal. They do, consistently: Spearman rank correlations range from 0.78 to 0.93, all statistically significant after Benjamini-Hochberg correction.

This validates combining the metrics into a single composite index. They aren't telling four different stories. They are four witnesses to the same underlying phenomenon: the exponential erosion of historical knowledge over time.

Other researchers have approached the question with narrower scope. Candia et al. tracked how quickly specific cultural products (songs, movies, biographies) fade from online attention, finding a two-phase curve: a fast-decaying communicative component with a half-life of 20 to 30 years for biographies, and a slow-decaying cultural component that persists for centuries9. When Kestemont et al. applied mark-recapture methods from ecology to medieval manuscripts, they estimated that more than 90% of chivalric and heroic narratives from the period were lost entirely10. My composite index blends both layers of memory into a single curve, and the two regimes are no longer a speculation here: the best-fitting model resolves them directly, a fast component near 84 years and a slow one near 800.

How to read this chart: Each dot is one time period. When two metrics agree (both high or both low for the same periods), dots line up along a diagonal. The r value measures this agreement on a scale from 0 (no relationship) to 1 (perfect agreement).
Four log-log scatter plots showing pairwise Spearman correlations between historical knowledge metrics, ranging from 0.78 to 0.93, with each dot representing one time period.
Figure 6. Pairwise correlations between the four composite metrics (Spearman r 0.78 to 0.93).

Why This Matters

A one-century half-life is not just a statistical curiosity. It has practical consequences for how we build knowledge systems today.

Large language models are trained on text corpora that inherit this exponential gradient. The composite index shows that for any period more than a century in the past, the above-baseline portion of available documentation drops by half. For periods more than five centuries ago, it falls to a few percent of the modern level (about 2.6%, by the fitted curve). AI systems trained on these corpora do not merely reflect the biases of their curators. They absorb a structural decay that has been compounding for millennia. The model cannot flag what it does not know.

When an AI system answers a question about the Roman Republic, it draws on a corpus where more than 99.9% of the fast-fading, above-baseline layer of documentation and attention has already decayed away. What remains is the slow layer: the institutionalized residue with a half-life of roughly 800 years. The training data is dominated by what that filter happened to keep. The system cannot flag what it does not know. Neither can we, unless we measure the decay rate and account for it.

The finding also makes a concrete prediction. If the historical pattern holds, roughly half of what we currently document about the 2020s (the pandemic, the AI revolution, the wars) will have faded from active scholarship, Wikipedia coverage, and public discourse by the early 2120s. Not because the information is destroyed, but because collective attention shifts forward. This is what the decay curve shows for every previous era without exception.

That all four metrics decay at the same rate (Cronbach's alpha = 0.95) means this is not a problem confined to one domain. Archives, curricula, academic funding, and public discourse all participate in the same forgetting. Counter-decay strategies (digitization, open access, funding for ancient and medieval studies) need to explicitly account for an exponential gradient, not assume that the past preserves itself.

Objections and Responses

Every objection below has occurred to me, usually at 2am. Here is where each one lands.

Didn't you measure attention, not knowledge?

Largely conceded. What the composite tracks is present-day attention and documentation density: papers written, biographies maintained, mentions in books. I cannot distinguish knowledge decay from evidence destruction. Fewer named individuals from the 7th century BC may mean fewer records survived, fewer were translated, or fewer were deemed notable by later generations. Survivorship bias runs through every metric here; when Kestemont et al. applied unseen species models to medieval literature, they estimated more than 90% of chivalric and heroic narratives were lost entirely10. So read the half-life as a property of the record we can see. I would only add that the record we can see is exactly what an AI system trains on, which is why I think the measurement matters anyway.

Doesn't the half-life depend on the scale you fit on?

Yes, and the dependence is informative rather than embarrassing. Fitting on the raw scale gives you the fast end: 95 years for the single-exponential compromise, 84 for the pure fast component. Fitting the log-scale composite instead gives a half-life of roughly 800 years, which coincides with the slow component of the biexponential fit. The two scales are picking out the two regimes Candia et al. describe: communicative memory and cultural memory. Both numbers are reported, and the two-regime reading reconciles them.

This is an Anglophone dataset.

Conceded flatly. OpenAlex, English Wikipedia, the English Google Books corpus, and a source proxy weighted toward European archives. Chinese dynastic records, Islamic manuscript traditions, and Sanskrit corpora would trace their own curves, and I would genuinely like to see someone run this on them. The data and code are CC-BY for exactly that reason.

Your source proxy is stitched together and partly hand-estimated.

True. It splices inscription counts, manuscript production estimates, and print title counts across eras, and 16 of its 33 values are tier-3 manual estimates, each flagged per-value in the codebook. The strongest version of this objection is to drop the metric entirely, so I did: without the source proxy the half-life becomes 123 years. The conclusion that survives every variant is a fast layer of decades-to-a-century riding on an exponential family; if anything, removing my hand-estimated metric makes the decay slower, not faster.

Don't a few modern centuries drive the fit?

Less than I feared. Dropping each of the 32 periods one at a time and refitting gives half-lives from 92.5 to 103.6 years; the worst cases are dropping the 19th or 18th century, and the estimate never leaves the bootstrap interval. There is one analytic choice worth disclosing: standardizing the metrics on all periods rather than the 32 fitted ones moves the estimate to about 109 years. I report 95 because standardizing on exactly the rows you fit is the defensible default, but the difference tells you the second digit is soft.

Thirty-three data points, four correlated metrics.

Conceded. With inter-metric correlations of 0.78 to 0.93, I have something closer to one and a half independent witnesses than four, and a Cronbach's alpha of 0.95 is evidence that the metrics measure one coherent construct, not evidence of independent replication. Two of the six metrics originally collected were dropped for exactly this reason: the Wikipedia people count duplicated named individuals, and the events count was empty. I have tried to state the estimate at the precision the data supports, roughly a century rather than 95.25 years, and everything is published so that someone with better data can break it.

What Gets Lost

The past is not equally preserved. It fades in a measurable, predictable way. An exponential curve as clean as any in physics. And the institutions we build to study history don't fight this decay. They amplify it, pouring attention onto the centuries that already have the most sources while the deep past grows quieter.

Reich showed that DNA preserves what texts cannot. But even DNA decays. Recombination chops ancestral blocks smaller each generation. Drift erases rare lineages. Everything in the historical record follows the same pattern. The signal fades. That's the finding.

This analysis reflects Anglophone and Western historiographic traditions. Chinese dynastic records, Islamic manuscript traditions, and Indian literary heritage would trace different curves. The half-life of history isn't a universal constant. It's a measure of what one civilization chose to preserve and what it chose to study.

Methodology & Technical Details

Correction to the Original Estimate (June 2026)

The April 2026 version of this essay reported a half-life of 84 years (see the archived original). Two defects produced that figure. First, the model had no intercept: the exponential was forced to decay to zero even though the z-score composite has a floor near 0.33, and the misfit showed in a Durbin-Watson statistic of 0.17. Second, the standardization: each metric was z-scored using only the century-length periods, accidentally including the incomplete 21st century. The corrected pipeline fixes both. The table below shows the half-life under every combination of the two choices; the corrected notebook reproduces the published 84.0 exactly under the original specification.

SpecificationAll 33 periodsCentury rows (v1 published)32 fit periods (corrected)
No intercept (v1 model)85.284.0 (published)78.5
With intercept (corrected)109.1109.395.3 (corrected)

Fixing the model alone gives roughly 109 years; fixing the scaling alone gives 78.5; fixing both gives 95. One further note on the old uncertainty range: the published interval (23 to 101 years) came from the misspecified model and an unstructured bootstrap dominated by a few high-leverage points (its median, 71, sat far from the point estimate of 84). The old point lies outside the corrected interval of 87 to 107, which is the expected outcome of fixing a misspecified model, not evidence against the new fit.

Model Selection

Seven candidate models were fitted to the composite index for 32 time periods (the partial 21st century excluded), ranked here by AICc:

Biexponential + baseline: AICc = 155.9, R² = 0.994Exponential + baseline: AICc = 146.3, R² = 0.990Power law + baseline: AICc = 125.6, R² = 0.981Exponential (no baseline): AICc = 77.5, R² = 0.909Power law (no baseline): AICc = 63.9, R² = 0.861Logarithmic: AICc = 33.8, R² = 0.643Linear: AICc = 9.8, R² = 0.246

The biexponential with baseline fits best by AICc and resolves a fast component (half-life 84 years) and a slow component (roughly 800 years). The primary specification is the single exponential with baseline, y = a · eλt + c, with a = 8.28, λ = 0.00728, c = 0.288, giving a half-life of 95.3 years (ln 2 / λ). It sits about 10 AICc points behind the biexponential but uses two fewer parameters, a parsimony tradeoff I prefer at n = 32. The half-life is defined on the decaying part of the signal: how far back you go for the above-baseline portion of the index to halve. The composite itself never halves below the floor c.

Bootstrap Confidence Intervals

Because the residuals are serially correlated in time, the primary uncertainty estimate uses a moving-block residual bootstrap (block length b = 8, 10,000 resamples): residuals from the fitted exponential-with-baseline model are resampled in contiguous blocks, added back to the fitted curve, and the model is refit on each synthetic dataset. The 95% interval for the half-life is 87 to 107 years, with the bootstrap median (95.5) sitting on the point estimate, which indicates no leverage bias. Robustness runs with b = 5 and with iid residual resampling give materially the same interval. (The original version used an iid pairs bootstrap whose median, 71, sat far from its point estimate of 84, one of the symptoms of the misspecification described above.)

Half-life (point): 95.3 yearsHalf-life (95% CI): 87 to 107 yearsMethod: moving-block residual bootstrap, b = 8N periods: 33 (32 fitted)

Composite Index Construction

Four metrics were used in the composite index: OpenAlex academic work counts, Wikipedia named individuals, era-specific source proxy, and Google Books ngram discourse frequency. Two metrics were excluded: wikipedia_people_count (identical to named individuals for all 33 periods, both drawing from the same PetScan query) and wikipedia_events_count (null for all periods). Films set in each period are collected and displayed but treated as supplementary and not included in the composite. Each metric was z-scored using the mean and standard deviation (ddof = 1) of the 32 fitted periods, with the incomplete 21st century excluded from both the standardization and the fit. The arithmetic mean of available z-scores per period formed the composite index.

Bias Residual

The bias chart compares z-scored source availability against z-scored academic attention (OpenAlex works). The residual (attention minus source) indicates whether an era receives more or less scholarly focus than its surviving source base would predict. The chart and the prose cover the 32 fitted periods only: under the canonical standardization the partial 21st century produces a large negative artifact (z-source +13), so it is excluded as not-yet-history. The source proxy changes measurement regime across eras: cuneiform tablets and inscriptions for antiquity, manuscript production estimates (Buringh & Van Zanden 2009) for the medieval period, print title counts (USTC, Britannica) for the early modern era, and publication volume estimates for the modern period.

Correlation Analysis

Pairwise Spearman rank correlations were computed between the four metrics used in the composite. P-values were adjusted using the Benjamini-Hochberg FDR correction at α = 0.05. All pairs were statistically significant, with Spearman r values ranging from 0.78 to 0.93.

Data Sources

  • OpenAlex API (academic works, History concept C95457728)
  • PetScan / Wikipedia API (biography article counts)
  • Google Books Ngram Viewer (discourse frequency, en-2019 corpus)
  • Buringh & Van Zanden 2009 via Our World in Data (manuscript production)
  • USTC / ISTC (print edition counts)
  • Britannica / UNESCO (modern publication volumes)
  • EDH / PHI / CDLI (inscription/tablet databases)
  • Wikipedia Films categories (supplementary)

Sensitivity Analysis

To assess the stability of the half-life estimate, several sensitivity checks were performed:

Leave-one-metric-out: Dropping each of the four metrics in turn and refitting yields half-lives from 70.9 to 123.0 years. The estimate is most sensitive to dropping the source proxy (123.0) or named individuals (118.5) and least sensitive to dropping ngram discourse (70.9); R-squared stays between 0.98 and 0.99 in every variant.

Drop openalex: 74.1 yrDrop named_individuals: 118.5 yrDrop source_proxy: 123.0 yrDrop ngram_discourse: 70.9 yr

Leave-one-period-out: Dropping each of the 32 fitted periods in turn and refitting gives half-lives from 92.5 years (drop the 19th century) to 103.6 years (drop the 18th); every other drop lands between 93.6 and 96.0. No single period moves the estimate outside the bootstrap interval.

Exclude millennia: Removing the two aggregated millennia (which span 10x the time of century-level data) and refitting yields a half-life of 95.3 years (N=30), virtually identical to the full estimate.

Exclude 1st century CE: Removing the 1st century (whose ngram value is inflated by non-historical usage) yields 96.0 years (N=31), essentially unchanged.

Median composite: Using the median instead of the mean to aggregate z-scores yields a half-life of 95.7 years, within the confidence interval.

Log-scale composite: Fitting the log-scale composite gives a half-life of roughly 818 years (R² = 0.86), coinciding with the slow component of the biexponential fit rather than the fast layer.

Residual Diagnostics

Diagnostic tests on the residuals of the primary (exponential-with-baseline) model:

Durbin-Watson: 1.17Shapiro-Wilk p: 0.37Goldfeld-Quandt ratio: 1.24 (p = 0.34)

The Durbin-Watson statistic of 1.17 (where 2.0 indicates no autocorrelation) shows mild residual autocorrelation, a large improvement over the misspecified original. As a legacy exhibit: the published fit's DW was 0.17, and refitting the no-intercept model on the corrected composite gives 0.16; both are textbook misspecification. Normality is no longer rejected (Shapiro-Wilk p = 0.37) and there is no evidence of heteroscedasticity (Goldfeld-Quandt ratio 1.24, p = 0.34). Two periods exceed two standard residuals: the 1st century CE at +2.45 (above the curve) and the 18th century at 2.38 (below it).

Scale Reliability

To assess whether the four metrics measure a coherent underlying construct:

Cronbach's α: 0.954PCA PC1 variance: 87.8%

Cronbach's alpha of 0.954 far exceeds the conventional threshold of 0.7 for acceptable internal consistency. The first principal component explains 87.8% of variance across the four metrics, confirming that a single latent dimension (plausibly historical knowledge density) accounts for most of the shared variation. Both statistics are computed on the 32 fitted periods; the originally published values (alpha 0.879, PC1 74.8%) were computed under the old standardization basis.

Data downloads: data.csv, data.json, codebook. Released under CC-BY-4.0. The full dataset, analysis notebook, and results are also archived with a citable DOI on OSF: 10.17605/OSF.IO/TY5G9.

References

  1. Reich, D. (2026). Your ancestors aren't who you think they are [Interview]. Big Think. https://www.youtube.com/watch?v=a0uKLW07Jlg
  2. Haak, W., et al. (2015). Massive migration from the steppe was a source for Indo-European languages in Europe. Nature, 522, 207211. doi:10.1038/nature14317
  3. Lazaridis, I., et al. (2014). Ancient human genomes suggest three ancestral populations for present-day Europeans. Nature, 513, 409413. doi:10.1038/nature12736
  4. Buringh, E. & Van Zanden, J.L. (2009). Charting the Rise of the West: Manuscripts and Printed Books in Europe, A Long-Term Perspective. Journal of Economic History, 69(2), 409445. doi:10.1017/S0022050709000837
  5. Arbesman, S. (2012). The Half-Life of Facts: Why Everything We Know Has an Expiration Date. Current/Penguin. penguinrandomhouse.com
  6. Michel, J.B., et al. (2011). Quantitative Analysis of Culture Using Millions of Digitized Books. Science, 331, 176182. doi:10.1126/science.1199644
  7. Priem, J., Piwowar, H. & Orr, R. (2022). OpenAlex: A fully-open index of scholarly works, authors, venues, institutions, and concepts. arXiv:2205.01833. doi:10.48550/arXiv.2205.01833
  8. Assmann, J. (1995). Collective Memory and Cultural Identity. New German Critique, 65, 125133. doi:10.2307/488538
  9. Candia, C., et al. (2019). The universal decay of collective memory and attention. Nature Human Behaviour, 3, 8291. doi:10.1038/s41562-018-0474-5
  10. Kestemont, M., et al. (2022). Forgotten books: The application of unseen species models to the survival of culture. Science, 375, 765769. doi:10.1126/science.abl7655
By Mark Pavlyukovskyy. Built with Claude Code. Data from OpenAlex, Wikipedia/PetScan, Google Ngrams, Our World in Data, USTC, EDH, PHI, CDLI. April 2026, updated June 2026.