Stephane Budel
Home

The Diffusion Index

The NGS Diffusion Atlas

From Methods to Medicine

Next-generation sequencing did not just grow — it spread. It began as a methods-and-bioinformatics frontier and worked its way, application by application, into oncology, prenatal testing, liquid biopsy, and the rest of clinical medicine. This view traces that spread across 98,417 classified NGS publications, 20062025.

8% → 49%

Clinical & diagnostic share of NGS papers (2008 → 2025)

2.9 → 7.1

Effective applications — the breadth NGS is spread across (2008 → 2025)

10 / 11

Applications at mainstream scale (100+ papers in 2025)

98,417

NGS papers classified by application, 2006–2025

A scholarly diffusion view: it measures how NGS spread through the published research community, classified by application from each paper’s title and abstract. It is not a measure of clinical or commercial adoption. Data through June 2026; 2026 is partial and excluded from this view.

When each application lit up

A dot each time an application’s cumulative output crossed an order of magnitude — 1, 10, 100, 1,000, 10,000 papers — with a final dot at the present. Tightly-spaced rings mean explosive growth; stretched-out rings mean a slow climb. Lanes run from basic research at the top, through translational and clinical, down to applied markets — read top to bottom and the rings drift rightward: research arrived first, the clinical and diagnostic applications followed, most within a decade. Toggle the basis between cumulative totals and annual output and the rings reposition — the year a field’s running total passed 1,000 is not the year it first published 1,000 papers in a single year.

Rings mark
Academia / basic research
20,708
Population studies
4,123
Biopharma R&D
4,715
Biopharma clinical trials
944
Liquid biopsy — clinical trials
592
Clinical Dx — oncology
13,820
Liquid biopsy — diagnostics
2,156
Clinical Dx — NIPT / prenatal
1,383
Clinical Dx — other
20,416
Epidemiology / infectious dz
5,950
Other applications
23,610
20062010201520202025
cumulative

Each dot = the year cumulative output crossed an order of magnitude; the final dot is the all-time total.

Dot size:1101001k10k

From research into the clinic

The clearest single signal of NGS becoming infrastructure: the share of NGS publications doing clinical and diagnostic work climbed from 8% in 2008 to 49% by 2025. Meanwhile the number of effective applications — how many distinct uses the literature is meaningfully spread across — more than doubled, from 2.9 to 7.1.

“Clinical & diagnostic” combines clinical Dx (oncology, NIPT, other), liquid-biopsy diagnostics, and biopharma/liquid-biopsy clinical trials. “Effective applications” is the exponential of the Shannon entropy of the application mix (1 = a single use; 11 = perfectly even across all categories).

How fast each application diffused

Years from an application’s first NGS paper to its first year at mainstream scale. A short gap means the use case was waiting for the technology; a long one means the science (or the clinical evidence) had to catch up.

ApplicationFirst paperReached 100/yrYears to mainstream2025
Academia / basic research
2006200931,372
Population studies
200620115315
Other applications
2006201152,336
Clinical Dx — other
2006201262,688
Biopharma R&D
200720147535
Clinical Dx — oncology
2006201371,734
Epidemiology / infectious dz
200620148838
Biopharma clinical trials
201220219136
Liquid biopsy — clinical trials
20132022995
Liquid biopsy — diagnostics
2009201910273
Clinical Dx — NIPT / prenatal
2009201910168

From publications to dollars

The Atlas reads scholarly activity. Does it track the money? We also model the NGS market bottom-up — and the two instruments agree. Expressed in the same unit, the share of NGS that is clinical, publications (49% in 2025) and manufacturer revenue (~51%) have converged on roughly half. The publication curve is the 15-year climb that got there first.

Publication share from the Atlas. Revenue share from DeciBio’s NGS Market Report (11th ed., Dec 2025; figures as shared publicly in the Cowen investor presentation), which sizes the NGS manufacturer market — instruments, reagents, bioinformatics — at ~$7.3B in 2025 (clinical ≈ $3.7B) and excludes diagnostic service revenue. 2028 is projected from the report’s published clinical growth rates. Two different things (research activity vs. tool revenue), one shared axis.

Atlas signal

Clinical & diagnostic share of NGS papers rose from ~8% (2008) to 49% (2025).

Market confirms

Clinical is now ≈$3.7B of the ~$7.3B market and ~80% of all market growth to 2028.

Atlas signal

Liquid-biopsy applications emerged late (first papers 2009–2013) and are still climbing.

Market confirms

MRD / liquid biopsy is the fastest-growing demand — Natera Signatera grew 54% YoY to ~211k tests in Q3 2025.

Atlas signal

The research lanes (academia, population) are the oldest and now the slowest-growing.

Market confirms

Academia is ~25% of revenue but growing only ~1%/yr — research publishes, the clinic pays.

Atlas signal

Effective applications more than doubled (2.9 → 7.1) — NGS spread across many uses.

Market confirms

Multiple engines now drive demand — rare-disease WGS/WES (GeneDx >30% growth) and women’s health beyond NIPT.

And the value accrued to diagnostics, not tools: through Dec 2025, Guardant Health was up 221% and Natera up 52% on the year while Illumina was down 2% — even as sequencing data volumes grew 30–40%/yr and reagent revenue rose only ~3%/yr. The reads got cheaper; the clinical decisions they enable got more valuable. (DeciBio NGS Market Report / Cowen, Dec 2025.)

What this measures (and what it doesn’t)

Every NGS paper with an abstract is assigned an application area by a language model (Claude Haiku, via the Batch API) from its title and abstract. Because the classification reads the science itself — not author affiliations — this view carries none of the first-author or institution-resolution bias that an institution-level analysis would.

It is a measure of scholarly diffusion: where the published research community is putting NGS to work. It is a leading indicator, not a measure of clinical or commercial adoption — a field can be heavily published before it is widely reimbursed, and vice-versa. “Mainstream” here is a deliberately simple, transparent bar (100+ papers in a single year), chosen so the milestones are auditable rather than tuned. Papers that span several applications are assigned a primary area, so counts are conservative for genuinely cross-cutting work.

Deliberately out of scope (for now): institution- and author-level diffusion — who entered the field, collaboration networks, and bridge institutions — which the current dataset (first-author affiliation strings, no coauthorship) can’t support credibly. That would need a structured authorship source such as OpenAlex.