A judicial opinion has never really been a single document. It is a node, a point where dozens of earlier rulings converge, are weighed, distinguished, extended, or buried, and from which dozens of later rulings will radiate. The text on the page is only the surface. Underneath sits a vast directed network of citations, statutes, and parties that no human reader has ever held in mind all at once. For most of the common law’s history, that network was invisible by necessity. Today, for the first time, it can be drawn.
The shift now under way in the courts ecosystem is not really about faster search. It is about representation. When precedent is modeled as an interactive graph rather than a list of results, the questions a judge or advocate can ask change shape. Instead of “find me cases like this one,” the question becomes “show me how authority on this issue has moved over forty years, which lines have weakened, and which obscure ruling secretly anchors them all.” That is a different kind of legal reasoning, and the technology to support it has finally caught up with the ambition.
The Old Way: Reading the Law as a Stack of Books
The traditional method of verifying whether a case is still “good law” is one of the great unsung labors of legal practice. Generations of lawyers were taught that failing to check a citation could itself be malpractice, and that the citators built to do this work, the printed and later digital indexes that flag whether a decision has been overruled, distinguished, or affirmed, were the profession’s safety net. The trouble is that the net has holes in it that few practitioners ever see.
When one researcher systematically compared the three leading commercial citators across 357 citing relationships, the results were unsettling. All three systems agreed that a case had received negative treatment only 53 times; in roughly 85% of the relationships examined, the citators disagreed on whether negative treatment had even occurred. Agreement on the precise subset of negative treatment fell to about 11%. The systems lawyers trusted to tell them whether a precedent was alive or dead were, much of the time, telling them different things.
The deeper problem was structural. A list-based citator can tell you that Case B cited Case A and tagged it “distinguished.” It cannot easily tell you that the distinguishing rationale was itself drawn from a third line of authority that an appellate court quietly abandoned a decade later, dragging the validity of the whole branch with it. Authority in the common law does not decay in straight lines. It decays through the network, and the network was never something a researcher could hold in view.
Empirical work on precedent makes the scale of that blind spot concrete. A study of how Supreme Court opinions are cited over time found that precedents depreciate about 81% in citation frequency between their first and twentieth years at the Supreme Court, and about 85% at the courts of appeals. A precedent is rarely overturned in a single dramatic ruling; far more often it simply stops being cited, its authority eroding silently while the books that contain it sit unchanged on the shelf.
How Fast Precedent Fades
Estimated decline in the probability a case is cited, by years since decision
Illustrative depreciation curves anchored to measured endpoints: citation likelihood falls roughly 81% by year 20 at the Supreme Court and 85% at the courts of appeals. Source: Black & Spriggs, The Citation and Depreciation of U.S. Supreme Court Precedent, Journal of Empirical Legal Studies (2013).
The Shift: When the Citation Graph Became Computable
What changed was the arrival of comprehensive, machine-readable maps of the law. The intellectual groundwork was laid by legal scholars who began treating bodies of case law not as collections of documents but as networks, directed graphs in which each opinion is a vertex and each citation an edge pointing backward in time. The ambition was old; the data finally made it real.
One landmark analysis assembled and studied citation networks of remarkable size: a Supreme Court network of 27,885 opinions linked by 235,881 citations, a federal appellate network of 959,985 cases joined by more than 6.6 million citations, and, behind both, an open database of over 3 million court opinions connected by more than 25 million citations across 400-plus jurisdictions. The structure of these networks turned out to carry information that no headnote captures. Citation ages in the Supreme Court network are heavily skewed, with a median age of just fourteen years, and newer cases are systematically more likely to be cited than older ones, quantitative confirmation of the intuition that the law is restless.
| Network | Opinions (nodes) | Citations (edges) | Coverage |
|---|---|---|---|
| U.S. Supreme Court | 27,885 | 235,881 | 1791 to 2016 |
| Federal courts of appeals | 959,985 | 6,649,916 | 13 federal circuits |
| Open multi-jurisdiction corpus | 3,000,000+ | 25,000,000+ | 400+ jurisdictions |
These maps did more than describe the past. They proved predictive. The same research found that a decision’s position in the network, how richly it is grounded in prior authority, and how recently it has been cited, forecasts its future influence better than raw citation counts, with time-aware metrics outperforming time-agnostic ones. Separately, an analysis of overruled Supreme Court decisions showed that doomed rulings are not random: cases that go on to be overruled tend to occupy more central network positions and to depreciate more slowly than comparable peers, behaving like “bad law” in measurable ways before any court formally pronounces them dead. The network, in other words, can sometimes see the obituary coming.
A precedent rarely dies in a single ruling. It decays through the network, and for the first time, that decay can be watched in real time rather than discovered too late.
From these academic foundations grew a new class of interactive systems. Knowledge-graph platforms in this space ingest opinions, statutes, judicial orders, legislation, and the parties themselves as distinct node types, linked by typed relationships, cites, distinguishes, involves, applies-section. Researchers building such graphs from court repositories have demonstrated the practical payoff: graph neural networks trained on a legal knowledge graph improved the prediction of citation links from a ROC-AUC of 0.587 to 0.725 once domain features and legal language models were layered in, and the same structures power similar-case recommendation, link prediction, and faceted semantic search across hundreds of thousands of entities.
Teaching a Graph to Read Precedent
Model accuracy (ROC-AUC) on legal knowledge-graph tasks as domain knowledge is added
Relational graph neural network performance on citation-link prediction and case-similarity tasks over a legal knowledge graph of 329,179 entities. Source: Dhani et al., Similar Cases Recommendation using Legal Knowledge Graphs, SAIL (2023), arXiv:2107.04771.
What It Looks Like Now: Authority You Can Steer
For a judge or an advocate, an interactive precedent model reframes the daily work. A litigator preparing a brief no longer asks merely for cases on point; she traces a chain of authority visually, watching how a doctrine was assembled, where rival circuits diverged, and which node carries the most weight in the line she intends to rely on. A clerk vetting a draft opinion can surface the hidden third-party connections, a recurring expert, an upstream statute, a quietly influential dissent, that a linear search would never connect. The graph turns “is this still good law?” from a binary lookup into a navigable terrain.
Adoption inside the courts themselves is real but uneven, and the data shows a profession moving carefully. In the first random-sample survey of U.S. federal judges, conducted by a Northwestern University research team and released in 2026, more than 60% of responding judges reported using at least one AI tool in their judicial work, though only 22.4% used such tools weekly or daily and about 38% had never used any of the listed tools at all. Where judges did reach for these systems, legal research dominated: 30% used AI for legal research and 39.8% reported staff in their chambers doing so, with judges favoring legal-specific tools integrated into established research platforms over general-purpose chatbots.
| Measure | Share of responding judges |
|---|---|
| Used at least one AI tool in judicial work | > 60% |
| Use AI tools weekly or daily | 22.4% |
| Never used any listed AI tool at work | 38.4% |
| Use AI to conduct legal research | 30.0% |
| Chambers staff use AI for legal research | 39.8% |
| Say no AI training was provided | 45.5% |
At the state and local level, where most litigation actually happens, the picture is more cautious still. A 2025 survey of U.S. state courts by the Thomson Reuters Institute and the National Center for State Courts found that while 55% of court professionals expect AI to have a “transformative” or “high” impact, just 17% of courts had instituted any generative-AI capability and more than two-thirds said AI was not allowed for court business at all, a figure that climbs above 80% for county and municipal courts. The same survey found roughly two-thirds of courts struggling with staff shortages and only about half of professionals reporting enough time to do their jobs, the precise conditions under which a well-built precedent map earns its keep.
Where Courts Stand on AI
Selected findings from a 2025 survey of U.S. state courts
Percent of court-professional respondents. Source: Thomson Reuters Institute & National Center for State Courts, Staffing, Operations and Technology: A 2025 Survey of State Courts.
The Trust Problem: Interpretability and Over-Reliance
The promise of a living precedent network is inseparable from its central risk. A graph that surfaces hidden connections is only as trustworthy as the inferences it draws, and the broader wave of generative legal tooling has given courts ample reason for caution. A peer-reviewed Stanford study of leading purpose-built legal research tools found they hallucinated between 17% and 33% of the time, far better than general-purpose chatbots, which earlier Stanford work found erred on 69% to 88% of specific legal queries, but still often enough that every output demands verification.
The consequences are no longer hypothetical. A widely cited public database tracking court decisions in which a party relied on fabricated AI-generated material recorded roughly 200 cases in mid-2025, rising to 719 by January 2026 and 1,598 by June 2026, an acceleration to nearly eight new cases a day. Sanctions have escalated in parallel, from a $5,000 fine in the first prominent matter to combined penalties approaching $109,700 in a single later case, alongside revoked admissions and multi-year suspensions.
The Cost of Unverified Authority
Cumulative documented court decisions involving AI-fabricated citations, worldwide
Documented cases in which a court found or implied reliance on hallucinated material. Source: Damien Charlotin AI Hallucination Cases database, as compiled in audits through June 2026.
This is exactly why a graph-based representation, properly built, is more than a cosmetic upgrade. Where a generative summary invites a reader to trust an opaque assertion, an interactive model invites them to inspect the path: the edges between two cases, the chain of citations that connects a holding to its support, the statutory section that links a party to a doctrine. The most useful systems in this space already display the most prominent path between two related cases and let a user click through to the underlying citing text. Interpretability is not a bonus feature here; it is the difference between a tool a court can rely on and one a judge must forbid.
Two Ways to Be Wrong About the Law
Error and disagreement rates across legal verification methods (%)
Hallucination rates from Stanford RegLab testing of legal research tools and general-purpose models; citator figure is the share of citing relationships on which three commercial citators disagreed about negative treatment. Sources: Stanford RegLab (2024); Hellyer, Law Library Journal (2018).
The Next Few Years: From Map to Instrument
Over the next three to seven years, the trajectory points away from precedent graphs as a research convenience and toward them as a standing instrument of judicial infrastructure. Three developments seem most likely. First, citation networks and knowledge graphs will increasingly serve as the verification layer beneath generative tools rather than competing with them, a model that drafts a passage will be checked against the graph to confirm that every cited authority exists, is correctly characterized, and remains good law before the text ever reaches a clerk. Grounding generation in an explicit, inspectable network is the most credible answer to the hallucination problem documented above.
Second, the “good-law / overruled” question will shift from retrospective lookup to continuous monitoring. Because network position predicts both future influence and the slow decay that precedes overruling, courts and advocates will plausibly receive early-warning signals, alerts that a relied-upon line of authority is weakening across jurisdictions before any single ruling formally disturbs it. The half-life of precedent, measurable but invisible for two centuries, becomes a dashboard metric.
Third, adoption will be gated less by capability than by governance. The federal judges’ survey found 45.5% reporting no AI training at all and judges nearly evenly split between optimism and concern, with explicit worries about hallucinations, “zombie cases,” and skill atrophy; the state-courts survey found more than one-third citing over-reliance on technology as a leading concern. The courts that move fastest will be those that pair interactive models with mandatory verification protocols, audit trails, and training, treating the graph as a transparent aid to human judgment rather than a substitute for it.
Conclusion
The arc from the bound reporter to the interactive knowledge graph is, at bottom, a story about what the law lets us see. The old way hid the network behind the document, leaving lawyers to reconstruct connections by memory and luck, and citators to disagree quietly about whether a case still stood. The present moment has made the network computable, predictive, and navigable, while also exposing, in a rising tide of sanctioned fabrications, exactly what happens when the connections are imagined rather than verified. The future belongs to the systems that hold both truths at once: that the law is a living network worth mapping, and that every edge in that map must be one a human can trace, question, and trust.
Sources
- Iain Carmichael, James Wudel, Michael Kim & James Jushchuk, “Examining the Evolution of Legal Precedent Through Citation Network Analysis,” 96 N.C. L. Rev. 227 (2017). https://scholarship.law.unc.edu/cgi/viewcontent.cgi?httpsredir=1&article=5717&context=nclr
- Ryan C. Black & James F. Spriggs II, “The Citation and Depreciation of U.S. Supreme Court Precedent,” Journal of Empirical Legal Studies (2013). https://onlinelibrary.wiley.com/doi/10.1111/jels.12012
- Ryan Whalen, “Bad Law Before it Goes Bad: Citation Networks and the Life Cycle of Overruled Supreme Court Precedent” (2012). https://papers.ssrn.com/sol3/papers.cfm?abstract_id=2101273
- Jaspreet Singh Dhani et al., “Similar Cases Recommendation using Legal Knowledge Graphs,” SAIL (2023), arXiv:2107.04771. https://arxiv.org/html/2107.04771v2
- Northwestern University, “Federal judges report broad adoption of AI tools” (March 2026). https://news.northwestern.edu/stories/2026/03/northwestern-study-finds-a-significant-number-of-federal-judges-are-already-using-ai-tools
- New York City Bar Association, “Artificial Intelligence in Federal Courts: A Random-Sample Survey of Judges” (March 2026). https://www.nycbar.org/reports/artificial-intelligence-in-federal-courts-a-random-sample-survey-of-judges/
- Thomson Reuters Institute, “Courts remain slow to AI adoption” (June 2025), reporting the TRI & NCSC 2025 State Courts Survey. https://www.thomsonreuters.com/en-us/posts/ai-in-courts/courts-slow-ai-adoption/
- Thomson Reuters Institute & National Center for State Courts, “2025 State Courts Survey” (report PDF). https://www.thomsonreuters.com/en-us/posts/wp-content/uploads/sites/20/2025/05/2025-State-Courts-Survey.pdf
- Kristina Niedringhaus reviewing Paul Hellyer, “Evaluating Shepard’s, KeyCite, and BCite for Case Validation Accuracy,” 110 Law Libr. J. 449 (2018). https://lex.jotwell.com/is-it-a-good-case-can-you-rely-on-bcite-keycite-and-shepards-to-tell-you/
- Paul Hellyer, “Evaluating Shepard’s, KeyCite, and BCite for Case Validation Accuracy” (PDF). https://www.aallnet.org/wp-content/uploads/2018/12/LLJ_110n4_02_hellyer.pdf
- Varun Magesh et al. / Stanford RegLab, “Hallucination-Free? Assessing the Reliability of Leading AI Legal Research Tools” (2024). https://reglab.stanford.edu/publications/hallucination-free-assessing-the-reliability-of-leading-ai-legal-research-tools/
- Stanford Law School / RegLab, “Hallucinating Law: Legal Mistakes with Large Language Models are Pervasive” (January 2024). https://law.stanford.edu/2024/01/11/hallucinating-law-legal-mistakes-with-large-language-models-are-pervasive/
- HAQQ, “AI Hallucination Cases: The Sanctions Tracker” audit of the Damien Charlotin database (June 2026). https://haqq.ai/blog/ai-legal-hallucination-audit
- Administrative Office of the U.S. Courts, “Federal Judicial Caseload Statistics 2025” / Judicial Caseload Indicators. https://www.uscourts.gov/data-news/reports/statistical-reports/federal-judicial-caseload-statistics/judicial-caseload-indicators-federal-judicial-caseload-statistics-2025
- Cambridge Core / Political Analysis, “Generative Dynamics of Supreme Court Citations: Analysis with a New Statistical Network Model” (2021). https://www.cambridge.org/core/journals/political-analysis/article/generative-dynamics-of-supreme-court-citations-analysis-with-a-new-statistical-network-model/3194E30BCCF421A5360389A13029EEB8
