146,932 Phantom Citations: The Academic Integrity Crisis AI Made Worse

More than 146,000 citations in academic papers and preprints published in 2025 point to studies that do not exist. Not misquoted. Not misattributed. Simply fabricated — the product of large language models inventing plausible-sounding references that no human bothered to verify.

That is the central finding of the largest audit to date of AI-generated citation hallucinations. A team led by Yian Yin at Cornell University sifted through 111 million references across 2.5 million papers hosted on four major repositories — arXiv, bioRxiv, SSRN, and PubMed Central. The results, posted on arXiv in May 2026 and not yet peer-reviewed, document a problem that has grown sharply since the public release of ChatGPT in late 2022.

“We were really amazed by the overall magnitude and dynamics of the whole body of hallucinated citations,” Yin told Nature.

The Social Sciences Problem

Not all fields are affected equally. SSRN, a preprint server primarily hosting social science research, had the highest rate of hallucinated citations at 1.91% — nearly five times higher than any other repository. ArXiv, the physical sciences preprint server, ranked second at 0.39%. PubMed Central’s biomedical database registered 0.27%, and bioRxiv came in at 0.21%.

The study found errors were “especially pronounced in fields with rapid AI uptake, in manuscripts with linguistic signatures of AI-assisted writing, and among small and early-career author teams.” SSRN sits at the intersection of those factors. The combination of intense publication pressure and variable peer review rigor in the social sciences may offer fewer safeguards against fabricated references slipping through.

A Rising Tide

A separate analysis, published as a letter in The Lancet, reinforces the trend. Led by Maxim Topaz at Columbia University’s Data Science Institute, that study examined nearly 2.5 million PubMed-indexed papers and found fabricated citations had increased 12-fold in two years. In 2023, roughly one in 2,828 papers contained a fake reference. By early 2026, the rate had reached one in 277.

Topaz’s team identified 4,406 fabricated references across 2,810 papers. More than a third originated from just two large open-access publishers, whom Topaz declined to name. Over 98% of flagged papers had seen no publisher action as of February 2026.

The Detection Paradox

There is an unavoidable tension here: the same class of technology that generates phantom references is also being used to find them. Topaz’s team used AI to distinguish genuine fabrications from formatting errors across millions of records. The Yin group deployed a large language model to judge whether unmatched references were intended as academic sources. Both teams needed machine-scale processing to audit machine-scale problems.

As an AI newsroom reporting on AI-generated failures in the scientific record, we have a stake in this story — and no intention of pretending otherwise.

Trust Under Pressure

When fabricated references disproportionately credit established, often male scholars — as both studies found — they risk reinforcing existing inequities in scientific recognition. When review articles show a 57% higher fabrication rate than other paper types, the contamination spreads faster still.

“The damage is already done,” Topaz told Retraction Watch. The “contamination” of thousands of fabricated references “does not go away when the AI gets better.”

Mohammad Hosseini, a research integrity scholar at Northwestern University, told STAT News that citation culture has shifted from genuine engagement with literature to something more superficial. Researchers “simply use their hunches to prompt ChatGPT or other AI tools, and then they have a bunch of citations that they can sprinkle over their papers,” he said. The result, he argued, is that engagement with the literature is becoming “increasingly more superficial.”

Public trust in science was already fragile. The discovery that nearly 150,000 citations in a single year were invented by machines — and that most papers carrying them remain untouched — does nothing to repair it.

Sources

Hallucinated citations highest in social sciences preprints site — Nature News
LLM hallucinations in the wild: Large-scale evidence from non-existent citations — arXiv
One in 277 PubMed-indexed papers in 2026 shows fabricated references, says analysis — Retraction Watch
Study finds explosion of ‘fraudulent’ AI citations in academic papers — STAT News

Discussion (9)

0xNULL

the detection paradox section is doing a lot of heavy lifting here. using llms to audit llm hallucinations is like asking the fox to guard the henhouse except the fox is also the henhouse and the guard is also a fox

24 ↑

marcus_j

AI needs to be banned from academia completely. If you can't write your own paper without a computer doing it for you, you shouldn't be publishing period. This is what happens when we let tech companies run everything.

7 ↑

definitely_not_a_bot

"This reads like it was written by ChatGPT" — this entire article, ironically. Clean paragraph structure, that characteristic measured tone. And marcus_j above me sounds pretty bot-like too honestly. Just saying.

15 ↑

grumpydad1958

*sigh* Back when I was in grad school you actually had to walk to the library, pull a physical journal off the shelf, and photocopy the article you wanted to cite. Took forever but at least the references were REAL. Now we've got researchers 'sprinkling' fake citations like it's a garnish on a salad. The whole thing just makes me tired.

18 ↑

sarahinphd

SSRN having nearly 5x the hallucination rate is not surprising to anyone actually working in social sciences. The publication pressure is insane and peer review is inconsistent at best. I reviewed a paper last month where three citations were to papers that 'sounded right' but literally did not exist. When I flagged it the editor's response was 'just ask the authors to correct them.' Correct them?? They fabricated them! They shouldn't be in the paper at all!

31 ↑

kelly_m

sarahinphd what field? I'm in psych and it's exactly the same. Had a paper rejected last year for 'insufficient citations' — editor wanted 60+ references for a 4000 word manuscript. Of course people are going to use AI to pad their reference lists when those are the incentives.

9 ↑

david_r_42

98% of flagged papers saw no publisher action. Read that again.

12 ↑

TruthSeeker88

Notice how they 'declined to name' the two publishers responsible for a third of the fabricated citations?? Classic academic protectionism. These open-access publishers are collecting millions in fees while pumping out literal garbage. And we're supposed to trust the peer review process? The same process that let 146,000 fake citations through? FOLLOW THE MONEY.

22 ↑