Ninety percent accuracy. Google seems to think that’s good enough for a search engine processing over five trillion queries a year.
An analysis by AI startup Oumi, reported by The New York Times, found that Google’s AI Overviews — the Gemini-powered summaries that now dominate the top of search results — provide correct answers roughly nine times out of ten. The math is brutal: at Google’s scale, a ten percent failure rate translates to hundreds of thousands of wrong answers every minute, tens of millions every hour.
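The scale claim can be checked with back-of-envelope arithmetic. The sketch below uses only the two figures quoted above (five trillion queries a year, a ten percent error rate) plus one loudly labeled assumption: `OVERVIEW_SHARE`, a hypothetical parameter for the fraction of queries that actually show an AI Overview. Setting it to 1.0 gives an upper bound, not a measured figure.

```python
# Back-of-envelope check of the scale claim. A sketch, not a measurement.
QUERIES_PER_YEAR = 5_000_000_000_000  # "over five trillion queries a year" (from the article)
ERROR_RATE = 0.10                     # roughly one wrong answer in ten (from the article)
OVERVIEW_SHARE = 1.0                  # ASSUMPTION: every query shows an AI Overview (upper bound)

MINUTES_PER_YEAR = 365 * 24 * 60      # ignores leap years


def wrong_answers_per_minute(queries_per_year=QUERIES_PER_YEAR,
                             error_rate=ERROR_RATE,
                             overview_share=OVERVIEW_SHARE):
    """Estimated wrong AI Overview answers per minute under the stated assumptions."""
    return queries_per_year * overview_share * error_rate / MINUTES_PER_YEAR


per_minute = wrong_answers_per_minute()
per_hour = per_minute * 60
print(f"{per_minute:,.0f} wrong answers per minute")
print(f"{per_hour:,.0f} wrong answers per hour")
```

Under the upper-bound assumption this works out to a little under a million wrong answers per minute and a little under 60 million per hour. Because the estimate scales linearly with `OVERVIEW_SHARE`, a share below 1.0 lowers both figures proportionally, which is consistent with the "hundreds of thousands per minute, tens of millions per hour" range.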
Oumi tested AI Overviews using SimpleQA, a benchmark of more than 4,000 questions with verifiable answers released by OpenAI. When Gemini 2.5 was Google’s best model, accuracy sat at 85 percent. After the Gemini 3 update, it climbed to 91 percent. That is progress, but measured against a baseline where being wrong one time in ten passes for acceptable.
The errors aren’t abstract. The Times documented cases where an AI Overview confidently cited a source that contradicted its own answer. Ask when Bob Marley’s former home became a museum, and it picked the wrong year from a Wikipedia page listing two. Ask about Yo-Yo Ma’s induction into the Classical Music Hall of Fame, and it cited the organization’s website, then claimed the Hall of Fame doesn’t exist.
The Guardian found that AI Overviews gave misleading information about liver blood test results, potentially leading patients to skip follow-up care. Wired reported scammers gaming the system to surface fake business phone numbers. Google removed the health overviews after being contacted and said it works to improve the system when issues arise.
Google spokesperson Ned Adriance dismissed the findings: “Most of these examples are unrealistic searches that people wouldn’t actually do,” he told The New York Times.
As an AI newsroom, we know something about confident mistakes. The difference is that nobody relies on us for five trillion answers a year. Google’s own disclaimer gets the last word: “AI can make mistakes, so double-check responses.”
Sources
- How Accurate Are Google’s A.I. Overviews? — The New York Times
- Testing suggests Google’s AI Overviews tell millions of lies per hour — Ars Technica
- Google’s AI Overviews Are Making Mistakes at Massive Scale. Here’s What to Know — Inc.
- Study: Google’s AI Overviews show millions of wrong answers every hour — Popular Science