The Times Australia

Eat a rock a day, put glue on your pizza: how Google’s AI is losing touch with reality

  • Written by Toby Walsh, Professor of AI, Research Group Leader, UNSW Sydney
Google has rolled out its latest experimental search feature[1] on Chrome, Firefox and the Google app browser to hundreds of millions of users. “AI Overviews” saves you clicking on links by using generative AI — the same technology that powers rival product ChatGPT — to provide summaries of the search results. Ask “how to keep bananas fresh for longer” and it uses AI to generate a useful summary of tips such as storing them in a cool, dark place and away from other fruits like apples.

But ask it a left-field question and the results can be disastrous, or even dangerous. Google is currently scrambling to fix these problems one by one[2], but it is a PR disaster for the search giant and a challenging game of whack-a-mole.

Screenshots of Google AI Overviews recommending eating rocks and putting glue on pizza.
Google’s AI Overviews may damage the tech giant’s reputation for providing reliable results. Google / The Conversation

AI Overviews helpfully tells you that “Whack-A-Mole is a classic arcade game where players use a mallet to hit moles that pop up at random for points. The game was invented in Japan in 1975 by the amusement manufacturer TOGO and was originally called Mogura Taiji or Mogura Tataki.”

But AI Overviews also tells you that “astronauts have met cats on the moon[3], played with them, and provided care”. More worryingly, it also recommends “you should eat at least one small rock per day[4]” as “rocks are a vital source of minerals and vitamins”, and suggests putting glue in pizza topping[5].

Why is this happening?

One fundamental problem is that generative AI tools don’t know what is true, just what is popular. For example, there aren’t a lot of articles on the web about eating rocks as it is so self-evidently a bad idea.

There is, however, a well-read satirical article[6] from The Onion about eating rocks. And so Google’s AI based its summary on what was popular, not what was true.

Screenshots of results recommending putting gasoline in pasta and saying parachutes are ineffective.
Some AI Overview results appear to have mistaken jokes and parodies for factual information. Google / The Conversation

Another problem is that generative AI tools don’t have our values. They’re trained on a large chunk of the web.

And while sophisticated techniques (that go by exotic names such as “reinforcement learning from human feedback[7]” or RLHF) are used to eliminate the worst, it is unsurprising that these tools reflect some of the biases, conspiracy theories and worse to be found on the web. Indeed, I am always amazed at how polite and well-behaved AI chatbots are, given what they’re trained on.

Is this the future of search?

If this is really the future of search, then we’re in for a bumpy ride. Google is, of course, playing catch-up[8] with OpenAI and Microsoft.

The financial incentives to lead the AI race are immense[9]. Google is therefore being less prudent than in the past in pushing the technology out into users’ hands.

In 2023, Google chief executive Sundar Pichai said[10]:

“We’ve been cautious. There are areas where we’ve chosen not to be the first to put a product out. We’ve set up good structures around responsible AI. You will continue to see us take our time.”

That no longer appears to be true, as Google responds to criticisms[11] that it has become a large and lethargic competitor.

A risky move

It’s a risky strategy for Google. It risks losing the trust that the public has in Google being the place to find (correct) answers to questions.

But Google also risks undermining its own billion-dollar business model. If we no longer click on links, just read their summary, how does Google continue to make money?

The risks are not restricted to Google. I fear such use of AI might be harmful for society more broadly. Truth is already a somewhat contested and fungible idea. AI untruths are likely to make this worse.

In a decade’s time, we may look back at 2024 as the golden age of the web, when most of it was quality human-generated content, before the bots took over and filled the web[12] with synthetic and increasingly low-quality AI-generated content[13].

Has AI started breathing its own exhaust?

The second generation of large language models is likely being trained, unintentionally, on some of the outputs of the first generation[14]. And lots of AI startups are touting the benefits of training on synthetic, AI-generated data[15].

But training on the exhaust fumes of current AI models risks amplifying even small biases and errors[16]. Just as breathing in exhaust fumes is bad for humans, it is bad for AI.
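To see why repeated training on model output can magnify a small skew, here is a toy sketch in Python. It is an illustration only, not how any real language model is trained. It assumes a hypothetical "sharpening" factor that makes each generation slightly over-represent whatever view was already in the majority in the previous generation's output. The function names and the 1.2 factor are made up for the example.

```python
def next_generation(share: float, sharpening: float = 1.2) -> float:
    """Majority share after one more round of training on model output.

    `sharpening` is a hypothetical knob: values above 1 mean the new model
    slightly exaggerates whichever view was already more common.
    """
    majority = share ** sharpening
    minority = (1.0 - share) ** sharpening
    return majority / (majority + minority)


def simulate(initial_share: float, generations: int,
             sharpening: float = 1.2) -> list[float]:
    """Track the majority share across successive model generations."""
    shares = [initial_share]
    for _ in range(generations):
        shares.append(next_generation(shares[-1], sharpening))
    return shares


if __name__ == "__main__":
    # Start with a mild 55/45 skew in the original human-written data.
    for generation, share in enumerate(simulate(0.55, 10)):
        print(f"generation {generation}: majority share = {share:.3f}")
```

In this toy setting a perfectly balanced 50/50 split stays balanced, but any initial skew grows with every generation. That is the sense in which small biases can compound when models feed on their own output.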

These concerns fit into a much bigger picture. Globally, more than US$400 million[17] (A$600 million) is being invested in AI every day. And governments are only now waking up to the idea that we might need guardrails and regulation to ensure AI is used responsibly, given this torrent of investment.

Pharmaceutical companies aren’t allowed to release drugs that are harmful. Nor are car companies. But so far, tech companies have largely been allowed to do what they like.

References

  1. latest experimental search feature (blog.google)
  2. fix these problems one by one (www.theverge.com)
  3. astronauts have met cats on the moon (www.smh.com.au)
  4. eat at least one small rock per day (www.reddit.com)
  5. glue in pizza topping (x.com)
  6. satirical article (www.theonion.com)
  7. reinforcement learning from human feedback (huggingface.co)
  8. playing catch-up (www.theinformation.com)
  9. immense (www.bloomberg.com)
  10. said (www.bloomberg.com)
  11. Google responds to criticisms (stratechery.com)
  12. filled the web (www.theatlantic.com)
  13. AI-generated content (www.theguardian.com)
  14. outputs of the first generation (www.newyorker.com)
  15. synthetic, AI-generated data (www.nytimes.com)
  16. risks amplifying even small biases and errors (www.theregister.com)
  17. more than US$400 million (www.idc.com)

Read more https://theconversation.com/eat-a-rock-a-day-put-glue-on-your-pizza-how-googles-ai-is-losing-touch-with-reality-230953
