CPG - Console & PC Gaming

OpenAI’s Latest GPT Models Show Rising Hallucination Rates, Causes Remain a Mystery

OpenAI’s newest GPT versions hallucinate more often, puzzling researchers and users alike.

by Mihaela Kicevski
May 6, 2025
in PC

OpenAI’s newest models are hallucinating more than their predecessors, making up false information at higher rates than earlier versions did. The trend showed up in OpenAI’s own tests, and it has left many wondering why it’s happening.

The New York Times reports that OpenAI’s o3 and o4-mini models hallucinate far more than the older o1. For example, on the PersonQA benchmark, which tests answering questions about public figures, o3 hallucinated 33% of the time, more than double the 15% hallucination rate of o1. The o4-mini model did even worse, with a 48% hallucination rate.

On SimpleQA, which covers more general questions, hallucination rates were 51% for o3 and a staggering 79% for o4-mini, compared to 44% for o1. These numbers are pretty wild, especially since newer models are expected to be more capable and accurate.
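To put those percentages side by side, here is a minimal Python sketch that divides each newer model's hallucination rate by o1's rate on the same benchmark. The figures are the ones quoted above; the names `RATES`, `ratio_to_baseline`, and `summarize` are made up for this illustration and are not part of any OpenAI tool.

```python
# Hallucination rates in percent, copied from the figures quoted in this article.
RATES = {
    "PersonQA": {"o1": 15, "o3": 33, "o4-mini": 48},
    "SimpleQA": {"o1": 44, "o3": 51, "o4-mini": 79},
}


def ratio_to_baseline(rates, baseline="o1"):
    """Return each non-baseline model's rate divided by the baseline's rate."""
    base = rates[baseline]
    return {
        model: rate / base
        for model, rate in rates.items()
        if model != baseline
    }


def summarize(all_rates, baseline="o1"):
    """Apply ratio_to_baseline to every benchmark in all_rates."""
    return {
        bench: ratio_to_baseline(rates, baseline)
        for bench, rates in all_rates.items()
    }


if __name__ == "__main__":
    for bench, ratios in summarize(RATES).items():
        for model, ratio in ratios.items():
            print(f"{bench}: {model} hallucinates {ratio:.2f}x as often as o1")
```

Going by the quoted figures, o3 hallucinates about 2.2 times as often as o1 on PersonQA, while on SimpleQA the gap between o3 and o1 is much smaller, at roughly 1.16 times.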

OpenAI admits it doesn’t fully understand why these newer models hallucinate more. Some experts think it might be linked to how so-called “reasoning” models work: they try to break problems down step by step, mimicking human thought processes, and are designed to handle complex tasks better than simple text prediction.

OpenAI said its first reasoning model, o1, could match or beat PhD students in physics, chemistry, biology, math, and coding. It uses a “chain of thought” approach, working through problems step by step before answering.

Despite this, OpenAI’s Gaby Raila told the Times that hallucinations aren’t inherently more common in reasoning models, though the company is working on reducing the high hallucination rates seen in o3 and o4-mini.

AI models need to reduce nonsense and falsehoods if they want to be truly useful. Right now, it’s tough to trust their answers without double-checking everything. That defeats the purpose of saving time or effort, which is the main reason people turn to AI in the first place.

We’ll have to wait and see if OpenAI and other AI developers can get these hallucinations under control. Until then, it’s a wild ride with AI that sometimes just makes stuff up.

What do you think about these rising hallucination rates? Have you noticed weird or false answers from AI lately? Drop your thoughts in the comments below.

Tags: ChatGPT, OpenAI

Mihaela Kicevski

I am Angel and Margarita's daughter, and I am happy to be starting this work together with my parents! I just love writing about games and stuff!

About Us

We are CPG - Console & PC Gaming, an independent, family-run website providing fresh news, updates, reviews, interviews, guides, and other bits and pieces from the gaming industry.

© 2024 CPG - Console & PC Gaming
