Console & PC Gaming
  • Home
  • News
  • PC
  • PS5
  • Xbox
  • Switch
  • Mobile
  • Reviews
  • Esports
  • Guides
    • Lost Ark
    • Gray Zone Warfare
      • Artisan
      • Banshee
      • Gunny
      • Handshake
      • Lab Rat
      • Turncoat
    • Escape From Tarkov
      • Jaeger
      • Mechanic
        • Gunsmith
      • Peacekeeper
      • Prapor
      • Ragman
      • Skier
      • Therapist
No Result
View All Result
Console & PC Gaming
  • Home
  • News
  • PC
  • PS5
  • Xbox
  • Switch
  • Mobile
  • Reviews
  • Esports
  • Guides
    • Lost Ark
    • Gray Zone Warfare
      • Artisan
      • Banshee
      • Gunny
      • Handshake
      • Lab Rat
      • Turncoat
    • Escape From Tarkov
      • Jaeger
      • Mechanic
        • Gunsmith
      • Peacekeeper
      • Prapor
      • Ragman
      • Skier
      • Therapist
No Result
View All Result
Console & PC Gaming
No Result
View All Result
Home PC

OpenAI’s Latest GPT Models Show Rising Hallucination Rates, Causes Remain a Mystery

OpenAI’s newest GPT versions hallucinate more often, puzzling researchers and users alike.

Mihaela Kicevski by Mihaela Kicevski
May 6, 2025
in PC
0

OpenAI’s newest GPT models are struggling more than ever with hallucinations, making up false information at rates that have increased compared to earlier versions. OpenAI’s tests revealed this surprising trend, leaving many wondering why it’s happening.

The New York Times reports that OpenAI’s GPT-3 and GPT-4-mini models hallucinate far more than the older GPT-1. For example, in the PersonQA benchmark, which tests answering questions about public figures, o3 hallucinated 33% of the time, more than double the 15% hallucination rate of o1. The o4-mini model did even worse, hitting 48% hallucination.

When tested with SimpleQA, which covers more general questions, hallucination rates jumped to 51% for o3 and a staggering 79% for o4-mini, compared to 44% for o1. These numbers are pretty wild, especially since newer models are expected to be more innovative and accurate.

OpenAI admits it doesn’t fully understand why these newer models hallucinate more. Some experts think it might be linked to the so-called “reasoning” models, which try to break down problems step-by-step, mimicking human thought processes. These models are designed to handle complex tasks better than simple text prediction.

OpenAI’s first reasoning model, o1, was praised for matching or beating PhD students in physics, chemistry, biology, math, and coding. It uses a “chain of thought” approach, thinking through problems carefully before answering.

Despite this, OpenAI’s Gaby Raila told the Times that hallucinations aren’t inherently worse in reasoning models, though they are working on reducing the high hallucination rates seen in o3 and o4-mini.

AI models need to reduce nonsense and falsehoods if they want to be truly useful. Right now, it’s tough to trust their answers without double-checking everything. That defeats the purpose of saving time or effort, which is the main reason people turn to AI in the first place.

We’ll have to wait and see if OpenAI and other AI developers can get these hallucinations under control. Until then, it’s a wild ride with AI that sometimes just makes stuff up.

What do you think about these rising hallucination rates? Have you noticed weird or false answers from AI lately? Drop your thoughts in the comments below.

Tags: ChatGPTOpenAI
ShareTweet
Previous Post

Xbox Game Pass May 2025 Wave 1 Adds DOOM: The Dark Ages, Revenge of the Savage Planet, and More

Next Post

Elden Ring Tarnished Edition for Nintendo Switch 2 Adds New Content and Features

Mihaela Kicevski

Mihaela Kicevski

I am Angel's and Margarita's daughter, and I am happy to be starting this work together with my parents! I can't wait to see where this takes us!

RELATEDPOSTS

News

Sam Altman says Google is still a huge threat and ChatGPT will call code red maybe twice a year

December 19, 2025
News

Sam Altman orders a code red at OpenAI after Google’s Gemini 3 rattles the company

December 3, 2025
News

Japanese trade group asks OpenAI to stop using Studio Ghibli and other members’ work to train Sora 2

November 4, 2025
News

OpenAI launches ChatGPT Atlas browser – Mac users get first access

October 22, 2025
News

OpenAI’s Sam Altman says ChatGPT will allow ‘erotica for verified adults’ in December

October 14, 2025
News

OpenAI backs off Sora 2 copyright approach, will require rightsholders to opt in

October 5, 2025

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Upcoming Games

  1. Team Fortress 2 Classified

    Team Fortress 2 Classified

    Releases January 30, 2026 in 1 hr 2 min

    PC (Microsoft Windows)

  2. RLLL: Tower of Choices

    RLLL: Tower of Choices

    Releases January 30, 2026 in 1 hr 2 min

    PC (Microsoft Windows)

  3. The 9th Charnel

    The 9th Charnel

    Releases January 30, 2026 in 1 hr 2 min

    Xbox Series X|S, PC (Microsoft Windows), PlayStation 5

  4. Legends of Celestite: The All Bearer

    Legends of Celestite: The All Bearer

    Releases January 30, 2026 in 1 hr 2 min

    PC (Microsoft Windows)

  5. Shadows of Soldiers

    Shadows of Soldiers

    Releases January 30, 2026 in 1 hr 2 min

    PC (Microsoft Windows)

View full release calendar →

  • Trending
  • Comments
  • Latest

Battlestate Games shares results from the in-game Survey about the Flea Market

January 30, 2025 - Updated on November 25, 2025

Escape From Tarkov 2025 Roadmap Revealed, Full Release Finally Confirmed

April 18, 2025 - Updated on November 24, 2025
Tarkov patch 0.15

Escape From Tarkov reveals the 0.15 trailer before wipe

August 14, 2024

Minecraft Update 1.21.21 Patch Notes for August 14/15

August 14, 2024

Escape From Tarkov Best Graphics Settings – Updated With Patch 0.15.5

Escape From Tarkov: How to Snipe Flea Market Items Easily?

CoD: Warzone Season 2 Update Fixes Plenty of Bugs

MW2 and Warzone 2.0 Season 3 is full of Bugs and Issues, Upcoming Fixes and more

How every currency works in Arknights: Endfield

January 29, 2026

Player recycles 109 Venator 4s live after Headwinds nerf, Embark rolls the change back

January 29, 2026

Niantic is adding street corner and bus stop Pokéstops across the US

January 28, 2026

The latest Monster Hunter Wilds PC patch actually seems to fix performance

January 28, 2026

CPGPATCH NOTES

Patch Notes

War Thunder update 2.53.0.63 fixes radar tracking, crew UI and graphics issues

by Angel Kicevski
January 26, 2026
Patch Notes

ICARUS hotfix 2.3.27 fixes Workshop Calves spawn lock

by Angel Kicevski
January 24, 2026
News

Icarus Week 216 Update adds consolidated lights and previews creature genetics

by Angel Kicevski
January 23, 2026

About Us

We are CPG - Console & PC Gaming, an independent, family-run website providing fresh news, updates, reviews, interviews, guides, and other bits and pieces from the gaming industry.

Read more

  • About Us – Our Story
  • Privacy Policy
  • Contact

© 2025 CPG - Console & PC Gaming

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
No Result
View All Result
  • Home
  • News
  • PC
  • PS5
  • Xbox
  • Switch
  • Mobile
  • Reviews
  • Esports
  • Guides
    • Lost Ark
    • Gray Zone Warfare
      • Artisan
      • Banshee
      • Gunny
      • Handshake
      • Lab Rat
      • Turncoat
    • Escape From Tarkov
      • Jaeger
      • Mechanic
      • Peacekeeper
      • Prapor
      • Ragman
      • Skier
      • Therapist

© 2025 CPG - Console & PC Gaming