OpenAI Promises the Next Model of ChatGPT Will Be Better at Reasoning ...Middle East

OpenAI CEO Sam Altman described o3 as "incredibly smart" in the video announcing the model, released as part of his company's "12 Days of OpenAI" promotion over the holiday season. The model is undergoing a variety of safety tests before it launches in full—first likely only for paying ChatGPT Plus users.

The o3 model is more than 20 percent better than the previous o1 model at coding, per the SWE-bench Verified benchmark, OpenAI says. It also scores strongly on math and science problems, at least according to benchmark tests—like o1, the o3 model is trained to think and reason before it answers, rigorously testing its responses for accuracy. OpenAI will also release a smaller, faster o3-mini model alongside the main update.

The pattern of completing squares with a darker blue square is simple for humans, but hard for AI—and it's a challenge that's part of ARC. Credit: ARC

This challenge gets AI to come up with new approaches to problems, rather than just relying on its memory, and involves a series of visual tasks for models to complete. They must match patterns in colored grids, exercises intended to be easy for people to complete without any training, but hard for AI to figure out.

Within the computing power boundaries of the ARC test, o3 scored 75.7%. That's way above the 5% achieved by the GPT-4o model, currently the best ChatGPT model available to free users. While we're still some way short of AGI (the model is still below human scores, and couldn't complete all the tasks), that's an impressive step up.

o1 and o1-mini are currently available to ChatGPT Plus users. Credit: Lifehacker

Predictably, OpenAI didn't talk about the energy demands of AI, the ethics of training AI on publicly available data that may be copyrighted, or the tendency for these models to hallucinate wrong answers—while mistakes should be fewer because of o3's extra thinking time, they won't be eradicated. What the company did mention is an expansion of its safety testing program, designed to prevent these models from being used for malicious purposes.

The ability for AI models to truly "think" or "reason"—or at least attempt some approximation of those human capabilities—will no doubt continue to be discussed as AI development progresses. Google has also just unveiled its Gemini 2.0 model, which brings with it improved reasoning.

Read More Details
Finally We wish PressBee provided you with enough information of ( OpenAI Promises the Next Model of ChatGPT Will Be Better at Reasoning )

For more, read the news from the source

Also on site :

OpenAI Promises the Next Model of ChatGPT Will Be Better at Reasoning ...Middle East

Awkward! Tom Brady Gets Booed At Indy 500

Fact-checking Trump’s claims about Medicaid cuts in GOP bill

Ukraine-Russia war latest: Trump envoy calls for ‘ceasefire now’ after Putin’s ‘shameful’ bombardment on Kyiv

'Situation is dire' - BBC returns to Gaza baby left hungry by Israeli blockade

Is Michaels Open on Memorial Day 2025?

International cyberbattle champions crowned at hack fest in Moscow

Aldi Is Selling Stylish $10 Bike Shorts Similar to Athleta and Free People Styles Almost 5x the Price

German e-mobility association files for bankruptcy – media

Update - Iran and US wrapped up a fifth round of nuclear talks - "constructive"

German politician urges talks on restoring Nord Stream

These are Dollar General's 2025 Memorial Day Hours

US citizen charged with trying to attack US embassy branch in Tel Aviv

Fed's Kashkari - Uncertainty top of mind for Fed, US businesses

Resilient Cal Poly staves off elimination 3 times to capture Big West Championship

5/25: CBS Weekend News

Ukraine-Russia war latest: Trump calls Putin ‘absolutely crazy’ after Moscow launches war’s largest air attack

Trump says Russian leader Putin 'has gone absolutely CRAZY!'

Taiwan to hold referendum on restarting closed nuclear reactor

Ohtani faces hitters for 1st time since elbow surgery

Do Japan's trees no longer occupy the sacred space they used to?

Společnost HONOR uvádí na trh řadu HONOR 400 s přední kamerou na bázi AI

Fallen Rochester Marine Corporal Reynold Armand honored at Coca-Cola 600 NASCAR race

Tom Brady Gives Fans a Glimpse of His Lock Screen at 'Favorite Time of Day'

IPL 2025: Klaasen’s ton, Malinga, and Dubey three-fers help SRH thrash KKR by 110 runs

Take a Gander at This! Trump Posts Pic of Friend Fighting Off a Grabby Golf Course Goose at Bedminster

Family looking for answers in hit-and-run that killed man celebrating his birthday

The Last of Us season 2 ending explained: Is [SPOILER] dead?