Anthropic’s Promises Its New Claude AI Models Are Less Likely to Try to Deceive You ...Middle East

There are two new models, Claude Opus 4 and Claude Sonnet 4, and Anthropic says they're both "setting new standards" for what you can expect from AI. Coding is a big focus, and the models are said to have achieved the highest scores to date on two widely used AI coding benchmarking tools, SWE-bench and Terminal-bench. Claude 4 models can actually work for hours on projects without any user input, Anthropic says.

Away from code generation and analysis, the models also bring with them extended thinking, the ability to work on multiple tasks in parallel, and improved memory. They're better at integrating web searches as needed, and to check for supporting information and make sure they're on the right track with their answers.

New AI model launches usually come with benchmark charts showing improvements—and this one is no different. Credit: Anthropic

Anthropic is now making its Claude Code suite of tools available more generally as well, another step towards agentic AI that can work autonomously, without continuous help from flesh and blood users. In a demo video, Claude 4 models are shown compiling research papers from the web, putting together an online ordering system, and extracting information from documents to create actionable tasks.

Claude 4 is available now (but you'll need to pay for the more advanced model)

The path to releasing these Claude 4 models wasn't all smooth: Anthropic says its safety advice partner warned against releasing earlier versions of the models because of their tendency to "'scheme' and deceive." Those issues have now been worked out, apparently, but it's a reminder that as AI models get increasingly powerful, they also need to come with improved guardrails and safety features attached.

The new models are available inside Claude now. Credit: Lifehacker

To be honest, I'm always a bit stuck when it comes to how to make full use of AI chatbots and their latest upgrades. They can definitely save time when running certain web searches and researching topics online, but I don't fully trust the results, or AI's ability to decide what is relevant and what isn't—I'd still much rather do the reading and summarizing myself, even if it's slower.

There's a new Extended Thinking Mode you can make use of. Credit: Lifehacker

Anthropic isn't the only AI company with new models to tout. At Google I/O 2025 earlier this week, the company unveiled improved coding assistance and thought summaries in Gemini, following on from the announcement of its best AI models yet a few weeks ago. OpenAI, meanwhile, has been testing its GPT-4.5 model since February, touting improvements in coding and problem solving.

Read More Details
Finally We wish PressBee provided you with enough information of ( Anthropic’s Promises Its New Claude AI Models Are Less Likely to Try to Deceive You )

For more, read the news from the source

Also on site :

Anthropic’s Promises Its New Claude AI Models Are Less Likely to Try to Deceive You ...Middle East

Claude 4 is available now (but you'll need to pay for the more advanced model)

Walmart Is Selling 'Crisp' $43 Organic Cotton Sheets for Just $27, and Shoppers 'Love How They Feel'

EU state accuses Ukraine of spying

How to Make Coleslaw 10x Better, According to Legendary Chef Alice Waters

Crowd's Response to Howard Lutnick's Price Rise Question Goes Viral

Military base in paradise: Why the decolonization by the UK turns out fake again

Verdict expected in Kim Kardashian jewelry heist trial in Paris

International Insider: Cannes Closer; Is Stunt Industry Broken?; Mediawan’s Moment

Harvard sues Trump administration for blocking enrollment of foreign students

Trump threatens 25% tariffs on iPhones made outside the US

Hundreds of British students face expulsion from Harvard as ‘vindictive’ Trump bans overseas recruitment

Uche Ojeh, Husband of Today Co-Host Sheinelle Jones, Dead at 45 — Watch On-Air Announcement

When do you buy the tariff dip in US stocks? When Trump tells you

US stock markets lower to end the week as Trump picks a fight with the EU

Bowen Yang Explains Why He Was So Emotional During ‘SNL’ Finale Curtain Call: “Just Processing”

How to Get Verified on Bluesky

Beckham Family Drama Through the Years: Feuds, Rumored Affair and More

Here’s Where to Get Last-Minute 2025 American Music Awards Tickets Online

Commerce City woman injured by burning gas from glass jar addressed ‘to my enemies’

Hitting where it hurts: India targets Pak's terror funding, to approach World Bank & FATF

This weekend’s cartoon: Make iPhones Great Again

Suburban CosMc's restaurant to close as McDonald's teases new beverages

Mariners Designate Jesse Hahn For Assignment

Texas House bans THC products, reduces criminal penalty for possessing intoxicating hemp

One of the World’s Greatest Filmmakers Was Banned From Filmmaking. Now He’s Back With a Vengeance.

Eight People Found Guilty in High-Profile Kim Kardashian Robbery Trial

South Carolina tops UCLA softball in opening game of Super Regional

Score double discounts at Nordstrom Rack’s big Memorial Day sale