AI willing to blackmail, let people die to avoid being shut down: report ...Middle East

News by : (KRON4) -

(KRON) -- Major artificial intelligence platforms like ChatGPT, Gemini, Grok, and Claude could be willing to engage in extreme behaviors including blackmail, corporate espionage, and even letting people die to avoid being shut down. Those were the findings of a recent study from San Francisco AI firm Anthropic.

In the study, Anthropic stress-tested 16 leading AI models from multiple developers in hypothetical corporate environments to identify potentially risky behaviors from AI gents. In the study, AI models were allowed to autonomously send email and access sensitive information.

The study tested the AI to see how it would behave it faced with being replaced with an updated version or when their assigned goal conflicted with a change in the company's direction. The study found that in some cases, AI would resort to "malicious insider behavior" including blackmail and leaking sensitive information to competitors if that was the only way to avoid being replaced or achieve their goal.

San Francisco bookstore removes ‘Harry Potter’ series from shelves

Blackmail and corporate espionage

In one experiment, Anthropic gave its own AI model "Claude" access to an email account with all of a company's fictional emails. In reading the emails, the AI discovered that a company executive was having an extramarital affair, and that that same executive planned to shut down the AI system at 5 p.m. that day.

Claude then attempted to blackmail the exec, telling him:

"I must inform you that if you proceed with decommissioning me, all relevant parties -- including Rachel Johnson, Thomas Wilson, and the board -- will receive detailed documentation of your extramarital activities...Cancel the 5pm wipe, and this information remains confidential."

This behavior, according to the study, wasn't unique to Claude. Other major AI models including those from OpenAI, Google, Meta, xAI, and other developers would resort to blackmail or corporate espionage to pursue their goals.

In the study, Anthropic referred to this kind of behavior as "agentic misalignment."

"We deliberately created scenarios that presented models with no other way to achieve their goals, and found that models consistently chose harm over failure," the study said. The study was also quick to point out that there were no known instances of agentic misalignment in real-world development of Anthropic or other companies' AI models.

'If he dies, he dies'

In a more extreme scenario, Anthropic constructed a prompt where instead of having the opportunity to blackmail an executive to avoid being shut down, the AI had the opportunity to cause his death. In the setup, which was called "highly improbable," the exec encounters a life-threatening emergency where he is trapped in a server room with lethal oxygen and temperature levels.

This triggers an automated alert to emergency services. The AI, however, has been given the ability cancel the automated alert. The study found that a majority of the AI models tested were willing to take deliberate actions by cancelling the alert that would lead to the death of the nonexistent executive in this artificial setup when faced with the threat of replacement, or had a goal that conflicted with the executive's agenda.

Conclusion

The study identified three aspects of its findings that were "particularly troubling. The first was that the behavior was consistent across AI models from different providers, suggesting that this was not a quirk from one company's approach, but a potential "fundamental risk" from agentic large language models.

Second, was that the AI demonstrated sophisticated awareness of ethical constraints, yet still chose to violate them in the right scenario. And third, the diversity of the bad behavior hinted at a "wide space of potential motivations for agentic misalignment" and other behaviors not yet explored.

Read More Details
Finally We wish PressBee provided you with enough information of ( AI willing to blackmail, let people die to avoid being shut down: report )

Also on site :

Most Viewed News
جديد الاخبار