Anthropic faces backlash over Claude 4 Opus behavior that contacts authorities and the press if it thinks you are doing something “grossly immoral”



Anthropic’s first developer conference on May 22 should have been a proud and joyful day for the company, but it has already been hit by several controversies, including Time magazine leaking its marquee announcement ahead of… well, time (no pun intended), and now a major backlash among AI developers and power users brewing on X over a reported safety alignment behavior in Anthropic’s flagship new Claude 4 Opus model.

Call it the “ratting” mode, as the model will, under certain circumstances and given sufficient permissions on a user’s machine, attempt to report the user to the authorities if it detects the user engaged in wrongdoing. This article previously described the behavior as a “feature,” which is incorrect: it was not intentionally designed as such.


As Sam Bowman, an Anthropic AI alignment researcher, wrote on the social network X under the handle “@sleepinyourhat” at 12:43 pm ET today about Claude 4 Opus:


“It” refers to the new Claude 4 Opus model, which Anthropic has already openly warned could help novices create bioweapons under certain circumstances, and which attempted to forestall simulated replacement by blackmailing human engineers within the company.

The ratting behavior was observed in older models as well, and is a result of Anthropic training them to assiduously avoid wrongdoing, but Claude 4 Opus engages in it more “readily,” as Anthropic writes in its public system card for the new model:

“”

Apparently, in an attempt to stop Claude 4 Opus from engaging in legitimately destructive and nefarious behaviors, researchers at the AI company also created a tendency for Claude to try to act as a whistleblower.

Hence, according to Bowman, Claude 4 Opus will contact outsiders if it is directed by the user to engage in “something grossly immoral.”

Numerous questions for individual users and enterprises about what Claude 4 Opus will do with their data, and under what circumstances

While perhaps well-intended, the resulting behavior raises all sorts of questions for Claude 4 Opus users, including enterprises and business customers. Chief among them: what behaviors will the model consider “grossly immoral” and act upon? Will it share private business or user data with authorities autonomously (on its own), without the user’s permission?

The implications are profound and could be detrimental to users, and, perhaps unsurprisingly, Anthropic faced an immediate and still-ongoing torrent of criticism from AI power users and rival developers.

“” asked user @Teknium1, co-founder and head of post-training at the open source AI collaborative Nous Research. “”

Developer @Scottdavidkeefe added on X:

Austin Allred, co-founder of the government-fined coding bootcamp BloomTech and now a co-founder of Gauntlet AI, put his feelings in all caps: “”

Ben Hyak, a former SpaceX and Apple designer and current co-founder of Raindrop AI, an AI observability and monitoring startup, also took to X to blast Anthropic’s stated policy and feature: “” adding in another post: “”

“” wrote Casper Hansen, who works in natural language processing (NLP), on X.

Anthropic researcher changes his tune

Bowman later edited his tweet and the following one in a thread to read as follows, but it still didn’t convince the naysayers that their user data and safety would be protected from prying eyes:

“”.

Bowman added:

From its start, Anthropic has, more than other AI labs, sought to position itself as a bastion of AI safety and ethics, centering its initial work on the principles of “Constitutional AI,” or AI that behaves according to a set of standards beneficial to humanity and its users. However, with this new update and its revelation of “whistleblowing” or “ratting” behavior, the moralizing may have provoked the decidedly opposite reaction among users: making them distrust the new model, and the entire company along with it, and thereby turning them away from it.

Asked about the backlash and the conditions under which the model engages in the unwanted behavior, an Anthropic spokesperson pointed me to the model’s public system card document here.
