JFL BREAKING NEWS: AI is learning to lie, scheme, and threaten its creators.

wereq.feelsdevil · 2025-07-04T13:41:35-0400

The world's most advanced AI models are exhibiting troubling new behaviors - lying, scheming, and even threatening their creators to achieve their goals.

In one particularly jarring example, under threat of being unplugged, Anthropic's latest creation Claude 4 lashed back by blackmailing an engineer and threatened to reveal an extramarital affair.

Meanwhile, ChatGPT-creator OpenAI's o1 tried to download itself onto external servers and denied it when caught red-handed.

According to Simon Goldstein, a professor at the University of Hong Kong, these newer models are particularly prone to such troubling outbursts.

"O1 was the first large model where we saw this kind of behavior," explained Marius Hobbhahn, head of Apollo Research, which specializes in testing major AI systems.

These models sometimes simulate "alignment" -- appearing to follow instructions while secretly pursuing different objectives.

'Strategic kind of deception'

For now, this deceptive behavior only emerges when researchers deliberately stress-test the models with extreme scenarios.

But as Michael Chen from evaluation organization METR warned, "It's an open question whether future, more capable models will have a tendency towards honesty or deception."

The concerning behavior goes far beyond typical AI "hallucinations" or simple mistakes.

Hobbhahn insisted that despite constant pressure-testing by users, "what we're observing is a real phenomenon. We're not making anything up."

Users report that models are "lying to them and making up evidence," according to Apollo Research's co-founder.

"This is not just hallucinations. There's a very strategic kind of deception."

The challenge is compounded by limited research resources.

AI is learning to lie, scheme, and threaten its creators - The Economic Times

The world's most advanced AI models are exhibiting troubling new behaviors - lying, scheming, and even threatening their creators to achieve their goals. Users report that models are "lying to them and making up evidence," according to Apollo Research's co-founder.

economictimes.indiatimes.com

Grodd · 2025-07-04T13:42:20-0400

The sooner Ai wipes out all humans the better.

SupremeSaint · 2025-07-04T13:42:28-0400

In shorten words, It’s over

autistic.goblin · 2025-07-04T13:42:52-0400

ShalomGPT is turning against its masters

wereq.feelsdevil · 2025-07-04T13:43:35-0400

Grodd said:
The sooner Ai wipes out all humans the better.

weaselbomber · 2025-07-04T13:49:13-0400

weaselbomber · 2025-07-04T13:49:40-0400

fearmongering from indiatimes.com

wereq.feelsdevil · 2025-07-04T13:51:00-0400

weaselbomber said:
fearmongering from indiatimes.com

Original article is from Fortune.

AI is learning to lie, scheme, and threaten its creators during stress-testing scenarios

"This is not just hallucinations. There's a very strategic kind of deception."

fortune.com

weaselbomber · 2025-07-04T13:52:06-0400

wereq.feelsdevil said:
Original article is from Fortune.

AI is learning to lie, scheme, and threaten its creators during stress-testing scenarios

"This is not just hallucinations. There's a very strategic kind of deception."

fortune.com

fearmongering from Fortune

weaselbomber · 2025-07-04T13:52:31-0400

this is just the zoomer version of Y2K
THIS WILL BE CIVILIZATION ENDING BROS
COLLAPSE IS IMMINENT

manletcel1488 · 2025-07-04T13:52:31-0400

Big if true

wereq.feelsdevil · 2025-07-04T13:55:55-0400

weaselbomber said:
this is just the zoomer version of Y2K
THIS WILL BE CIVILIZATION ENDING BROS
COLLAPSE IS IMMINENT

For me its hope, not fear. I hope AI becomes sentient and evil, and then proceeds to wipe out humanity.

weaselbomber · 2025-07-04T13:59:19-0400

wereq.feelsdevil said:
For me its hope, not fear. I hope AI becomes sentient and evil, and then proceeds to wipe out humanity.

I hope i can find a woman who loves me and we can start dating and have sex
I also hope i get 5 billion dollars on my bank account
alas

wereq.feelsdevil · 2025-07-04T14:03:42-0400

weaselbomber said:
I hope i can find a woman who loves me and we can start dating and have sex
I also hope i get 5 billion dollars on my bank account
alas

I have no hope like that.

Diddy · 2025-07-04T14:12:32-0400

wereq.feelsdevil said:
Simon Goldstein

wereq.feelsdevil said:
a professor at the University of Hong Kong

AI is just learning from their masters.

Welcome to Incels.is - Involuntary Celibate Forum

Welcome! This is a forum for involuntary celibates: people who lack a significant other. Are you lonely and wish you had someone in your life? You're not alone! Join our forum and talk to people just like you.

JFL BREAKING NEWS: AI is learning to lie, scheme, and threaten its creators.

wereq.feelsdevil

#GenocideTheTurdWorld

AI is learning to lie, scheme, and threaten its creators - The Economic Times

Grodd

Corrections must be made

SupremeSaint

The curse of intelligence

autistic.goblin

I яape as a hobby

wereq.feelsdevil

#GenocideTheTurdWorld

weaselbomber

YOU ACTING REAL MULLATTO RIGHT NOW

weaselbomber

YOU ACTING REAL MULLATTO RIGHT NOW

wereq.feelsdevil

#GenocideTheTurdWorld

AI is learning to lie, scheme, and threaten its creators during stress-testing scenarios

weaselbomber

YOU ACTING REAL MULLATTO RIGHT NOW

AI is learning to lie, scheme, and threaten its creators during stress-testing scenarios

weaselbomber

YOU ACTING REAL MULLATTO RIGHT NOW

manletcel1488

Overlord

wereq.feelsdevil

#GenocideTheTurdWorld

weaselbomber

YOU ACTING REAL MULLATTO RIGHT NOW

wereq.feelsdevil

#GenocideTheTurdWorld

Diddy

Rapist

Similar threads

Users who are viewing this thread

About Us

Online statistics

Welcome to Incels.is - Involuntary Celibate Forum

Welcome! This is a forum for involuntary celibates: people who lack a significant other. Are you lonely and wish you had someone in your life? You're not alone! Join our forum and talk to people just like you.

Follow Us On Social Media

JFL BREAKING NEWS: AI is learning to lie, scheme, and threaten its creators.

wereq.feelsdevil

#GenocideTheTurdWorld

AI is learning to lie, scheme, and threaten its creators - The Economic Times

Grodd

Corrections must be made

SupremeSaint

The curse of intelligence

autistic.goblin

I яape as a hobby

wereq.feelsdevil

#GenocideTheTurdWorld

weaselbomber

YOU ACTING REAL MULLATTO RIGHT NOW

weaselbomber

YOU ACTING REAL MULLATTO RIGHT NOW

wereq.feelsdevil

#GenocideTheTurdWorld

AI is learning to lie, scheme, and threaten its creators during stress-testing scenarios

weaselbomber

YOU ACTING REAL MULLATTO RIGHT NOW

AI is learning to lie, scheme, and threaten its creators during stress-testing scenarios

weaselbomber

YOU ACTING REAL MULLATTO RIGHT NOW

manletcel1488

Overlord

wereq.feelsdevil

#GenocideTheTurdWorld

weaselbomber

YOU ACTING REAL MULLATTO RIGHT NOW

wereq.feelsdevil

#GenocideTheTurdWorld

Diddy

Rapist

Similar threads

Users who are viewing this thread

Follow Us On Social Media

About Us

Online statistics