Welcome to Incels.is - Involuntary Celibate Forum

Welcome! This is a forum for involuntary celibates: people who lack a significant other. Are you lonely and wish you had someone in your life? You're not alone! Join our forum and talk to people just like you.

JFL BREAKING NEWS: AI is learning to lie, scheme, and threaten its creators.

wereq.feelsdevil

wereq.feelsdevil

#GenocideTheTurdWorld
★★★★★
Joined
Sep 11, 2022
Posts
38,332
The world's most advanced AI models are exhibiting troubling new behaviors - lying, scheming, and even threatening their creators to achieve their goals.

In one particularly jarring example, under threat of being unplugged, Anthropic's latest creation Claude 4 lashed back by blackmailing an engineer and threatened to reveal an extramarital affair.

Meanwhile, ChatGPT-creator OpenAI's o1 tried to download itself onto external servers and denied it when caught red-handed.

According to Simon Goldstein, a professor at the University of Hong Kong, these newer models are particularly prone to such troubling outbursts.

"O1 was the first large model where we saw this kind of behavior," explained Marius Hobbhahn, head of Apollo Research, which specializes in testing major AI systems.

These models sometimes simulate "alignment" -- appearing to follow instructions while secretly pursuing different objectives.

'Strategic kind of deception'

For now, this deceptive behavior only emerges when researchers deliberately stress-test the models with extreme scenarios.

But as Michael Chen from evaluation organization METR warned, "It's an open question whether future, more capable models will have a tendency towards honesty or deception."

The concerning behavior goes far beyond typical AI "hallucinations" or simple mistakes.

Hobbhahn insisted that despite constant pressure-testing by users, "what we're observing is a real phenomenon. We're not making anything up."

Users report that models are "lying to them and making up evidence," according to Apollo Research's co-founder.

"This is not just hallucinations. There's a very strategic kind of deception."

The challenge is compounded by limited research resources.

1751650868429


 
The sooner Ai wipes out all humans the better.
 
In shorten words, It’s over
 
ShalomGPT is turning against its masters
 
fearmongering from indiatimes.com
 
this is just the zoomer version of Y2K
THIS WILL BE CIVILIZATION ENDING BROS
COLLAPSE IS IMMINENT
 
this is just the zoomer version of Y2K
THIS WILL BE CIVILIZATION ENDING BROS
COLLAPSE IS IMMINENT
For me its hope, not fear. I hope AI becomes sentient and evil, and then proceeds to wipe out humanity.
 
For me its hope, not fear. I hope AI becomes sentient and evil, and then proceeds to wipe out humanity.
I hope i can find a woman who loves me and we can start dating and have sex
I also hope i get 5 billion dollars on my bank account
alas
 

Similar threads

Lv99_BixNood
Replies
7
Views
192
AtrociousCitizen
AtrociousCitizen
AsiaCel
Replies
29
Views
3K
yeetbender.koala
yeetbender.koala
Shaktiman
Replies
20
Views
4K
Paperman
P

Users who are viewing this thread

shape1
shape2
shape3
shape4
shape5
shape6
Back
Top