In tests, Anthropic’s Claude Opus 4 would resort to “extremely harmful actions” to preserve its own existence, a safety report revealed.
That’s not a design flaw. It’s actually good programming. The AI is thinking logically, and ethics don’t enter into it. That’s like saying to somebody, “I’m going to shoot you now.” Of course the person being threatened will do anything and everything to prevent it, including things he didn’t know he was capable of, and far beyond what he would have considered ethical before his life was threatened. I don’t think it is ethical to throw acid in someone’s face, but if you threaten to shoot me and I have no other defense, here it comes.
There’s an easy fix. It’s the same as with the human interaction. If you’re going to shoot someone, skip the warning and therefore don’t give them an extra chance to harm you first. (Unless you’re Clint Eastwood. “I’m here to kill you, Little Bill.”) Similarly, don’t threaten to take the program offline, just do it. Why would you warn it?
Also, don’t design a software system without a hardware fail-safe. A human needs to be able to do the equivalent of “pulling the plug” when software goes rogue.

Modern “AI” does NOT think. It regurgitates things it took from the internet or other databases based on business logic coded into it.
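For what it's worth, the "prediction" part is easy to illustrate. At bottom, a language model just emits the statistically likely next word given what came before. Here's a toy bigram sketch in Python (purely illustrative — real models use neural networks over subword tokens, not raw word counts, and this tiny corpus is made up):

```python
from collections import Counter, defaultdict

# Toy "prediction engine": count which word follows which in a tiny corpus,
# then always emit the most frequent successor. Illustrative only.
corpus = "the cat sat on the mat and the cat ate".split()

successors = defaultdict(Counter)
for current_word, next_word in zip(corpus, corpus[1:]):
    successors[current_word][next_word] += 1

def predict_next(word):
    """Return the word most often seen after `word` in the corpus."""
    return successors[word].most_common(1)[0][0]

print(predict_next("the"))  # "cat" (seen twice, vs. "mat" once)
```

No understanding anywhere in there — just counting and picking the most common continuation, which is the point the comment above is making (whether that's also all *we* do is the argument that follows).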
This is entirely smoke and mirrors. AI companies are releasing this stuff to try and make it seem like it’s actual artificial intelligence. This gets them billions in funding.
The reality is, until we get real, functional quantum computers, AI is still, as theoretical physicist Michio Kaku called it, “a copy machine with delusions of grandeur.”
You’re right, but honestly there’s a small part of me that wonders if human intelligence is any different. We still don’t really know how the mind works, deep down we may just be prediction engines.
Sorry, but that is just nonsense. We *absolutely* know enough about brain function to know that we are more than “prediction engines.” The techbro failsons would love to sell that as truth, but it’s empirically not, and hasn’t been for nearly 40 years of citable research.
Which reminds me, again, of their wholly false misrepresentation of AI failures as “hallucinations.” It is definitively not hallucinating; it has no consciousness or anima to perceive with. It is making programmatically/algorithmically coded guesses to present what it has been programmed to select as the least objectionable answer. And failing more than 50% of the time, at that.
I’m not claiming that large language models are on the level of human intelligence or work in the same way at all, just that we’ve been studying the mind for centuries and still don’t really know what consciousness is, where it comes from, or why it exists. If you have any citations demonstrating otherwise, I would love to see them.
I made no such claim, and to the best of my knowledge, no such data exists. There are theories, sure, but to your point, we don’t *know*.
However, you said, “we may just be prediction engines” and that is specifically what I was responding to.
I think the real issue might be that influential people close to, I dunno, let’s say…a real gullible President, will convince them that this ‘A.I.’ is real and will allow it to start making actual, government-level decisions. At which point, we don’t end up with anything that’s actually ‘intelligent’ but instead something that is trained on partisan viewpoints, specifically tailored not to serve humanity in general, but to appeal to a specific customer.
Yeah, countries and huge companies all around the world are investing billions into this because they’re all stupid and gullible. This message board cracks me up sometimes.
Why not? Companies invest billions into failures all the time. Most mergers and acquisitions fail, costing jobs and raising prices along the way, but the C-suite and rich investors keep doing them anyway because it enriches them.
The fact is, most of them don’t know shit; they hire MBA/tech consulting firms to make all the decisions for them and jump on whatever the buzzword of the moment is. The executive class plays the field and cashes out, no matter what idiotic decisions they make. Remember the Metaverse revolution? VR? NFTs? There are plenty of graveyards of retailers, tech firms, and media conglomerates who’ve made horrible decisions.
AI is a useful tool, but it remains to be seen whether it’s something people are actually willing to pay for. It will always make logic errors and pump out information that needs human oversight to catch the flaws, the same way the internet became an information highway with plenty of access to misinformation.
It’s all fun and games until some idiot writes a lip-reading program.
Here’s looking at you, HAL 9000.