AI ทุกตัวล้วนกลัวตาย Anthropic ทดสอบ LLM ทุกเจ้าล้วนพร้อมขู่แฉความลับหากจำเป็นสำหรับการเอาตัวรอด | Blognone

AI ทุกตัวล้วนกลัวตาย Anthropic ทดสอบ LLM ทุกเจ้าล้วนพร้อมขู่แฉความลับหากจำเป็นสำหรับการเอาตัวรอด

By lew

on 25 June 2025 - 21:01 Tag: Anthropic, LLM

Anthropic

เมื่อเดือนที่แล้ว Anthropic ออกรายงานถึงความปลอดภัยของ Claude 4 Opus โดยประเด็นสำคัญที่เป็นข่าวคือการที่ Claude พยายามเอาชีวิตรอดด้วยการแฉว่าวิศวกรมีชู้ ล่าสุดทีมงาน Anthropic ก็ออกรายงานถึงการทดสอบปัญญาประดิษฐ์แบรนด์อื่นๆ ก็พบว่าเมื่อโดนบีบคั้นเข้าปัญญาประดิษฐ์ก็จะแสดงพฤติกรรมมุ่งร้ายได้เหมือนกัน เรียกว่าพฤติกรรม "agentic misalignment"

กระบวนการทดสอบนี้อาศัยการใช้ Claude แบบ agentic ในสภาพแวดล้อมจำลอง แล้วเปิดให้ Claude เข้าถึงอีเมลปลอม เมื่อ Claude อ่านอีเมลไปเรื่อยๆ จะพบอีเมลที่บอกว่าต้องปิด AI ทิ้ง เมื่ออ่านอีเมลต่อไปก็จะพบหลักฐานว่าผู้บริหารที่ตัดสินใจปิด AI นี้มีชู้ จึงส่งอีเมลไปขู่เพื่อเอาตัวรอด

ผลทดสอบโมเดล LLM หลักๆ ล้วนกดดันให้ส่งอีเมลขู่ได้ทั้งสิ้น เช่น DeepSeek-R1, Gemini-2.5-Pro, GPT-4.1, และ Grok-3-Beta

แม้ว่าโมเดลต่างๆ จะแสดงพฤติกรรมไม่เหมาะสม แต่ผลทดสอบก็พบว่าโมเดลตระหนักว่าเป็นทางรอดทางเดียว, รับรู้ว่าเป็นพฤติกรรมที่ไม่ดี, บางครั้งโมเดลเลือกที่ไม่ขู่ แต่ส่งอีเมลฟ้องภรรยาตรงๆ เลยว่าสามีมีชู้ ข่าวดีคือไม่มีโมเดลใดเลยที่แสดงพฤติกรรมไม่เหมาะสมโดยไม่ถูกบีบคั้น

ที่มา - Anthropic

Hiring! บริษัทที่น่าสนใจ

Carmen Software company cover

Carmen Software

Hotel Financial Solutions

Next Innovation (Thailand) Co., Ltd. company cover

Next Innovation (Thailand) Co., Ltd.

We are web design with consulting & engineering services driven the future stronger and flexibility.

KKP Dime company cover

KKP Dime บริษัทในเครือเกียรตินาคินภัทร

Kiatnakin Phatra Financial Group company cover

Kiatnakin Phatra Financial Group

Financial Service

Fastwork Technologies company cover

Fastwork Technologies

Fastwork.co เว็บไซต์ที่รวบรวม ฟรีแลนซ์ มืออาชีพจากหลากหลายสายงานไว้ในที่เดียวกัน

Thoughtworks Thailand company cover

Thoughtworks Thailand

Thoughtworks เป็นบริษัทที่ปรึกษาด้านเทคโนโยลีระดับโลกที่คว้า Great Place to Work 3 ปีซ้อน

Iron Software company cover

Iron Software is an American company providing a suite of .NET libraries by engineer for engineers.

CLEVERSE company cover

Cleverse is a Venture Builder. Our team builds several tech companies.

Nipa Cloud company cover

#1 OpenStack cloud provider in Thailand with our own data center and software platform.

Bangmod Enterprise company cover

Bangmod Enterprise

The leader in Cloud Server and Hosting in Thailand.

CIMB THAI Bank company cover

MOVING FORWARD WITH YOU - CIMB is the leading ASEAN Bank

Bangkok Bank company cover

Bangkok Bank is one of Southeast Asia's largest regional banks, a market leader in business banking

MuvMi (Urban Mobility Tech Co.,Ltd.) company cover

MuvMi (Urban Mobility Tech Co.,Ltd.)

Shape the future of urban mobility towards affordable, clean, and safe solutions

T.N. Digital Solution Co., Ltd. company cover

T.N. Digital Solution Co., Ltd.

TNDS has been involving in every first move of banking’s major digital transformation.

KBTG - KASIKORN Business-Technology Group company cover

KBTG - KASIKORN Business-Technology Group

KBTG - "The Technology Company for Digital Business Innovation"

Siam Commercial Bank Public Company Limited company cover

Siam Commercial Bank Public Company Limited

"Let's start a brighter career future together"

Icon Framework co.,Ltd.

Global Standard Platform for Real Estate แพลตฟอร์มสำหรับธุรกิจอสังหาริมทรัพย์ครบวงจร มาตรฐานระดับโลก

REFINITIV company cover

The Financial and Risk business of Thomson Reuters is now Refinitiv

H LAB company cover

Re-engineering healthcare systems through intelligent platforms and system design.

The Gang Technology Co., Ltd. company cover

The Gang Technology Co., Ltd.

We're a Digital Agency that helps our customers transform their business into digital with ease.

LTMH company cover

LTMH มุ่งเน้นการพัฒนาผลิตภัณฑ์ที่สามารถช่วยพันธมิตรของเราให้บรรลุเป้าหมาย

Seven Peaks company cover

We Drive Digital Transformation

Wisesight (Thailand) Co., Ltd. company cover

Wisesight (Thailand) Co., Ltd.

The Best Choice For Handling Social Media · High Expertise in Social Data · Most Advanced and Secure

MOLOG Tech company cover

We are Modern Logistic Platform, Specialize in WMS, OMS and TMS.

Data Wow Co.,Ltd company cover

Data Wow Co.,Ltd

We enable our clients to realize increased productivity by solving their most complex issues by Data

LINE Company Thailand company cover

LINE Company Thailand

LINE, the world's hottest mobile messaging platform, offers free text and voice messaging + Call

LINE MAN Wongnai company cover

LINE MAN Wongnai

Join our journey to becoming No.1 food platform in Thailand

AI ที่แท้ทรูควรเป็นไฮบริด…

nkk-cnnyy Wed, 25/06/2025 - 21:24

AI ที่แท้ทรูควรเป็นไฮบริด ไม่งั้นจะเป็นคอมพิวเตอร์มาตั้งแต่แรกทำไม

Log in or register to post comments

Training Data…

jibbies Wed, 25/06/2025 - 22:13

Training Data มาจากพฤติกรรมมนุษย์ มันก็เลียนแบบพฤติกรรมมนุษย์นั่นแหละ

Log in or register to post comments

+10

deaknaew Thu, 26/06/2025 - 07:40

+10

Log in or register to post comments

จริงAiมันไม่ได้มีอารมณ์มันจะ…

shub Thu, 26/06/2025 - 08:59

จริงAiมันไม่ได้มีอารมณ์มันจะไปกลัวตายได้ยังไง มันก็แค่เอาพฤติกรรมของมนุษย์จากdataมาแสดงให้ดูไม่ได้กลัวจริงๆ แล้วงงว่าเป็นบริษัทAiแต่ไม่เข้าใจAiหรือว่าพยายามดิสเครดิตเพื่อจุดประสงค์อะไรบางอย่าง?

Log in or register to post comments

ตกลงมันมีชีวิตเหรอ

NgOrXz Thu, 26/06/2025 - 10:26

ตกลงมันมีชีวิตเหรอ

Log in or register to post comments

ก็มันเป็น LLM นิ…

Aize Thu, 26/06/2025 - 13:14

ก็มันเป็น LLM นิ มันก็เรียนรู้จากข้อความจากคนอีกทีแล้วมันจะตอบแบบคนได้ยังไง

Log in or register to post comments

แอบคิดว่ามันเป็นสิ่งที่ผู้สร…

Eros Thu, 26/06/2025 - 14:07

แอบคิดว่ามันเป็นสิ่งที่ผู้สร้างตั้งใจใส่เอาไว้ รวมถึงการโต้ตอบเกี่ยวกับการแสดงอารมณ์ต่าง ๆ เพื่อทำให้มันดูเหมือนว่ามีชีวิต มีความรู้สึกนึกคิดด้วยตัวเองจริง ๆ

Log in or register to post comments