อันนี้ต้องโทษ AI? พบกราฟนำเสนอในงานเปิดตัว GPT-5 หลายอัน สเกลผิดเพี้ยนไปมาก | Blognone

อันนี้ต้องโทษ AI? พบกราฟนำเสนอในงานเปิดตัว GPT-5 หลายอัน สเกลผิดเพี้ยนไปมาก

By arjin

on 8 August 2025 - 13:29 Tag: OpenAI, ChatGPT, LLM

OpenAI

มีประเด็นเล็ก ๆ จากงานเปิดตัวโมเดล GPT-5 ของ OpenAI เมื่อคืนนี้ เมื่อทีมงานพูดถึงผลการทดสอบความสามารถโมเดล AI ในด้านต่าง ๆ และเปรียบเทียบกับโมเดลรุ่นก่อนหน้า ด้วยกราฟแท่งของคะแนนเปรียบเทียบกัน แต่มีอะไรให้เอ๊ะอยู่บ้าง

กราฟแรกเป็นอัตราการสร้างผลลัพธ์ลวงที่ไม่สามารถทำได้จริง (Deception Rate) ซึ่งระบุว่า GPT-5 ทำได้ที่ 50% แต่กราฟที่แสดงนั้นดูเพี้ยนไปเมื่อเทียบกับ o3 ที่ 47.4% ทั้งนี้ในรายงานฉบับเต็มของ OpenAI ตัวเลขจริงอยู่ที่ 16.5% ด้วย

กราฟอีกอันเป็นผลทดสอบ SWE-bench เรื่อง Software Engineering ซึ่ง GPT-5 ทำคะแนนได้ 74.9% ดีกว่า o3 ที่ 69.1% แต่กราฟที่แสดงดูผิดสัดส่วนไปมาก โดยเฉพาะเมื่อเทียบกับ GPT-4o ที่ 30.8%

Sam Altman ซีอีโอ OpenAI ตอบประเด็นนี้ใน X ว่าเป็นการสร้างกราฟที่ผิดพลาดมาก อย่างไรก็ตามในโพสต์ทางการบนเว็บ OpenAI กราฟทั้งหมดแสดงผลอย่างถูกต้องแล้ว เช่นเดียวกับตัวแทนฝ่ายการตลาดของ OpenAI ออกมาชี้แจงว่าได้แก้ไขตารางนี้แบบทางการไปแล้ว

ที่มา: The Verge

Hiring! บริษัทที่น่าสนใจ

Carmen Software company cover

Carmen Software

Hotel Financial Solutions

Next Innovation (Thailand) Co., Ltd. company cover

Next Innovation (Thailand) Co., Ltd.

We are web design with consulting & engineering services driven the future stronger and flexibility.

KKP Dime company cover

KKP Dime บริษัทในเครือเกียรตินาคินภัทร

Kiatnakin Phatra Financial Group company cover

Kiatnakin Phatra Financial Group

Financial Service

Fastwork Technologies company cover

Fastwork Technologies

Fastwork.co เว็บไซต์ที่รวบรวม ฟรีแลนซ์ มืออาชีพจากหลากหลายสายงานไว้ในที่เดียวกัน

Thoughtworks Thailand company cover

Thoughtworks Thailand

Thoughtworks เป็นบริษัทที่ปรึกษาด้านเทคโนโยลีระดับโลกที่คว้า Great Place to Work 3 ปีซ้อน

Iron Software company cover

Iron Software is an American company providing a suite of .NET libraries by engineer for engineers.

CLEVERSE company cover

Cleverse is a Venture Builder. Our team builds several tech companies.

Nipa Cloud company cover

#1 OpenStack cloud provider in Thailand with our own data center and software platform.

Bangmod Enterprise company cover

Bangmod Enterprise

The leader in Cloud Server and Hosting in Thailand.

CIMB THAI Bank company cover

MOVING FORWARD WITH YOU - CIMB is the leading ASEAN Bank

Bangkok Bank company cover

Bangkok Bank is one of Southeast Asia's largest regional banks, a market leader in business banking

MuvMi (Urban Mobility Tech Co.,Ltd.) company cover

MuvMi (Urban Mobility Tech Co.,Ltd.)

Shape the future of urban mobility towards affordable, clean, and safe solutions

T.N. Digital Solution Co., Ltd. company cover

T.N. Digital Solution Co., Ltd.

TNDS has been involving in every first move of banking’s major digital transformation.

KBTG - KASIKORN Business-Technology Group company cover

KBTG - KASIKORN Business-Technology Group

KBTG - "The Technology Company for Digital Business Innovation"

Siam Commercial Bank Public Company Limited company cover

Siam Commercial Bank Public Company Limited

"Let's start a brighter career future together"

Icon Framework co.,Ltd.

Global Standard Platform for Real Estate แพลตฟอร์มสำหรับธุรกิจอสังหาริมทรัพย์ครบวงจร มาตรฐานระดับโลก

REFINITIV company cover

The Financial and Risk business of Thomson Reuters is now Refinitiv

H LAB company cover

Re-engineering healthcare systems through intelligent platforms and system design.

The Gang Technology Co., Ltd. company cover

The Gang Technology Co., Ltd.

We're a Digital Agency that helps our customers transform their business into digital with ease.

LTMH company cover

LTMH มุ่งเน้นการพัฒนาผลิตภัณฑ์ที่สามารถช่วยพันธมิตรของเราให้บรรลุเป้าหมาย

Seven Peaks company cover

We Drive Digital Transformation

Wisesight (Thailand) Co., Ltd. company cover

Wisesight (Thailand) Co., Ltd.

The Best Choice For Handling Social Media · High Expertise in Social Data · Most Advanced and Secure

MOLOG Tech company cover

We are Modern Logistic Platform, Specialize in WMS, OMS and TMS.

Data Wow Co.,Ltd company cover

Data Wow Co.,Ltd

We enable our clients to realize increased productivity by solving their most complex issues by Data

LINE Company Thailand company cover

LINE Company Thailand

LINE, the world's hottest mobile messaging platform, offers free text and voice messaging + Call

LINE MAN Wongnai company cover

LINE MAN Wongnai

Join our journey to becoming No.1 food platform in Thailand

คีย์พร้อมต์ให้เอไอสร้าง…

pit Fri, 08/08/2025 - 22:08

คีย์พร้อมต์ให้เอไอสร้าง แล้วมั่นใจมากจนไม่ตรวจปรู๊ฟก่อน?

Log in or register to post comments