DeepMind รายงานความก้าวหน้าปัญญาประดิษฐ์ MuZero เล่นเกม Atari ให้ชนะได้ แม้ไม่รู้กติกา

By arjin

on 25 December 2020 - 15:38 Tag: DeepMind, Artificial Intelligence, MuZero

DeepMind

DeepMind เผยแพร่ความคืบหน้าของปัญญาประดิษฐ์ MuZero ที่พัฒนาต่อจาก AlphaZero โดยตั้งเป้าหมายเพื่อหาอัลกอริทึมสำหรับโจทย์แบบไม่เจาะจง สามารถเอาชนะในเกมใด ๆ ก็ตาม ที่ไม่รู้กฎกติกามาก่อน

ที่ผ่านมาปัญญาประดิษฐ์ของ DeepMind จะแก้ปัญหาได้เฉพาะเรื่อง (Domain) และอาศัยองค์ความรู้ (Knowledge) ทั้งข้อมูลการเล่นในอดีต จนถึงกติกาการเล่น ซึ่งแนวทางนี้จะพบปัญหาเมื่อต้องเล่นเกมแบบ Atari ที่รูปแบบกติกาไม่ได้บอกชัดเจนมาก และเกมก็ซับซ้อนขึ้น (ดูภาพประกอบด้านล่าง)

DeepMind อธิบายเพิ่มเติมว่าปัญญาประดิษฐ์แบบนี้ ถอดแบบจากการคิดแก้ปัญหาของมนุษย์มากขึ้น เช่น เมื่อเราเจอเมฆครึ้ม เราก็จะเดาว่าฝนน่าจะตก (Predict) และหยิบร่มติดตัวเพื่อไม่ให้เปียกฝน (Decide) ระบบการคิดดังกล่าวเป็นการตัดสินใจจากสภาพที่เผชิญอยู่ตอนนั้น ไม่ใช่การดูภาพรวมทั้งหมด (เช่นการดูสภาพอากาศรวมทั้งแผนที่) โดย MuZero ใช้ 3 องค์ประกอบหลักในการตัดสินใจคือ คุณค่าของตำแหน่งปัจจุบัน (Value), การกระทำที่ดีที่สุด (Policy) และผลลัพธ์จากการกระทำก่อนหน้า (Reward)

ตัวอย่างที่ DeepMind นำมาอธิบายคือเกม Ms Pac-Man พบว่ายิ่งให้เวลาตัดสินใจต่อครั้งมากขึ้น ผลลัพธ์ก็ดีขึ้นตาม เช่นเดียวกับจำนวนทางเลือก หากให้ทางเลือกที่มากขึ้น ผลลัพธ์ก็ดีขึ้นเช่นกัน และแม้จำกัดทางเลือกต่อครั้งเหลือเพียง 6-7 วิธี ซึ่งน้อยมาก ผลลัพธ์ในการเล่นเกมก็ยังออกมาดี

ที่มา: DeepMind และ Engadget

DeepMind MuZero

MuZero

Hiring! บริษัทที่น่าสนใจ

Carmen Software

Hotel Financial Solutions

Next Innovation (Thailand) Co., Ltd.

We are web design with consulting & engineering services driven the future stronger and flexibility.

KKP Dime

KKP Dime บริษัทในเครือเกียรตินาคินภัทร

Kiatnakin Phatra Financial Group

Financial Service

Fastwork Technologies

Fastwork.co เว็บไซต์ที่รวบรวม ฟรีแลนซ์ มืออาชีพจากหลากหลายสายงานไว้ในที่เดียวกัน

Thoughtworks Thailand

Thoughtworks เป็นบริษัทที่ปรึกษาด้านเทคโนโยลีระดับโลกที่คว้า Great Place to Work 3 ปีซ้อน

Iron Software

Iron Software is an American company providing a suite of .NET libraries by engineer for engineers.

CLEVERSE

Cleverse is a Venture Builder. Our team builds several tech companies.

Nipa Cloud

#1 OpenStack cloud provider in Thailand with our own data center and software platform.

Bangmod Enterprise

The leader in Cloud Server and Hosting in Thailand.

CIMB THAI Bank

MOVING FORWARD WITH YOU - CIMB is the leading ASEAN Bank

Bangkok Bank

Bangkok Bank is one of Southeast Asia's largest regional banks, a market leader in business banking

MuvMi (Urban Mobility Tech Co.,Ltd.)

Shape the future of urban mobility towards affordable, clean, and safe solutions

T.N. Digital Solution Co., Ltd.

TNDS has been involving in every first move of banking’s major digital transformation.

KBTG - KASIKORN Business-Technology Group

KBTG - "The Technology Company for Digital Business Innovation"

Siam Commercial Bank Public Company Limited

"Let's start a brighter career future together"

Icon Framework co.,Ltd.

Global Standard Platform for Real Estate แพลตฟอร์มสำหรับธุรกิจอสังหาริมทรัพย์ครบวงจร มาตรฐานระดับโลก

REFINITIV

The Financial and Risk business of Thomson Reuters is now Refinitiv

H LAB

Re-engineering healthcare systems through intelligent platforms and system design.

The Gang Technology Co., Ltd.

We're a Digital Agency that helps our customers transform their business into digital with ease.

LTMH

LTMH มุ่งเน้นการพัฒนาผลิตภัณฑ์ที่สามารถช่วยพันธมิตรของเราให้บรรลุเป้าหมาย

Seven Peaks

We Drive Digital Transformation

Wisesight (Thailand) Co., Ltd.

The Best Choice For Handling Social Media · High Expertise in Social Data · Most Advanced and Secure

MOLOG Tech

We are Modern Logistic Platform, Specialize in WMS, OMS and TMS.

Data Wow Co.,Ltd

We enable our clients to realize increased productivity by solving their most complex issues by Data

LINE Company Thailand

LINE, the world's hottest mobile messaging platform, offers free text and voice messaging + Call

LINE MAN Wongnai

Join our journey to becoming No.1 food platform in Thailand

เข้าใจว่าเผยแพร่แบบไม่ผ่าน

zyzzyva Fri, 25/12/2020 - 17:22

เข้าใจว่าเผยแพร่แบบไม่ผ่าน peer review ใน arXiv:1911.08265 ตั้งแต่ปลายปีที่แล้ว แต่คุณภาพระดับนี้ DeepMind น่าจะส่งลง Nature ซึ่งก็ได้ลงจริง ถึงจะไม่ได้เป็นหน้าปกเหมือนพวก AlphaGo, AlphaZero (Science), AlphaStar ก็ตาม

รอดู AI Dota2 ในกติกาปกติครับ

massacre Fri, 25/12/2020 - 17:35

รอดู AI Dota2 ในกติกาปกติครับ

สอนให้ AI

100dej Fri, 25/12/2020 - 17:53

สอนให้ AI มองโลกแคบเหมือนมนุษย์ ทำให้ความสามารถถูกจำกัด?

อยากเห็นตอนมันเล่น

Hoo Fri, 25/12/2020 - 21:11

อยากเห็นตอนมันเล่น
เหมือน vdo ตอนยังเป็น Q Learning จัง

อยากทราบครับว่ามันสามารถใช้คว

komkit0710 Sat, 26/12/2020 - 00:15

อยากทราบครับว่ามันสามารถใช้ความรู้จากการเล่นอย่างหนึ่ง มาเป็นพื้นฐานของการเล่นอีกอย่างหนึ่งได้ไหมครับ เช่น RPG เกมนึงเป็น รู้ว่าอะไรคือมอนส์เตอร์ อะไรคือ NPC สามารถนำความรู้เหล่านี้ไปทดลองใช้กับอีกเกมนึงได้หรือไม่ หรือต้องเริ่มจากไม่มีประสบการณ์ใดๆ เล่นเกมไม่เป็นเลยเลยเท่านั้น

ฟังดูแล้วเหมือนจะอ่อนกว่ารุ่น

aeksael Sat, 26/12/2020 - 02:45

ฟังดูแล้วเหมือนจะอ่อนกว่ารุ่นพี่ยังไงพิกล

คุณค่าของตำแหน่งปัจจุบัน

badboyz08 Sat, 26/12/2020 - 09:52

คุณค่าของตำแหน่งปัจจุบัน (Value), โลกกำลังแย่
การกระทำที่ดีที่สุด (Policy) กำจัดมนุษย์
และผลลัพธ์จากการกระทำก่อนหน้า (Reward) ไม่มีมนุษย์ โลกปลอดภัย
เย้ UwU