ทีมวิจัยไมโครซอฟท์โชว์ BitNet โมเดล LLM ขนาดเล็ก 1-bit ใช้แรม 0.4GB รันในซีพียูได้ | Blognone

ทีมวิจัยไมโครซอฟท์โชว์ BitNet โมเดล LLM ขนาดเล็ก 1-bit ใช้แรม 0.4GB รันในซีพียูได้

By mk

on 20 April 2025 - 13:16 Tag: Microsoft, Research, LLM

Microsoft

ทีมวิจัย Microsoft Research เปิดตัวโมเดลภาษา BitNet ที่ขนาดเล็กพอจนสามารถรันในซีพียูได้

วงการโมเดลภาษา LLM รุ่นเล็กมีโมเดลหลายค่าย เช่น Llama ขนาดพารามิเตอร์ 1B และ 3B กรณีของ BitNet มีขนาดพารามิเตอร์ 2B และเทรนด้วยข้อมูลขนาด 4T (trillion tokens) แล้วถูกลดขนาดน้ำหนักข้อมูล (quantized) เพื่อให้ขนาดของโมเดลเล็กลง

จุดเด่นของ BitNet คือใช้เทคนิค quantization แบบ 1-bit (มีได้ 3 สถานะคือ -1, 0, 1) ตั้งแต่ตอนเทรนโมเดลเลย งานวิจัยนี้ต้องการพิสูจน์ว่าโมเดล 1-bit LLM ถ้าเทรนด้วยวิธีการที่ดีพอ ก็สามารถให้ผลลัพธ์ที่ดีไม่แพ้กับโมเดลที่ไม่ถูก quantized น้ำหนักได้

ขนาดที่เล็กของ BitNet ทำให้มันต้องการแรมแค่ 0.4GB น้อยกว่า Llama 3.2 1B ที่ใช้แรม 2GB หรือ Gemma 3 1B ที่ใช้แรม 1.4GB แต่ยังให้ผลลัพธ์ที่ดีพอๆ กัน และดีกว่าด้วยซ้ำในบางชุดทดสอบ แถมยังตอบเร็วกว่า มีค่า latency อยู่ที่ 29ms เทียบกับ Llama 3.2 1B ที่ใช้ 48ms

ตอนนี้ BitNet ยังมีสถานะเป็นงานวิจัย มีเปเปอร์เผยแพร่ เพื่อหาความเป็นไปได้ของการลดขนาดโมเดลลง เพื่อให้รันงานได้บนฮาร์ดแวร์ที่กว้างขวางมากขึ้น

ที่มา - Microsoft, TechCrunch

Hiring! บริษัทที่น่าสนใจ

Carmen Software company cover

Carmen Software

Hotel Financial Solutions

Next Innovation (Thailand) Co., Ltd. company cover

Next Innovation (Thailand) Co., Ltd.

We are web design with consulting & engineering services driven the future stronger and flexibility.

KKP Dime company cover

KKP Dime บริษัทในเครือเกียรตินาคินภัทร

Kiatnakin Phatra Financial Group company cover

Kiatnakin Phatra Financial Group

Financial Service

Fastwork Technologies company cover

Fastwork Technologies

Fastwork.co เว็บไซต์ที่รวบรวม ฟรีแลนซ์ มืออาชีพจากหลากหลายสายงานไว้ในที่เดียวกัน

Thoughtworks Thailand company cover

Thoughtworks Thailand

Thoughtworks เป็นบริษัทที่ปรึกษาด้านเทคโนโยลีระดับโลกที่คว้า Great Place to Work 3 ปีซ้อน

Iron Software company cover

Iron Software is an American company providing a suite of .NET libraries by engineer for engineers.

CLEVERSE company cover

Cleverse is a Venture Builder. Our team builds several tech companies.

Nipa Cloud company cover

#1 OpenStack cloud provider in Thailand with our own data center and software platform.

Bangmod Enterprise company cover

Bangmod Enterprise

The leader in Cloud Server and Hosting in Thailand.

CIMB THAI Bank company cover

MOVING FORWARD WITH YOU - CIMB is the leading ASEAN Bank

Bangkok Bank company cover

Bangkok Bank is one of Southeast Asia's largest regional banks, a market leader in business banking

MuvMi (Urban Mobility Tech Co.,Ltd.) company cover

MuvMi (Urban Mobility Tech Co.,Ltd.)

Shape the future of urban mobility towards affordable, clean, and safe solutions

T.N. Digital Solution Co., Ltd. company cover

T.N. Digital Solution Co., Ltd.

TNDS has been involving in every first move of banking’s major digital transformation.

KBTG - KASIKORN Business-Technology Group company cover

KBTG - KASIKORN Business-Technology Group

KBTG - "The Technology Company for Digital Business Innovation"

Siam Commercial Bank Public Company Limited company cover

Siam Commercial Bank Public Company Limited

"Let's start a brighter career future together"

Icon Framework co.,Ltd.

Global Standard Platform for Real Estate แพลตฟอร์มสำหรับธุรกิจอสังหาริมทรัพย์ครบวงจร มาตรฐานระดับโลก

REFINITIV company cover

The Financial and Risk business of Thomson Reuters is now Refinitiv

H LAB company cover

Re-engineering healthcare systems through intelligent platforms and system design.

The Gang Technology Co., Ltd. company cover

The Gang Technology Co., Ltd.

We're a Digital Agency that helps our customers transform their business into digital with ease.

LTMH company cover

LTMH มุ่งเน้นการพัฒนาผลิตภัณฑ์ที่สามารถช่วยพันธมิตรของเราให้บรรลุเป้าหมาย

Seven Peaks company cover

We Drive Digital Transformation

Wisesight (Thailand) Co., Ltd. company cover

Wisesight (Thailand) Co., Ltd.

The Best Choice For Handling Social Media · High Expertise in Social Data · Most Advanced and Secure

MOLOG Tech company cover

We are Modern Logistic Platform, Specialize in WMS, OMS and TMS.

Data Wow Co.,Ltd company cover

Data Wow Co.,Ltd

We enable our clients to realize increased productivity by solving their most complex issues by Data

LINE Company Thailand company cover

LINE Company Thailand

LINE, the world's hottest mobile messaging platform, offers free text and voice messaging + Call

LINE MAN Wongnai company cover

LINE MAN Wongnai

Join our journey to becoming No.1 food platform in Thailand

https://bitnet-demo

au8ust Sun, 20/04/2025 - 14:16

https://bitnet-demo.azurewebsites.net/

ก็ใช้ได้อยู่ ให้เขียนอะไรง่ายๆ เร็วๆ ไม่ซับซ้อนมาก แต่เรื่องภาษายังมีปัญหาเยอะพอสมควร

Log in or register to post comments

Hallucination มากกับภาษาไต

tg-thaigamer Sun, 20/04/2025 - 16:09

Hallucination มากกับภาษาไต

Log in or register to post comments

ยังใช้งานไม่ค่อยดีเท่าไหร่

7 Sun, 20/04/2025 - 19:57

ยังใช้งานไม่ค่อยดีเท่าไหร่ แต่เป็นกำลังใจให้นะ

Log in or register to post comments

เท่จ๊าดดดดด

Mr.EYE Mon, 21/04/2025 - 12:23

เท่จ๊าดดดดด

Log in or register to post comments