Home
Crypto
- Altcoins
- Bitcoin
- Ethereum
- Monero
- XRP
- Zcash
Web3
DeFi
NFTs

No Result

View All Result

Home
Crypto
- Altcoins
- Bitcoin
- Ethereum
- Monero
- XRP
- Zcash
Web3
DeFi
NFTs

No Result

View All Result

No Result

View All Result

LLaMa with 7 Billion Parameters Achieves Lightning-Fast Inference on Apple M2 Max Chip

by Altszn.com

in Metaverse, Web3

LLaMa with 7 Billion Parameters Achieves Lightning-Fast Inference on Apple M2 Max Chip

399

SHARES

2.3k

VIEWS

Share on Facebook Share on Twitter

A groundbreaking achievement in the field of AI has been unveiled as the LLaMa model with an astonishing 7 billion parameters now runs at an impressive speed of 40 tokens per second on a MacBook equipped with the cutting-edge M2 Max chip. This remarkable feat was made possible through a recent update to the GIT repository by Greganov, who successfully implemented model inference on the Metal GPU, a specialized accelerator found in Apple’s latest chips.

LLaMa with 7 Billion Parameters Achieves Lightning-Fast Inference on Apple M2 Max Chip — Credit: Metaverse Post (mpost.io)