Hacker News new | ask | show | jobs
by BoberMod 1195 days ago
There is also a gpu-acelerated fork of the original repo

https://github.com/remixer-dec/llama-mps

1 comments

> For 7B model, it always goes above 32gb of RAM,

That's double of what Tinygrad uses

Tinygrad is using openCL right?