Hacker News new | ask | show | jobs
by jerpint 1193 days ago
There have been CPU implementations of LLAMA (7b parameters, comparable in size) with very impressive performance