by yencabulator 397 days ago
"Can run" is pretty easy: it's pretty small and quantized. It runs at 3.7 tokens/second on pure CPU with an AMD 8945HS.