Y
Hacker News
new
|
ask
|
show
|
jobs
by
yencabulator
397 days ago
"Can run" is pretty easy, it's pretty small and quantized. It runs at 3.7 tokens/second on pure CPU with AMD 8945HS.