by lreeves, 199 days ago
I run the larger version of it on a Threadripper with 512GB of RAM and a 32GB GPU, using llama.cpp with the non-expert layers and the context kept on the GPU. It performs great, but god forbid you try to get that much memory these days.
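For readers who want to try a similar split, here is a rough sketch of how such a llama.cpp launch could be assembled. It is not the commenter's actual command: the model path, context size, and choice of the `llama-server` binary are placeholders, and the tensor-name regex assumes the GGUF file names its mixture-of-experts weights with an `_exps` suffix (for example `ffn_up_exps`), which may not hold for every model. The idea is to offload all layers to the GPU and then use `--override-tensor` to pin the expert tensors back to system RAM.

```python
import shlex


def build_llama_cmd(model_path, ctx_size=8192, binary="llama-server"):
    """Build an argv list for llama.cpp with expert tensors kept in CPU RAM.

    All values here are illustrative assumptions, not the commenter's settings.
    """
    return [
        binary,
        "--model", model_path,
        # Ask for every layer on the GPU; the override below carves out the experts.
        "--n-gpu-layers", "999",
        # Assumed naming: expert weights look like "blk.N.ffn_*_exps.weight".
        "--override-tensor", r"\.ffn_.*_exps\.=CPU",
        # The KV cache (context) stays on the GPU alongside the offloaded layers.
        "--ctx-size", str(ctx_size),
    ]


# Hypothetical model path; replace with a real GGUF file before running.
cmd = build_llama_cmd("/path/to/model.gguf")
print(shlex.join(cmd))
```

The printed line can be pasted into a shell. How large a context fits next to the non-expert layers in 32GB of VRAM depends on the model and quantization, so the `ctx_size` default above is only a starting guess.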