Hacker News
by am17an, 115 days ago
Honestly you can run this on a 16GB VRAM GPU with llama.cpp. Just try it!