Hacker News new | ask | show | jobs
by am17an 115 days ago
Honestly you can run this on a 16GB VRAM GPU with llama.cpp. Just try it!