by brucethemoose2 1062 days ago
Llama.cpp, through kobold.cpp, offloading some of the model to RAM.