Hacker News new | ask | show | jobs
by hacker_homie 66 days ago
Llama.cpp added the ability load/switch models on demand with the max-models and models preset flags.