Hacker News new | ask | show | jobs
by pja 805 days ago
llama.cpp can quantize a model for you:

https://github.com/ggerganov/llama.cpp?tab=readme-ov-file#qu...

1 comments

Thanks!!