Hacker News new | ask | show | jobs
Local Code Chatbot Running on 2GB RAM (twitter.com)
1 points by amasad 1092 days ago
1 comments

There is even some untapped headroom, as they quantized to Q4 instead of using K-quant.

Not to speak of the potential hooked up to a vector db, swapping out LORAs for different languages and such.