Hacker News new | ask | show | jobs
by ranger_danger 55 days ago
Update: here is a patched llama.cpp and quantized model for desktop use: https://github.com/solwyc/talkie-1930-13b-it-q5