Hacker News new | ask | show | jobs
by underlines 1183 days ago
just run the 13b model 4bit quantized locally, it's already better than the 7b-8bit and you can turn down the temperature to 0 to get repeatable results.