Hacker News new | ask | show | jobs
by kgeist 156 days ago
Early LLMs used to have this often. I think's that where the "repetition penalty" parameter comes from. I suspect output quality can be improved with better sampling parameters.