Hacker News
by
kgeist
156 days ago
Early LLMs used to have this often. I think that's where the "repetition penalty" parameter comes from. I suspect output quality can be improved with better sampling parameters.
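
For anyone unfamiliar with the parameter, here is a rough sketch of one common way a repetition penalty is applied to logits before sampling: scores of tokens that have already been generated are pushed down. The function name, the toy logits, and the 1.2 default are made up for illustration; real inference libraries may differ in the details.

```python
def apply_repetition_penalty(logits, generated_ids, penalty=1.2):
    """Return a copy of `logits` with already-generated tokens made less likely.

    Hypothetical helper for illustration only, not any library's API.
    """
    adjusted = list(logits)
    for token_id in set(generated_ids):
        score = adjusted[token_id]
        # Dividing a positive score and multiplying a negative one
        # both lower the score when penalty > 1.
        adjusted[token_id] = score / penalty if score > 0 else score * penalty
    return adjusted


# Toy vocabulary of 4 tokens; tokens 0 and 1 have already been generated.
logits = [2.0, -1.0, 0.5, 3.0]
print(apply_repetition_penalty(logits, generated_ids=[0, 1, 0]))
```

With a penalty of 1.0 the logits are unchanged; larger values discourage repeats more strongly, and setting it too high can hurt fluency, so it is usually tuned together with the other sampling parameters.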