Hacker News new | ask | show | jobs
by RugnirViking 1165 days ago
that's exactly why it doesnt generate the most likely sequence of tokens! They are chosen at random based on the probabilities assigned by the model, so there is a chance of unusual output. In the API you can tweak the "temprature" which weights this towards more novel output
1 comments

I’m very familiar with temperature and other parameters you can use to tweak output. They can take you decently far! GPT-2 can produce very coherent convincing output even today if you know what to tweak

Decoding methods also matter, and it’s a shame we aren’t given token probabilities (or any insight into model output) so we have more creative control over how to decode the output. Some of the better literature I’ve seen involving creative writing did have novel decoding methods