by RugnirViking
1165 days ago
That's exactly why it doesn't generate the most likely sequence of tokens! Tokens are chosen at random based on the probabilities assigned by the model, so there is always a chance of unusual output. In the API you can tweak the "temperature": raising it flattens the distribution and weights sampling towards more novel output.
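To make that concrete, here is a rough sketch of temperature sampling over a toy set of logits. The function names and the example numbers are my own for illustration, not anything from a real API, and actual implementations will differ in the details.

```python
import math
import random

def softmax_with_temperature(logits, temperature=1.0):
    """Turn raw scores into probabilities, scaled by a temperature.

    Hypothetical helper for illustration: temperature < 1 sharpens the
    distribution, temperature > 1 flattens it.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def sample_token(logits, temperature=1.0, rng=random):
    """Draw one token index at random, weighted by the scaled probabilities."""
    probs = softmax_with_temperature(logits, temperature)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]

# Toy logits for three candidate tokens (made-up values).
logits = [2.0, 1.0, 0.1]
print(softmax_with_temperature(logits, 0.5))  # peaked: top token dominates
print(softmax_with_temperature(logits, 2.0))  # flatter: more room for novelty
print(sample_token(logits, temperature=1.0))
```

With a low temperature the top token wins almost every time; with a high one the less likely tokens get picked more often, which is where the unusual output comes from.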
Decoding methods also matter, and it’s a shame we aren’t given token probabilities (or any insight into model output), since that would give us more creative control over how to decode the output. Some of the better literature I’ve seen involving creative writing used novel decoding methods.
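As an example of what access to token probabilities would allow, here is a minimal sketch of one well-known decoding method, nucleus (top-p) sampling. It is not necessarily what that literature used, and the function names and the probability values are assumptions made up for illustration.

```python
import random

def top_p_filter(probs, top_p=0.9):
    """Keep the smallest set of most likely tokens whose mass reaches top_p.

    Hypothetical helper: returns {token_index: renormalized_probability}.
    """
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, cumulative = [], 0.0
    for i in ranked:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= top_p:
            break
    total = sum(probs[i] for i in kept)
    return {i: probs[i] / total for i in kept}

def sample_top_p(probs, top_p=0.9, rng=random):
    """Sample a token index from the truncated, renormalized distribution."""
    filtered = top_p_filter(probs, top_p)
    ids = list(filtered)
    return rng.choices(ids, weights=[filtered[i] for i in ids], k=1)[0]

# Toy next-token distribution over four tokens (made-up values).
probs = [0.5, 0.3, 0.15, 0.05]
print(top_p_filter(probs, top_p=0.7))
print(sample_top_p(probs, top_p=0.7))
```

The point is only that the decoding step is a separate design choice from the model itself, and you can only experiment with it if the probabilities are exposed.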