by crazypython 1045 days ago
The GPT-3.0 "davinci-instruct-beta" models have been returning non-deterministic logprobs since at least early 2021. This is speculation, but CUDA itself often has nondeterminism bugs.
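One well-known way GPU code can give run-to-run differences, even without an outright bug, is that floating-point addition is not associative, so a parallel reduction that sums in a different order can round differently. The sketch below is plain Python and does not touch CUDA; the helper name `sum_in_order` is made up for illustration, and whether this effect explains the logprob drift above is an assumption.

```python
def sum_in_order(values):
    """Add floats strictly left to right, as a stand-in for one reduction order."""
    total = 0.0
    for v in values:
        total += v
    return total


# Same three numbers, two different summation orders.
forward = sum_in_order([0.1, 0.2, 0.3])
backward = sum_in_order([0.3, 0.2, 0.1])
print(forward == backward)  # False: the two orders round differently

# A more extreme case: the small term is lost when added to the large one first.
lost = sum_in_order([1e16, 1.0, -1e16])
kept = sum_in_order([1e16, -1e16, 1.0])
print(lost, kept)  # 0.0 1.0
```

If a kernel's thread scheduling changes the order of such additions between runs, the logits can differ in the last few bits, which is enough to change reported logprobs slightly.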

text-davinci-001 and text-davinci-002 were trained through FeedMe and SFT, while text-davinci-003 was trained with RLHF. Separately, the models themselves have more variance at high temperature.
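To make the temperature point concrete, here is a minimal sketch of standard temperature scaling: logits are divided by the temperature before the softmax, so a higher temperature flattens the distribution and sampled outputs vary more. The logits are made-up numbers, the function names are hypothetical, and I am assuming the API applies temperature in this usual way.

```python
import math


def softmax_with_temperature(logits, temperature):
    """Softmax over logits / temperature, shifted by the max for stability."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def entropy(probs):
    """Shannon entropy in nats; higher means a flatter distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


logits = [2.0, 1.0, 0.0]  # made-up values for three candidate tokens
for t in (0.2, 1.0, 2.0):
    probs = softmax_with_temperature(logits, t)
    print(t, [round(p, 3) for p in probs], round(entropy(probs), 3))
```

The entropy grows with temperature, which is the sense in which sampling gets noisier. This is separate from the logprob nondeterminism: it is randomness in the sampling step, not in the computed probabilities.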

1 comment

What about the foundation models, i.e. davinci and code-davinci-002?