by buildbot
936 days ago
This conspiracy theory always comes up. Don't you think they test the output of each model revision on probably thousands of downstream tasks at this point? Bad responses are hard to reason about: it could be the prompting, it could be a model revision, or it could just be bad luck.
LLMs are known to be compute- and energy-hungry to run. The technology is still developing, if not downright experimental.
Therefore, this explanation is very likely. I don't see any reason to call it a conspiracy.