by dwaltrip
933 days ago
When it comes to training LLMs, the definition of “improvement” is incredibly clear, as one must literally code a loss function that the model then minimizes. It gets murkier trying to map that to actual capabilities, but so far, lower loss has led to much stronger capabilities.