|
|
|
|
|
by captn3m0
410 days ago
|
|
Wouldn’t any randomness (for a fixed combination of hardware and weights) be a result of the temperature and any randomness inserted at inference-time? Otherwise, doing a H/T comparison is just a proxy to what the underlying token probabilities are and the temperature configuration (+hardware differences for a remote-hosted model). |
|
I had an hour to kill and did this experiment.