by mistymountains
1067 days ago
They don’t. They simply assume the model’s most likely output is meaningfully correlated with the true rankings, even though it was never trained on this task and certainly was never trained to output the most likely prompt, given a prompt, in any meaningful order. It’s hogwash.