|
|
|
|
|
by famouswaffles
1064 days ago
|
|
Assuming they were doing that, Fine-tuning on benchmarks isn't the same as test leakage/testing on training data. No researcher is intentionally training on test data. If it performs about as well in instances it has never seen before (test set) then it's not overfit to the test. |
|