by stephenroller
1021 days ago
The Llama1 team did not have a validation set. I don’t know what the Llama2 team did; I left before seeing any of the details. My guess is that Llama2 upsamples Wikipedia a good bit, but given that they didn’t report any information about their training data, it’s hard to say.