|
|
|
|
|
by bbor
1062 days ago
|
|
It would be a bit of a scandal, and IMO too much hassle to sneak in. These models are trained on massive amounts of text - specifically anticipating which metrics people will care about and generating synthetic data just for them seems extra. But not an expert or OP! |
|
For a particular model you try to minimally do this by separating a test and validation set, but on a meta-meta level, it's easy to see it happening.