|
|
|
|
|
by nkozyra
94 days ago
|
|
The problem with evals is the underlying rubric will always be either subjective, or a quantitative score based on something that is likely now baked into the training set directly. You kind of have to go on "feels" for a lot of this. |
|