Hacker News new | ask | show | jobs
by unityByFreedom 1933 days ago
Evaluating the quality of language models is a challenge in itself. This post just presents another way to see how much your model can understand alongside a new model. This is typical when presenting new tech for which old evaluation methods might not tell you anything.

It's not really about getting the answer to that question as it is about figuring out how much information your model can glean from text.