Hacker News new | ask | show | jobs
by petulla 761 days ago
You need to know that this test set data wasn't included in the training data for this to be meaningful.
3 comments

If you ask the questions without providing the limerick first, it never gets the right answer. When the LLM gets the wrong answer, it is usually because it reverts to its training data and gives a generic answer that doesn't apply to the limerick.
Why are you ruling out the possibility that training on the material may confer an advantage when the data is presented, even if the advantage may not be strong enough to pass the test without the data present in the context window?
No you don't. Compare the model's performance before and after uploading the material.
Previous answer to this question:

https://news.ycombinator.com/item?id=40361419s

No such item.
I thought the test limericks were autogenerated?
They come from a database of 98k limericks -- https://zenodo.org/records/5722527