| HN Mirror


	by petulla 761 days ago
	You need to know that this test set data wasn't included in the training data for this to be meaningful.

3 comments

sftombu 761 days ago

If you ask the questions without providing the limerick first, the LLM never gets the right answer. When it gets the wrong answer, it is usually because it reverts to its training data and gives a generic answer that doesn't apply to the limerick.

trifurcate 761 days ago

Why are you ruling out the possibility that training on the material may confer an advantage when the data is presented, even if the advantage may not be strong enough to pass the test without the data present in the context window?

a_wild_dandan 761 days ago

No, you don't. Compare the model's performance before and after uploading the material.
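
A minimal sketch of that before/after comparison, assuming each question has already been graded as right or wrong in two runs: one without the material in the context window and one with it. The function names and the grades below are hypothetical, made up for illustration, and are not taken from the benchmark being discussed.

```python
from typing import Sequence


def accuracy(correct: Sequence[bool]) -> float:
    """Fraction of questions graded as correct."""
    if not correct:
        raise ValueError("need at least one graded answer")
    return sum(correct) / len(correct)


def context_lift(without_context: Sequence[bool],
                 with_context: Sequence[bool]) -> float:
    """Accuracy gained by placing the material in the context window.

    Both sequences must grade the same questions in the same order.
    """
    if len(without_context) != len(with_context):
        raise ValueError("both runs must cover the same questions")
    return accuracy(with_context) - accuracy(without_context)


# Made-up grades for five questions, purely illustrative.
closed_book = [False, False, False, False, False]  # material not provided
open_book = [True, True, False, True, True]        # material provided

lift = context_lift(closed_book, open_book)
print(f"accuracy lift from context: {lift:.2f}")
```

A large lift shows the model is using the supplied material. On its own it does not rule out the point raised above: prior training on the material could still help once the material is in the context window.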

sftombu 761 days ago

Previous answer to this question:

https://news.ycombinator.com/item?id=40361419s

sumedh 761 days ago

No such item.

lmeyerov 761 days ago

I thought the test limericks were autogenerated?

sftombu 761 days ago

They come from a database of 98k limericks -- https://zenodo.org/records/5722527
