by waldrews 809 days ago
Well, in this case we're literally asking whether the model can remember new facts, not whether it can generalize, so it seems like a legit first-level test. A second-level test might be: can it answer a broader question that incorporates that specific knowledge?
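
To make the two levels concrete, here's a rough sketch of how you might score them separately. Everything in it is made up for illustration: `toy_model`, the "Zorland" fact, and the substring-match scoring are assumptions, not anything from the work being discussed. A real eval would call an actual model and probably use a better matcher.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical types: a "model" is any callable mapping a prompt to an answer.
Model = Callable[[str], str]
Case = Tuple[str, str]  # (question, substring expected somewhere in the answer)


def accuracy(model: Model, cases: List[Case]) -> float:
    """Fraction of cases whose answer contains the expected substring."""
    if not cases:
        return 0.0
    hits = sum(1 for q, expected in cases if expected.lower() in model(q).lower())
    return hits / len(cases)


def two_level_eval(
    model: Model, recall_cases: List[Case], broader_cases: List[Case]
) -> Dict[str, float]:
    """Score direct recall (level 1) and use-in-context (level 2) separately."""
    return {
        "recall": accuracy(model, recall_cases),
        "broader": accuracy(model, broader_cases),
    }


def toy_model(prompt: str) -> str:
    """Stand-in model that only answers the exact question it memorized."""
    memorized = {"What is the capital of Zorland?": "Quix"}
    return memorized.get(prompt, "I don't know")


# Level 1: the new fact asked verbatim.
recall_cases = [("What is the capital of Zorland?", "Quix")]
# Level 2: the same fact needed inside a broader question.
broader_cases = [("Which city hosts the parliament that sits in Zorland's capital?", "Quix")]

scores = two_level_eval(toy_model, recall_cases, broader_cases)
print(scores)
```

The toy model passes the first level and fails the second, which is exactly the gap the second-level test is meant to expose.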