Hacker News new | ask | show | jobs
by Nevermark 1 day ago
Subtleties can be hard to convey in text.

"Novel scope" just meant the scope beyond training data, discoverable by experimenting, that a post-trained model was able to generalize well to.

It didn't mean arbitrary or alien to training data.

Thesis: The greater a model's scope of generalization, the greater evidence for "understanding" instead of fitting. I can't think of a better way to compare levels of understanding, for models of comparable size, than by how far each of them can generalize beyond training data.

I didn't always follow you either. But I didn't think you were being flippant or unreasonable when I didn't.

No worries. I appreciated being pushed to think more clearly, and you made points that improved my thinking directly.

EDIT ——— I think trading walls of text was a challenge. It seemed sensible to try and respond to "everything", but I can see that one specific at a time would have worked better. And I need to find someone in my vicinity to bash ideas with. I have settled in a new area, and miss that. So thanks.