Hacker News new | ask | show | jobs
by thw09j9m 399 days ago
It's Google. Assume the training set contains, as a subset, the entirety of all public digitized information. How would you like to them to share it?
2 comments

If they wanna do research where they claim they did something novel, without showing that they didn't just "look it up" in their massive training set, then yes, they should share what is and what isn't contained within.
As a tar, please.