Hacker News new | ask | show | jobs
by mellosouls 1157 days ago
I think they already don't blindly feed it just all the garbage raw data they can find, but prefer high quality, well-prepared sources.

If by that you mean Common Crawl, Wikipedia etc, that's hardly "high quality, well prepared", and very subject to the biases and flaws of the creators who will vary widely in expertise, intelligence and ability.