Hacker News new | ask | show | jobs
by COAGULOPATH 641 days ago
I've heard rumors that GPT4's training data included "a custom dataset of college textbooks", curated by hand. Nothing beyond that.

https://www.reddit.com/r/mlscaling/comments/14wcy7m/comment/...