Hacker News new | ask | show | jobs
by mliker 373 days ago
OpenAI and Anthropic rely on multiple data vendors for their models so that no outside company is aware of how they train their proprietary models. Forbes reported the other day that OpenAI had been winding down their usage of Scale data: https://www.forbes.com/sites/richardnieva/2025/06/12/scale-a...
2 comments

And scale doesn’t even have the best data among these vendors so I also don’t get this argument
What are some other options ?
Good one Zuck.
Inodata
Yeah, but they know how to get the quality human labeled data at scale better than anyone — and they know what Anthropic and OpenAI wanted — what made it quality