by SXX 28 days ago
Data. Google has access to an unfathomable amount of real human-created data with zero expectations of privacy (wink wink Apple): videos, photos, search, navigation, mobile app usage including on competitors' platforms, emails, etc.

Both Anthropic and OpenAI only have access to whatever they can buy or steal.

And it's becoming increasingly hard to get fresh uncontaminated data for training. No amount of money can buy that.

1 comments

I suspect that's less true now that synthetic data has worked so well, and multimodal data doesn't seem to transfer as well as many would've hoped.

Claude Code and Codex are a big advantage vs. Gemini CLI, which might be killedbygoogle soon? https://news.ycombinator.com/item?id=48196867

> Both Anthropic and OpenAI only have access to whatever they can buy or steal.

A trillion can buy you quite a lot! Like, offer some company a ton of money for its data, and if they say no, simply buy said company. Bonus points if it's someone like Atlassian, whose stock price is getting hammered largely because of you.