Hacker News
by mschild 1132 days ago
There certainly are other places where open source code is hosted and available, but the default for most people is GitHub. I think it's in a similar position to Google 10 years ago. Sure, there were other search engines, but Google was by and large the standard one.

That does put Microsoft in the unique position of having direct, unfettered access to any and all open source code on GitHub. Unless you or I get the same kind of direct access, without rate limiting and anti-bot protection, they do dominate and have an advantage over everyone else.

2 comments

Not sure if you posted before the edits, but I'm pretty convinced by them, seeing as how there are multiple alternatives with the same data.
It’s really not that hard to

git clone

git remote set-url origin…

It’s much harder to copy Google’s index.

You think it's practical to do this with almost all the public repos on GitHub?
That's not GitHub's fault or GitHub's problem, from an antitrust perspective. If they went out of their way to make it difficult, you might have an argument, but as far as I know, they don't. It's just practically difficult by the nature of the problem.
They rate limit, so they do make it difficult, though.
They rate limit to protect their infrastructure, not to make it difficult. This is not anticompetitive.
So they still have an advantage, then?

Microsoft can continually train on the majority of open source and/or public code with zero limitations, while others can’t?

Yeah, I think so.