I was looking at the data from https://ossinsight.io/explore/?id=6c060cc1-83b4-43f7-be4a-a401e4a5387f. I filtered to US only but across the board there was a large crash around August 2022. Curious if anyone knows why.
When you see stuff like this happens. It’s most likely something wrong with the tool that is gathering this data. They lost data access or something funky like the max number of pages they can iterate went down
EDIT: just saw that ossinsight also uses gharchive
I didn't filter for US region though. This is not really published by the github api, only the "location" field is published where users can put everything. ossinsight probably tried to parse the country codes?
I'd probably put money on that. Not that it stopped people from creating repos but that it did something with the permissions of the tool gathering the data and the rate limiting to reduce the amount of data ingested.
that query isn’t for new Github repos, it’s for the number of github repos the counter was 1. allowed to count and 2. knew how to count. all that said it is pretty interesting 90k repos went away somehow between August 1 and September 1 last year. Good question. (Also, interesting tool. Never heard of ossinsight before.)