One example is the data hosted in Google Cloud.
https://cloud.google.com/blog/topics/public-datasets/github-...