The mara example project [1] does exactly that. It combines PuPI download stats with Github repo activity data.
[1] https://github.com/mara/mara-example-project
The file directory structure is a bit confusing -- could you point me to the file that performs this transformation?
The __init__.py contains the pipeline, and the rest is the SQL files that do the transformations
The file directory structure is a bit confusing -- could you point me to the file that performs this transformation?