by citilife
1712 days ago
It would be possible to build a similar system with a library my team has built: https://github.com/capitalone/dataprofiler

Effectively, you can monitor changes between profiles:

  import json
  import dataprofiler as dp

  data1 = dp.Data("file_a.csv")     # Load a CSV file
  profile1 = dp.Profiler(data1)     # Generate a profile

  data2 = dp.Data("file_b.csv")     # Load another CSV file
  profile2 = dp.Profiler(data2)     # Generate another profile

  diff_report = profile1.diff(profile2)
  print(json.dumps(diff_report, indent=4))

The system we have generates reports; it might be worth adding, OP.
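For anyone wondering what "monitoring changes between profiles" amounts to, here is a rough stdlib-only sketch. It is not how DataProfiler works internally; `profile_csv` and `diff_profiles` are made-up helper names, and the "profile" here is just an assumed row count plus a mean for numeric columns.

```python
import csv
import io
import statistics


def profile_csv(text):
    """Hypothetical helper: per-column row count, plus mean for numeric columns."""
    rows = list(csv.DictReader(io.StringIO(text)))
    profile = {}
    for column in (rows[0].keys() if rows else []):
        values = [row[column] for row in rows]
        stats = {"count": len(values)}
        try:
            stats["mean"] = statistics.fmean(float(v) for v in values)
        except ValueError:
            pass  # non-numeric column: keep only the count
        profile[column] = stats
    return profile


def diff_profiles(a, b):
    """Hypothetical helper: report the statistics that differ between two profiles."""
    report = {}
    for column in sorted(set(a) | set(b)):
        if column not in a or column not in b:
            report[column] = "only in first" if column in a else "only in second"
            continue
        keys = sorted(set(a[column]) | set(b[column]))
        changed = {
            k: (a[column].get(k), b[column].get(k))
            for k in keys
            if a[column].get(k) != b[column].get(k)
        }
        if changed:
            report[column] = changed
    return report


old = "id,price\n1,10\n2,20\n"
new = "id,price\n1,10\n2,30\n"
print(diff_profiles(profile_csv(old), profile_csv(new)))
```

Only columns whose statistics changed show up in the report, which is the part you would alert on. A real profiler tracks far more than a count and a mean.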