|
|
|
|
|
by chdove
512 days ago
|
|
Hmm, well the super secret stuff we’re working on comes directly to mind, but if I set that aside, boring entity resolution is actually a big pain point. Regardless of their sophistication, 3rd party data products in football tend to rely on manually collected and maintained player metadata. It can be unreliable. If I could reliably have a durable unique ID for every player, manager, and team in world football along with reliable timestamps for every moment each said player entered and left play, that would be pretty great. When joining together disparate data sources, discrepancies in things this simple cause all sorts of pain downstream. |
|
https://moj-analytical-services.github.io/splink/
(Disclaimer: I am the lead author, but the tool is FOSS)