Hacker News new | ask | show | jobs
by roter 832 days ago
> To help improve Daft, we collect non-identifiable data.

> To disable this behavior, set the following environment variable: DAFT_ANALYTICS_ENABLED=0

> [0] In short, we collect the following:

> On import, we track system information such as the runner being used, version of Daft, OS, Python version, etc.

> On calls of public methods on the DataFrame object, we track metadata about the execution: the name of the method, the walltime for execution and the class of error raised (if any). Function parameters and stacktraces are not logged, ensuring that user data remains private.

Is this telemetry really necessary?

[0] https://www.getdaft.io/projects/docs/en/latest/faq/telemetry...

2 comments

I don't think they call it "really necessary" but I assume it's handy to understand the user base and focus on the right use cases in the long run considering not all the users who use these libraries are active on Github.
It has an Apache 2.0 license. I assume it allows it to be repackaged removing telemetry.