fix(telemetry): anonymise properly model id #1654
Why not send the hash of the model id?
Imported models do not have a hash. We would have to compute it on import; that is feasible, but it would require some work on the import side and could lead to issues: what if the user imports a model manually by editing the file? Do we compute the hash at startup?
@axel7083 hash of the id
I mean, are we interested in this information for imported models? It would be different for each imported path. I feel like having a fixed value for imported models (
Yes, but having a distinct value may help to detect patterns in telemetry.
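To make the "hash of the id" suggestion concrete, here is a minimal sketch of what it could look like. It assumes Node's built-in `crypto` module; the helper name `hashModelId` is hypothetical and is not taken from this PR or from the project's code.

```typescript
import { createHash } from 'node:crypto';

// Hypothetical helper (not part of this PR): maps a model id, which for an
// imported model may contain a local file path, to a SHA-256 hex digest.
function hashModelId(modelId: string): string {
  return createHash('sha256').update(modelId, 'utf8').digest('hex');
}
```

The same id always yields the same digest, so each imported model keeps a distinct value in telemetry without the raw id or path being sent.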
3a39231 to 3b81fee
Can we store the hash in the model object so we only compute it once?
Here I replicated what we do for the [...]. There are several ways to import a model: through the UI or by editing the user catalog. Computing the hash of the model object (not the id) would therefore require a deeper change. IMO we should not hash the imported models and use them for telemetry, as we could easily deduce which models users are using, which seems to be personal/private information?
Information is personal/private if it lets you identify the user (for example, if the name of the model is the name of the user). If you can see, through the hash, that people are using models available on Hugging Face, detecting that someone is using a given model does not tell you who that person is.
Okay, yeah, I agree. But hashing the model would need a deeper change here, which would need its own issue, next sprint.
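For the follow-up issue, the "compute it once" idea from the review could be sketched roughly as below. Everything here is an assumption for illustration: the `TrackedModel` shape, the `idHash` field and the `getIdHash` helper are hypothetical, and the project's real model type and import paths may differ.

```typescript
import { createHash } from 'node:crypto';

// Hypothetical model shape; the real model object in the project may differ.
interface TrackedModel {
  id: string;
  idHash?: string; // cached digest, filled in on first use
}

// Returns the cached digest if present, otherwise computes and stores it.
// Computing lazily means it does not matter whether the model was imported
// through the UI or by editing the user catalog.
function getIdHash(model: TrackedModel): string {
  if (model.idHash === undefined) {
    model.idHash = createHash('sha256').update(model.id, 'utf8').digest('hex');
  }
  return model.idHash;
}
```

A lazy cache like this avoids touching each import path, at the cost of keeping a stale digest if the id is later changed on the same object.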
LGTM, Great work!
Signed-off-by: axel7083 <[email protected]>
172b3ab to 2531b76
What does this PR do?
Prevents leakage of user information in the telemetry.
Screenshot / video of UI
N/A
What issues does this PR fix or reference?
Fixes https://github.com/containers/podman-desktop-internal/issues/325
How to test this PR?