-
Notifications
You must be signed in to change notification settings - Fork 12
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
201 bugfix remove references to sites where data originated from #229
201 bugfix remove references to sites where data originated from #229
Conversation
Previous commit kept original files as well as anonymised versions.
Hi @pcw24601, thanks for this contribution - could you please change the target branch of this PR to release/0.7.0? This is the branch where we will stage releases and will merge the changes together into the main branch as each release is finalised. |
Hi @laurencejackson, I think I've done that now--let me know if it's OK (I'm off for a week now so there may be a delay in getting a reply). |
Thanks for this. Did you anonymise every DICOM in |
Yes--I anonymised every file using the algorithm below (very slightly adapted from Dicognito). A couple of points worth noting:
Algorithm
|
Thanks for info Paul. Nice, I've never seen dicognito before. Just used pydicom in the past when anonymising my own stuff. True re: Github history, but definitely better to sort before hazen becomes more widely adopted so fewer people don't have the files on their machines. |
There are definite advantages to anonymising by hand, for example, you can hunt through the private fields. However, it's a slow process especially for phantom data were it's unlikely patient data will be exposed. |
820ffbe
to
992574c
Compare
Error on my part – merged main into here, forgetting that Paul had updated the files in this PR. Hence, the files he changed were overwritten with newer files. I've force-pushed back to his previous most recent commit. |
…es-where-data-originated-from
@tomaroberts – reminder to self, merge all the other PRs into main for the next release, then merge this one last. Should be easier to confirm that file changes are stable. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Checked across repo to ensure any old filenames have been adjusted correctly. Tested locally with pytest. GHA tests passing. LGTM.
Remove references to site locations from file/directory names and anonymises all dicom files in tests\ directory.