Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Remove Metadata such as scan date, image ID #2

Open
jetic83 opened this issue Jul 2, 2018 · 3 comments
Open

Remove Metadata such as scan date, image ID #2

jetic83 opened this issue Jul 2, 2018 · 3 comments

Comments

@jetic83
Copy link

jetic83 commented Jul 2, 2018

Is there a way to delete certain fields in the scans such as comments or scan date or filename? These metainformation can be considered as PHI, or as non-anonymized data.

@a-dev-walker
Copy link

Has anyone ever followed up on this issue because it would make anonymize-slide much more useful. As of now, the remaining PHI within the metadata are proving to be an issue.

@markemus
Copy link

It can be done. Examples from a PR:

fh.directories[1].entries[XMLPACKET].overwrite_entry(our_xmp) fh.directories[1].entries[IMAGE_DESCRIPTION].overwrite_entry(our_image_desc)

These overwrite the XMLPACKET and IMAGE_DESCRIPTION tags on directory #1. To delete, overwrite with safe values. Make sure to overwrite with a message the same length as the original otherwise the data might still be there.

@Tomatenbiss
Copy link

Tomatenbiss commented Nov 14, 2022

@jetic83, @a-dev-walker: Within the EMPAIA project, we have now developed our own solution for anonymizing WSIs (in various formats) including all the sensitive metadata. This is currently available via Gitlab . The paper for this is currently in review, the preprint can already be viewed at arXiv.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants