Presidio sometimes misses crucial information such as telephone numbers #1474
Unanswered
sadathknorket
asked this question in
Q&A
Replies: 2 comments 1 reply
-
Hi, for telephone numbers specifically, Presidio is configured to search for numbers belonging to some countries, but not all. See this issue, where the
For addresses and other entities, I would suggest looking into transformer-based models. See this example (still in PR) for a full Presidio customization: https://github.com/microsoft/presidio-research/blob/omri/evaluator_updates/notebooks/5_Evaluate_Custom_Presidio_Analyzer.ipynb
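To make the point about countries concrete, here is a minimal sketch, assuming a recent `presidio-analyzer` release in which `PhoneRecognizer` accepts a `supported_regions` argument and relies on the `phonenumbers` library. The helper name `find_phone_numbers` and the region list `EXAMPLE_REGIONS` are hypothetical, chosen only for illustration. As far as I understand it, each listed region is tried in turn as the default region for parsing, so a number written in national format is only found if its region is in the list, while a number with a `+` country code carries its region in itself.

```python
# Hypothetical region list for illustration; replace it with the regions
# that actually occur in your documents (ISO 3166-1 alpha-2 codes).
EXAMPLE_REGIONS = ("US", "GB", "FR", "DE", "IN")


def find_phone_numbers(text, regions=EXAMPLE_REGIONS):
    """Return (matched_text, score) pairs found by Presidio's PhoneRecognizer.

    Assumption: the installed presidio-analyzer version exposes
    PhoneRecognizer(supported_regions=...).
    """
    # Imported inside the function because presidio-analyzer is a heavy
    # third-party dependency (pip install presidio-analyzer).
    from presidio_analyzer.predefined_recognizers import PhoneRecognizer

    recognizer = PhoneRecognizer(supported_regions=regions)
    # Calling the recognizer directly avoids loading an NLP model.
    results = recognizer.analyze(text, entities=["PHONE_NUMBER"])
    return [(text[r.start:r.end], r.score) for r in results]


# Example call:
#   find_phone_numbers("You can reach me on +44 20 7946 0958.")
```

To use this inside a full pipeline, a recognizer built this way can be registered on an `AnalyzerEngine` through `analyzer.registry.add_recognizer(...)`. Whether a specific number is then detected still depends on the validation rules in `phonenumbers`, so it is worth testing with the numbers from your own PDF.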
-
How does it know about the region? Through the phone number itself?
-
I followed this tutorial: https://github.com/microsoft/presidio/blob/main/docs/samples/python/example_pdf_annotation.ipynb
with this example PDF:
personalpdf.pdf
The output PDF was:
output_annotated.pdf
As you can see, the telephone number was not annotated.
Is there an extra parameter, or anything else I can do, to avoid such misses?