presidio sometimes misses very crucial information such telephone number #1474
Unanswered
sadathknorket
asked this question in
Q&A
Replies: 2 comments 1 reply
-
|
Hi, for telephone numbers specifically, Presidio is configured to search for numbers belonging to some countries but not all. See this issue, where the For addresses and others, I would suggest to look into transformers based models. See this example (still in PR) for a full presidio customization: https://github.com/microsoft/presidio-research/blob/omri/evaluator_updates/notebooks/5_Evaluate_Custom_Presidio_Analyzer.ipynb |
Beta Was this translation helpful? Give feedback.
0 replies
-
|
How does it know about the region ? Through the phone number itself ? |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Given that I used this tutorial https://github.com/microsoft/presidio/blob/main/docs/samples/python/example_pdf_annotation.ipynb
and this pdf example
personalpdf.pdf
The outputted pdf was
output_annotated.pdf
We could notice telephone number was missed to be annotated .
Is there any extra parameter or something I could do to avoid such mistakes ?
Beta Was this translation helpful? Give feedback.
All reactions