urchade/GLiNER

GLiNER marking pronouns as PERSON

Closed this issue · 9 comments

GLiNER is tagging pronouns such as I, you, we, and they with the PERSON entity type.

These are not named entities and should not be picked up by the model.

Only named entities (person names/nouns) should be picked up.

[Five screenshots attached, dated 2024-06-12]

Unfortunately, this is how the training dataset was annotated. Try changing the label name.

When you say changing the label name, can you please elaborate?

Instead of using "person", try an alternative like "person name".
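A minimal sketch of how the label swap could be compared side by side. It assumes the `gliner` package's `predict_entities(text, labels, threshold=...)` call, which returns a list of dicts with a `"text"` key. The helper name `compare_labels`, the sample sentence, and the `urchade/` prefix on the model id are illustrative assumptions, not something from this thread.

```python
from typing import Any, Dict, List


def compare_labels(
    model: Any,
    text: str,
    candidate_labels: List[str],
    threshold: float = 0.5,
) -> Dict[str, List[str]]:
    """Run the model once per candidate label and collect the matched span texts.

    `model` is anything exposing predict_entities(text, labels, threshold=...),
    as the GLiNER class does. `compare_labels` itself is a hypothetical helper.
    """
    results: Dict[str, List[str]] = {}
    for label in candidate_labels:
        # One label per call, so the spans can be attributed to that label alone.
        entities = model.predict_entities(text, [label], threshold=threshold)
        results[label] = [entity["text"] for entity in entities]
    return results


# Usage with the real model (needs `pip install gliner` and a model download).
# The model id below is an assumption based on the model name in this thread:
#
# from gliner import GLiNER
# model = GLiNER.from_pretrained("urchade/gliner_multi_pii-v1")
# text = "I met Alice yesterday and she said we should call Bob."
# for label, spans in compare_labels(model, text, ["person", "person name"]).items():
#     print(label, spans)
```

Printing the spans per label makes it easy to see whether a given label still pulls in pronouns on your own text. Results will vary by model and input.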

Unfortunately, it's not working with a changed label.

Screenshot 2024-06-13 at 3 11 03 PM

However, increasing the threshold does seem to filter out pronouns, but wouldn't that mean trading off the detection rate?

Screenshot 2024-06-13 at 3 11 35 PM
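One possible workaround for the threshold trade-off, not something the maintainers suggested here, is to keep the threshold where detection is acceptable and drop pronoun spans afterwards. The sketch below assumes entities are dicts with `"text"` and `"label"` keys, as returned by GLiNER's `predict_entities`. The `PRONOUNS` set and the `drop_pronoun_entities` helper are hypothetical and cover English only.

```python
from typing import Any, Dict, Iterable, List

# Illustrative, non-exhaustive list of English pronouns (an assumption;
# extend or replace it for your own data and languages).
PRONOUNS = frozenset({
    "i", "me", "my", "mine", "myself",
    "you", "your", "yours", "yourself",
    "he", "him", "his", "himself",
    "she", "her", "hers", "herself",
    "we", "us", "our", "ours", "ourselves",
    "they", "them", "their", "theirs", "themselves",
})


def drop_pronoun_entities(
    entities: Iterable[Dict[str, Any]],
    target_labels: Iterable[str] = ("person",),
) -> List[Dict[str, Any]]:
    """Return the entities, minus spans that are bare pronouns under a target label."""
    labels = {label.lower() for label in target_labels}
    kept: List[Dict[str, Any]] = []
    for entity in entities:
        is_target = entity["label"].lower() in labels
        is_pronoun = entity["text"].strip().lower() in PRONOUNS
        if is_target and is_pronoun:
            continue  # skip pronouns tagged as a person-like label
        kept.append(entity)
    return kept
```

With this in place, the raw predictions can come from a lower threshold (for example `model.predict_entities(text, ["person"], threshold=0.3)`) and then be passed through `drop_pronoun_entities`, so pronouns are removed without raising the threshold for real names.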

There are infinitely many possible label names. Try to find one that works.

Or change the model you use. Which model version is this?

I am using gliner_multi_pii-v1 through a spaCy pipe integrated with Presidio.

I see Presidio uses its own entity labels.

If you are interested in PII, you can check this blog post: https://ai.gopubby.com/zero-shot-named-entity-recognition-ner-c81f82cb0f70

Thanks for sharing. Let me try other models in our integration, especially gliner_medium_news-v2.1.

Hi,

Although I have closed this issue, can you please help me understand whether it can detect entities in Simplified Chinese text?

Screenshot 2024-06-13 at 4 03 37 PM

Screenshot 2024-06-13 at 4 03 52 PM

Officially, the page says it supports 6 languages: English, French, German, Spanish, Italian, and Portuguese.