JohnSnowLabs/spark-nlp-workshop

Named Entity Labels using the RegexMatcher annotator

Dekermanjian opened this issue · 1 comment

I am trying to train a NERDL model. I can assemble my data with the DocumentAssembler -> SentenceDetector -> Tokenizer annotators. I need to generate my label column, which in my case is binary: 'software tool' or 'Other'. I am using the RegexMatcher to detect the software tools I have labeled, but I am unsure how to generate the 'Other' class. I am also not sure the RegexMatcher will work, because the annotator's result is a chunk rather than a named entity. I looked through the documentation and could not find a labeler annotator for NERDL.
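To make the goal concrete, here is a minimal sketch in plain Python, outside of Spark NLP, of the token-level labeling described above: every token that overlaps a regex match gets 'software tool', and every other token gets 'Other'. The pattern list, the whitespace tokenizer, and the `label_tokens` helper are hypothetical stand-ins for illustration only, not Spark NLP APIs.

```python
import re

# Hypothetical tool names for illustration; in practice these would be
# the same rules supplied to the RegexMatcher.
TOOL_PATTERN = re.compile(r"\b(?:Spark NLP|TensorFlow|PyTorch)\b")

# Assumption: simple whitespace tokenization stands in for the Tokenizer annotator.
TOKEN_PATTERN = re.compile(r"\S+")


def label_tokens(sentence):
    """Return (token, label) pairs for one sentence.

    A token is labeled 'software tool' if its character span overlaps
    any regex match; otherwise it is labeled 'Other'.
    """
    match_spans = [m.span() for m in TOOL_PATTERN.finditer(sentence)]
    labeled = []
    for tok in TOKEN_PATTERN.finditer(sentence):
        start, end = tok.span()
        overlaps = any(start < m_end and end > m_start
                       for m_start, m_end in match_spans)
        labeled.append((tok.group(), "software tool" if overlaps else "Other"))
    return labeled


if __name__ == "__main__":
    for token, label in label_tokens("I use Spark NLP daily"):
        print(token, label)
```

The resulting pairs could then be written out one token per line to build a training file, though whether that is the right input for NERDL is exactly what I am asking about.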

Any help/suggestions are appreciated.

This is not the correct repository for posting this issue.