ValueError: Transformation generated invalid chunkstring

Question


Kowsalya-Mouttouramane opened this issue 3 years ago · 2 comments

Kowsalya-Mouttouramane commented 3 years ago

from keyphrase_vectorizers import KeyphraseCountVectorizer

# Single French test sentence
test = ["Les voitures autonomes déplacent la responsabilité de l'assurance vers les constructeurs"]
vectorizer_fr = KeyphraseCountVectorizer(spacy_pipeline='fr_dep_news_trf', pos_pattern='<N.*>+', stop_words='french')
vectorizer_fr.fit(test)

This raises a ValueError: Transformation generated invalid chunkstring:
<><><><><><><><><><><><>

This works with other languages; the problem occurs only with the French spaCy models, whichever French model I use.
Can anyone help me solve this error, please?

Answer 1 · 2022-05-16T16:02:35.000Z

Hi @Kowsalya-Mouttouramane,
Please see issue #2. For the French pipeline, you need to add the transformer pipeline component.
You can use my fork or create a pull request to activate custom components.
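
For anyone landing here with the same traceback, here is a minimal sketch of why the chunkstring in the error looks like a row of empty `<>` pairs. It assumes the chunker wraps each token's POS tag in angle brackets, so a pipeline that returns no tags gives the pattern `<N.*>+` nothing to match. The helpers `to_chunkstring` and `has_missing_tags`, the token split, and the tag values are all hypothetical and only for illustration; none of this is the library's own code.

```python
def to_chunkstring(tags):
    """Wrap each POS tag in angle brackets (assumed chunker input format)."""
    return "".join(f"<{tag}>" for tag in tags)


def has_missing_tags(tags):
    """Return True if at least one token came back without a POS tag."""
    return any(not tag for tag in tags)


# Assumed tokenization of the sentence from the question (12 tokens).
tokens = ["Les", "voitures", "autonomes", "déplacent", "la", "responsabilité",
          "de", "l'", "assurance", "vers", "les", "constructeurs"]

# Hypothetical tags a working tagger might return for these tokens.
healthy_tags = ["DET", "NOUN", "ADJ", "VERB", "DET", "NOUN",
                "ADP", "DET", "NOUN", "ADP", "DET", "NOUN"]

# What the tags look like if the tagging component produced nothing.
empty_tags = [""] * len(tokens)

print(to_chunkstring(healthy_tags))  # tags present, so <N.*>+ can match
print(to_chunkstring(empty_tags))    # only empty <> pairs, as in the error
print(has_missing_tags(empty_tags))
```

If the tags on your own spaCy `Doc` come back empty in the same way, that would be consistent with the missing pipeline component described in issue #2.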

Answer 2 · 2022-06-18T17:10:57.000Z

Closing this as a duplicate of issue #2.