Issue with text_classification_classical_text_explainer.ipynb

Question

fatosismali opened this issue 3 years ago · 3 comments

When executing the following cell:
classifier, best_params = explainer.fit(X_train, y_train)

It results in the following error:
ValueError: empty vocabulary; perhaps the documents only contain stop words

I'm using the same dataset as in the example notebook and haven't changed anything in the code.

Answer 1 · 2022-02-02T14:28:31.000Z

I had a similar issue. Installing an older version of the spacy package (2.3.7) from PyPI fixed it. It looks like the tokenizer code needs to be updated to work with the latest spacy.
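
For anyone trying to confirm that the tokenizer is the culprit, here is a minimal diagnostic sketch. The error message comes from scikit-learn's text vectorizers, which raise it when no tokens survive tokenization. The helper `find_empty_token_docs` and both tokenizers below are hypothetical names made up for illustration, not part of the notebook or the library; the tokenizer the explainer actually uses would need to be substituted for the stand-in.

```python
from typing import Callable, Iterable, List


def find_empty_token_docs(
    tokenize: Callable[[str], List[str]], docs: Iterable[str]
) -> List[int]:
    """Return the indices of documents for which the tokenizer yields no tokens."""
    return [i for i, doc in enumerate(docs) if not tokenize(doc)]


def broken_tokenizer(text: str) -> List[str]:
    # Hypothetical stand-in for a tokenizer that silently drops every token,
    # for example after an incompatible library upgrade.
    return []


def whitespace_tokenizer(text: str) -> List[str]:
    # Simple baseline tokenizer, used only for comparison.
    return text.lower().split()


sample_docs = ["The movie was great", "Terrible plot and acting", ""]

empty_with_broken = find_empty_token_docs(broken_tokenizer, sample_docs)
empty_with_baseline = find_empty_token_docs(whitespace_tokenizer, sample_docs)

print(f"broken tokenizer: {len(empty_with_broken)}/{len(sample_docs)} documents are empty")
print(f"baseline tokenizer: {len(empty_with_baseline)}/{len(sample_docs)} documents are empty")
```

If every document comes back empty with the real tokenizer, the vectorizer has nothing to build a vocabulary from, which would be consistent with the error above and with the spacy version mismatch.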

Answer 2 · 2022-02-02T14:29:04.000Z

See related issue: #176

Answer 3 · 2022-11-29T11:20:55.000Z

Hi, has this issue been solved? I am facing the same error, ValueError: empty vocabulary; perhaps the documents only contain stop words, when calling explainer.fit(text_train, y_train_encoded) for classification.