interpretml/interpret-text

Issue with text_classification_classical_text_explainer.ipynb

fatosismali opened this issue · 3 comments

When executing the following cell:
classifier, best_params = explainer.fit(X_train, y_train)

It results with the following error:
ValueError: empty vocabulary; perhaps the documents only contain stop words

Using the same data set as in the example notebook - haven't changed anything in the code.

I had a similar issue, using an older version of spacy (2.3.7) package on pypi fixed it, looks like the tokenizer code needs to be updated to latest spacy

see related issue:
#176

Hi... Is the issue solved? Am facing the same error ValueError: empty vocabulary; perhaps the documents only contain stop words. When trying to use explainer.fit(text_train, y_train_encoded) for classification.