
List of Hebrew stop words and the script that computed them

Primary language: Python

HebrewStopWords

This is a list of the 500 most frequent words (stop words), computed from discussions on a variety of subjects on the Tapuz People website.

The original corpora contained 1,397,173 tokens.

Tokens containing English characters or digits were removed from the lists.
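The counting and filtering described above can be sketched roughly as follows. This is not the repository's script, only a minimal illustration. The function name `top_stop_words`, the whitespace tokenization of the sample, and the choice to filter before taking the top `n` (rather than after) are all assumptions.

```python
import re
from collections import Counter

# Matches any ASCII letter or digit; tokens containing one are dropped,
# mirroring the "English characters or digits" filter described above.
_LATIN_OR_DIGIT = re.compile(r"[A-Za-z0-9]")


def top_stop_words(tokens, n=500):
    """Return the n most frequent tokens as (token, count) pairs.

    Hypothetical helper: filtering happens before the cutoff here,
    which is an assumption about the original procedure.
    """
    counts = Counter(t for t in tokens if not _LATIN_OR_DIGIT.search(t))
    return counts.most_common(n)


# Tiny made-up sample, split on whitespace (an assumed tokenization).
sample = "של על של abc את של 123 על x1".split()
print(top_stop_words(sample, n=2))
```

On a real corpus, `tokens` would be the full token stream and `n` would stay at its default of 500.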

heb_stopwords.txt - list of stopwords

heb_stopwords_counts.txt - list of stopwords + counts
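
One possible way to use the list is sketched below. It assumes that heb_stopwords.txt is UTF-8 encoded with one word per line, which the description above does not confirm. The helper names `load_stop_words` and `remove_stop_words` are hypothetical.

```python
from pathlib import Path


def load_stop_words(path):
    """Read stop words into a set, assuming one word per line.

    "utf-8-sig" reads plain UTF-8 and also strips a leading BOM if present.
    """
    text = Path(path).read_text(encoding="utf-8-sig")
    return {line.strip() for line in text.splitlines() if line.strip()}


def remove_stop_words(tokens, stop_words):
    """Keep only the tokens that are not in the stop-word set."""
    return [t for t in tokens if t not in stop_words]
```

With the file in the working directory, `stop_words = load_stop_words("heb_stopwords.txt")` would build the set, and `remove_stop_words(tokens, stop_words)` would filter an already tokenized text.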