Issues
Normalize function in sentence_bleu
#3250 opened by Razbolt - 1
SnowballStemmer: how to avoid transliteration?
#3249 opened by satyrmipt - 0
Downloader race condition with multiple processes
#3248 opened by naktinis - 1
stem accuracy
#3239 opened by Moustafa1Rizk1 - 0
Add support for a `sort` argument in WordNet methods
#3193 opened by bryant1410 - 2
AttributeError: module 'numpy' has no attribute 'int'.
#3170 opened by Abonia1 - 0
Duplicates in wordnet hypernyms closure
#3244 opened by ekaf - 0
UTF-8 codec can't decode byte 0xe9 in position 122
#3241 opened by ikrammohamdi - 1
Best NLTK books
#3236 opened by StepHaze - 3
ImportError: mach-o file, but is an incompatible architecture (have 'arm64', need 'x86_64')
#3171 opened by HuaYuXiao - 1
A potential edge case for WordNetLemmatizer.lemmatize()
#3227 opened by bowenyi-umich - 2
Reversed y labels in dispersion_plot
#3235 opened by kvmilos - 2
Dear Jan Strunk
#3166 opened by hiDevman - 1
Download somehow blocked
#3185 opened by sjkoelle - 1
How can I write a Python script that checks the words in my Italian text files against an Italian dictionary?
#3234 opened by imtiaz231 - 2
Bug in nltk.draw.dispersion_plot with nltk 3.8.1, matplotlib-base 3.8.0, matplotlib-inline 0.1.6 and numpy 1.26
#3206 opened by m-d-grunnill - 2
Dispersion Plot was not populating in correct order on Y axis. I have corrected that order. Please use the below code in dispersion.py file.
#3212 opened by DS3006 - 7
KneserNeyInterpolated has problem with OOV words during testing and perplexity is always inf
#3211 opened by nilinykh - 2
word_tokenize() Failed to Split English Contractions When Followed by [\t\n\f\r]
#3189 opened by donglihe-hub - 2
module 'nltk' has no attribute 'data'
#3228 opened by peronc - 3
import error with numpy 1.24.4
#3226 opened by mcdominik - 5
Missing English words in words()
#3186 opened by BaGRoS - 1
Not able to download the NLTK data module (python as well as manual download)
#3220 opened by subhra-ranjan-padhy - 0
`TreebankWordDetokenizer().detokenize()` introduces unexpected spaces before periods.
#3210 opened by Alnusjaponica - 5
not download punkt
#3187 opened by NIRA02525 - 0
Tokenizer punkt zip file sometimes does not unpackage
#3208 opened by ryonsteele - 1
NLTK thinks `turn` is a noun when it should be a verb.
#3197 opened by alf1e - 4
NLTK is considering "hi" and "hello" as a noun.
#3198 opened by RishitAtwal - 1
ToktokTokenizer doesn't call one of the included replacement patterns and thus doesn't tokenize some punctuation, like opening guillemets
#3202 opened by alexrudnick - 3
`corpus_bleu` function does not catch all the exceptions when calling `weights[0][0]`
#3204 opened by zhaochenyang20 - 1
Import of Trie fails in mwe.py
#3200 opened by passionate-zebracorn - 1
Problems Running Examples Starting with Babelize
#3196 opened by mdebellis - 0
Add a function of splitting combined words.
#3195 opened by wxz - 2
Unable to download Stopwords and also unable to access stopwords zip file manually.
#3194 opened by mdabdulrahman - 1
Trouble with installation importing nltk
#3192 opened by davidam - 0
Potential Regex Denial of Service (ReDoS)
#3191 opened by ready-research - 24
I tried everything and still I get: [nltk_data] Error loading taggers: Package 'taggers' not found in [nltk_data] index
#3177 opened by venturaEffect - 0
In CoreNLPParser, how can I get output as different formats, e.g., 'wordsAndTags' or 'typedDependencies'
#3184 opened by Lopa07 - 0
Formatargspec Warning in import line
#3182 opened by nvenkat94 - 0
edit_distance_align() in distance.py gives wrong alignment path when substitution_cost is greater than 2
#3181 opened by yzhaoinuw - 3
Adding LEPOR - A machine translation evaluation metric.
#3176 opened by ulhaqi12 - 0
punkt model for Arabic
#3173 opened by abdollahpour - 0