lindera-morphology/lindera

A multilingual morphological analysis library.

RustMIT

Issues

`lex.csv` size may increase unnecessarily when turned into `dict.words`
#374 opened 4 months ago by BlueGreenMagick
0
Not using correct right_id to calculate cost?
#368 opened 5 months ago by BlueGreenMagick
1
Integer overflow in lindera-filter
#326 opened a year ago by JojiiOfficial
3
Failed to publish crates to crates.io due to dependencies without versions in workspace.dependencies
#358 opened 6 months ago by mosuka
0
Unable download UniDic form clrd.ninjal.ac.jp
#197 opened 2 years ago by mosuka
4
How about using Workspace dependency?
#346 opened 6 months ago by higumachan
1
Example for how to print word definitions
#343 opened 9 months ago by 641i130
1
Google is returning a 500 error when downloading the dictionnary
#336 opened 10 months ago by Kerollmops
2
Tokenizers throughput decrease a lot on long text.
#334 opened a year ago by fmassot
3
Support compressed dictionaries
#333 opened a year ago by JojiiOfficial
0
Add Japanese completion token filter
#254 opened a year ago by mosuka
0
Add Extended mode
#321 opened a year ago by mosuka
0
Migrate UniDic3
#216 opened 2 years ago by mosuka
0
What is the expected Lindera throughput (MB/s)?
#318 opened a year ago by fmassot
1
Failed to create tokenizer on v0.22.0
#311 opened a year ago by RShirohara
3
Documentation issue around UserDictionaryConfig
#294 opened a year ago by tokuhirom
1
Add Japanese iteration mark character filter
#252 opened 2 years ago by mosuka
0
Add Japanese number token filter
#251 opened 2 years ago by mosuka
0
Add Japanese compound noun token filter
#244 opened 2 years ago by mosuka
0
Write documents
#274 opened 2 years ago by mosuka
0
Add Korean part-of-speech keep token filter
#250 opened 2 years ago by mosuka
0
Add Korean reading token filter
#262 opened 2 years ago by mosuka
0
Add Korean part-of-speech stop token filter
#249 opened 2 years ago by mosuka
0
Add Korean number token filter
#263 opened 2 years ago by mosuka
0
Add IPADIC base form token filter
#245 opened 2 years ago by mosuka
0
Add UniDic base form token filter
#246 opened 2 years ago by mosuka
0
Add UniDic reading form token filter
#248 opened 2 years ago by mosuka
0
Add IPADIC reading form token filter
#247 opened 2 years ago by mosuka
0
Add upper case token filter
#243 opened 2 years ago by mosuka
1
Add lower case token filter
#242 opened 2 years ago by mosuka
1
Add Japanese part-of-speech keep token filter
#241 opened 2 years ago by mosuka
0
Add Japanese part-of-speech stop token filter
#240 opened 2 years ago by mosuka
0
Add n-gram token filter
#253 opened 2 years ago by mosuka
0
Add analyzer framework
#168 opened 2 years ago by mosuka
1
Support CC-CEDICT user dictionary
#162 opened 2 years ago by mosuka
0
Support ko-dic user dictionary
#161 opened 2 years ago by mosuka
0
Support UniDic user dictionary
#160 opened 2 years ago by mosuka
0
Lindera doesn’t build
#202 opened 2 years ago by irevoire
5
Build binary using UniDic with GitHub Actions
#201 opened 2 years ago by mosuka
0
Downloading and decompressing dictionaries takes a lot of time
#182 opened 2 years ago by Kerollmops
9
Question for user dictionary parsing when using non-compressed local dictionary
#196 opened 2 years ago by ypenglyn
5
Support compressed user dictionary
#191 opened 2 years ago by mosuka
0
Add workflows that run benchmarks on the main branch
#181 opened 2 years ago by ManyTheFish
0
Lindera-ipadict randomly as issue during build
#158 opened 2 years ago by ManyTheFish
10
Docs for 0.10 failed
#152 opened 2 years ago by PSeitz
2
Avoid building dictionaries not specified in features
#155 opened 2 years ago by mosuka
0
Reconsider default LZMA dependency without any option to avoid it
#141 opened 2 years ago by ManyTheFish
3
Compresses dictionaries for morphological analysis by default.
#136 opened 2 years ago by mosuka
0
Make some functions to private in Formatter
#120 opened 3 years ago by johtani
0
Add field_length argment to parse_unk()
#117 opened 3 years ago by johtani
0