Issues
- 0
`lex.csv` size may increase unnecessarily when turned into `dict.words`
#374 opened by BlueGreenMagick - 1
Not using correct right_id to calculate cost?
#368 opened by BlueGreenMagick - 3
Integer overflow in lindera-filter
#326 opened by JojiiOfficial - 0
Failed to publish crates to crates.io due to dependencies without versions in workspace.dependencies
#358 opened by mosuka - 4
Unable download UniDic form clrd.ninjal.ac.jp
#197 opened by mosuka - 1
How about using Workspace dependency?
#346 opened by higumachan - 1
Example for how to print word definitions
#343 opened by 641i130 - 2
- 3
Tokenizers throughput decrease a lot on long text.
#334 opened by fmassot - 0
Support compressed dictionaries
#333 opened by JojiiOfficial - 0
Add Japanese completion token filter
#254 opened by mosuka - 0
Add Extended mode
#321 opened by mosuka - 0
Migrate UniDic3
#216 opened by mosuka - 1
What is the expected Lindera throughput (MB/s)?
#318 opened by fmassot - 3
Failed to create tokenizer on v0.22.0
#311 opened by RShirohara - 1
Documentation issue around UserDictionaryConfig
#294 opened by tokuhirom - 0
Add Japanese iteration mark character filter
#252 opened by mosuka - 0
Add Japanese number token filter
#251 opened by mosuka - 0
Add Japanese compound noun token filter
#244 opened by mosuka - 0
Write documents
#274 opened by mosuka - 0
Add Korean part-of-speech keep token filter
#250 opened by mosuka - 0
Add Korean reading token filter
#262 opened by mosuka - 0
Add Korean part-of-speech stop token filter
#249 opened by mosuka - 0
Add Korean number token filter
#263 opened by mosuka - 0
Add IPADIC base form token filter
#245 opened by mosuka - 0
Add UniDic base form token filter
#246 opened by mosuka - 0
Add UniDic reading form token filter
#248 opened by mosuka - 0
Add IPADIC reading form token filter
#247 opened by mosuka - 1
Add upper case token filter
#243 opened by mosuka - 1
Add lower case token filter
#242 opened by mosuka - 0
Add Japanese part-of-speech keep token filter
#241 opened by mosuka - 0
Add Japanese part-of-speech stop token filter
#240 opened by mosuka - 0
Add n-gram token filter
#253 opened by mosuka - 1
Add analyzer framework
#168 opened by mosuka - 0
Support CC-CEDICT user dictionary
#162 opened by mosuka - 0
Support ko-dic user dictionary
#161 opened by mosuka - 0
Support UniDic user dictionary
#160 opened by mosuka - 5
Lindera doesn’t build
#202 opened by irevoire - 0
Build binary using UniDic with GitHub Actions
#201 opened by mosuka - 9
- 5
Question for user dictionary parsing when using non-compressed local dictionary
#196 opened by ypenglyn - 0
Support compressed user dictionary
#191 opened by mosuka - 0
- 10
Lindera-ipadict randomly as issue during build
#158 opened by ManyTheFish - 2
Docs for 0.10 failed
#152 opened by PSeitz - 0
Avoid building dictionaries not specified in features
#155 opened by mosuka - 3
- 0
- 0
Make some functions to private in Formatter
#120 opened by johtani - 0
Add field_length argment to parse_unk()
#117 opened by johtani