segment-any-text/wtpsplit
Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.
PythonMIT
Issues
- 8
- 5
Newlines are treated like spaces
#131 opened by markus583 - 0
Update README for LoRA
#132 opened by markus583 - 4
Korean text is not split well
#133 opened by seungduk-yanolja - 7
Canine model and High VRAM usage
#115 opened by Qubitium - 2
- 1
- 1
- 4
- 8
Failed to Adapt to your own corpus via LoRA
#123 opened by 12eue - 3
Update transformers dependency
#124 opened by carschno - 6
sat-12l-sm running on GPU
#120 opened by Randwow - 2
Error when installing the requirements
#122 opened by RacheleSprugnoli - 1
CUDA device error when segmenting Greek text
#121 opened by ayalda - 2
SaT is slow
#118 opened by thegenerativegeneration - 2
Run models for Italian
#119 opened by RacheleSprugnoli - 5
Inconsistent results with same sentences
#103 opened by asusdisciple - 11
Model(s) use word capitlisation to segment
#101 opened by intelliqua - 3
- 4
Huggingface AutoModelForTokenClassification bug
#112 opened by asusdisciple - 1
Accuracy: Error in Split (EN)
#113 opened by Qubitium - 2
remove_repetition
#111 opened by mmichelli - 2
Recursion in init
#96 opened by jonvaughan - 3
How to use Universal Dependencies style ?
#108 opened by Lix1993 - 1
model in huggingface cannot load mixtures.skops
#110 opened by syeelou - 0
Rust bindings for wtpsplit
#109 opened by turulix - 1
Could not find a mixture for the Universal Dependencies (UD) style in Thai language
#107 opened by pavaris-pm - 2
- 1
Scoring metric, does definition make sense?
#104 opened by asusdisciple - 2
- 7
Async - Skops import is failing
#100 opened by MathiasExorde - 1
Opus100 FR not in mixtures
#99 opened by intelliqua - 1
Error loading model to GPU
#95 opened by damin604 - 1
- 3
Unusual splits in short sentence
#90 opened by rggdmonk - 4
- 1
get_threshold does not work
#91 opened by rggdmonk - 3
Apple Silicon / Arm support
#88 opened by yenson-lau - 1
- 2
- 4
Control where the model is downloaded too?
#59 opened by awhillas - 1
`AttributeError: 'InferenceSession' object has no attribute '_providers' Segmentation fault (core dumped)`
#74 opened by Errorbot1122 - 3
Hi, pip install nnsplit doesn't work
#73 opened by tartaron - 6
Python 3.10 wheel
#64 opened by cjrh - 2
Unable to use own trained onnx models
#71 opened by synweap15 - 2
Can't run it on macOS
#48 opened by TakamotoAI - 2
Error when load model
#49 opened by ziweiji - 0
Split Object to proper string
#54 opened by virdiprateek - 2
Security: update version of tract-onnx
#50 opened by cjrh - 3
ImportError in Python (NNSplit)
#45 opened by albertovilla