sentence-splitting
There are 14 repositories under sentence-splitting topic.
adobe/NLP-Cube
Natural Language Processing Pipeline - Sentence Splitting, Tokenization, Lemmatization, Part-of-speech Tagging and Dependency Parsing
erre-quadro/spikex
SpikeX - SpaCy Pipes for Knowledge Extraction
vngrs-ai/vnlp
State-of-the-art, lightweight NLP tools for Turkish language. Developed by VNGRS.
mediacloud/sentence-splitter
Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.
Prismadic/magnet
the small distributed language model toolkit; fine-tune state-of-the-art LLMs anywhere, rapidly
zaemyung/sentsplit
A flexible sentence segmentation library using CRF model and regex rules
gosbd/gosbd
A sentence splitting (sentence boundary disambiguation) library for Go. It is rule-based and works out-of-the-box.
astariul/Sentencize.jl
Smallish library for sentence splitting in Julia
KorAP/Datok
High-Performance Finite State Tokenizer
M4t1ss/chunker
A sentence chunker PHP class + visualizer for Berkeley Parser parse trees
mbanon/benchmarks
Several benchmarks on sentence splitting and language identification
ptts-easy/text-classification-analyser
Sentence split, Text classfication, performanc analysis for NLP
kimryan/Lingua-EN-Sentence
split text into sentences (a Perl module)
ZJaume/splitters
A CLI for Rust SRX sentence segmenation rules as Python package.