text-segmentation
There are 62 repositories under the text-segmentation topic.
catalyst-team/catalyst
Accelerated deep learning R&D
wolfgarbe/SymSpell
SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
blmoistawinde/HarvestText
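Text mining and preprocessing toolkit (text cleaning, new word discovery, sentiment analysis, entity recognition and linking, keyword extraction, knowledge extraction, syntactic parsing, etc.), using unsupervised or weakly supervised methods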
ogkalu2/comic-translate
Desktop app for automatically translating comics - BDs, Manga, Manhwa, Fumetti and more in a variety of formats (Image, Pdf, Epub, cbr, cbz, etc) and in multiple languages.
mammothb/symspellpy
Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
cbaziotis/ekphrasis
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from two large corpora (English Wikipedia and Twitter, 330 million English tweets).
ZumingHuang/awesome-ocr-resources
A collection of resources (including the papers and datasets) of OCR (Optical Character Recognition).
notAI-tech/deepsegment
A sentence segmenter that actually works!
koomri/text-segmentation
Implementation of the paper: Text Segmentation as a Supervised Learning Task
sedflix/awesome-topic-segmentation
(yet another not really) awesome topic/text segmentation list
wolfgarbe/WordSegmentationTM
Fast Word Segmentation with Triangular Matrix
google/emoji-segmenter
Emoji Segmenter
DCY1117/MangaQuick
Automatic Manga Translator
viig99/SymSpellCppPy
Fast SymSpell written in C++ and exposed to Python via pybind11
Jumpst3r/printed-hw-segmentation
Printed and handwritten text segmentation using fully convolutional networks and CRF post-processing
ReemHal/Semantic-Text-Segmentation-with-Embeddings
Uses GloVe embeddings and greedy sequence segmentation to semantically segment a text document into k segments, for any chosen k.
ReubenBond/HanBaoBao
Mandarin Chinese text segmentation (Chinese word segmentation) and mobile dictionary Android app
eskriett/spell
Spelling correction and string segmentation written in Go
rlayers/pawpaw
Text Processing & Segmentation Framework
wolfgarbe/WordSegmentationDP
Word Segmentation with Dynamic Programming
hyunbool/Text-Segmentation
Summaries of papers related to text segmentation
Feoramund/ucg
UTF-8 grapheme counting library written in C99.
smart-models/Normalized-Semantic-Chunker
Cutting-edge tool that unlocks the full potential of semantic chunking
nitely/nim-segmentation
Unicode text segmentation (tr29)
npillmayer/uax
Unicode Text Segmentation Algorithms
zamgi/lingvo--TextSegmenter
Text segmentation into separate words using a simple unigram model and the Viterbi algorithm
shayneobrien/text-segmentation
Neural and nonneural text segmentation methods.
Yannael/automatic-video-chaptering
Automate video chaptering with LLMs and TF-IDF: Transform raw transcripts into well-structured documents
kushalchauhan98/ticket-segmentation
Data for the ACL 2020 paper - Improving Segmentation for Technical Support Problems
athamana/ImgAnalysisToolkit
Image Analysis Toolkit for text document Binarization & Segmentation written in TypeScript.
DhavalTaunk08/Text-Segmentation-in-Images
This project aimed to perform text segmentation in images using AutoEncoders.
QuantumWizard888/How-to-add-user-dictionary-to-MeCab
How to add user dictionary to MeCab
Chayan-halder/WBSUBNdb_text---Bangla-handwritten-text-document-dataset
"WBSUBNdb_text: Bangla handwritten text document dataset" is a Bangla text dataset containing 1383 offline handwritten text documents contributed by 190 writers. The dataset is composed of both simple and compound characters.
christophsk/segment-string
Demonstration of dynamic programming for segmenting strings into words. Just for fun!
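Several entries above describe word segmentation by dynamic programming. As a rough illustration of the general idea only, here is a minimal Python sketch. It is not taken from any of the listed repositories: the function name `segment`, the `max_word_len` parameter, and the toy cost table `TOY_COSTS` are all assumptions made for this example. Costs stand in for negative log probabilities from a unigram model, so lower is better.

```python
import math


def segment(text, word_costs, max_word_len=20):
    """Split text into the lowest-cost sequence of known words.

    word_costs maps each known word to a cost (hypothetically, a negative
    log probability). Returns a list of words, or None if no split exists.
    """
    n = len(text)
    best = [math.inf] * (n + 1)  # best[i]: lowest cost to segment text[:i]
    back = [-1] * (n + 1)        # back[i]: start index of the last word in that split
    best[0] = 0.0

    for end in range(1, n + 1):
        for start in range(max(0, end - max_word_len), end):
            word = text[start:end]
            if word in word_costs:
                cost = best[start] + word_costs[word]
                if cost < best[end]:
                    best[end] = cost
                    back[end] = start

    if math.isinf(best[n]):
        return None  # the text cannot be covered by known words

    # Walk the back-pointers from the end to recover the words.
    words = []
    pos = n
    while pos > 0:
        words.append(text[back[pos]:pos])
        pos = back[pos]
    return words[::-1]


# Toy cost table, invented purely for illustration.
TOY_COSTS = {
    "the": 1.0, "them": 3.0, "me": 2.0, "men": 3.0,
    "end": 2.5, "a": 1.5, "mend": 4.0,
}

if __name__ == "__main__":
    print(segment("themend", TOY_COSTS))
```

With this toy table, "themend" can be split as "the mend" (cost 5.0) or "them end" (cost 5.5), and the sketch picks the cheaper one. A real segmenter would derive the costs from corpus word frequencies and handle unknown words instead of giving up.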