text-normalization
There are 51 repositories under the text-normalization topic.
jfilter/clean-text
🧹 Python package for text cleaning
speechio/chinese_text_normalization
Chinese text normalization for speech processing
NVIDIA/NeMo-text-processing
NeMo text processing for ASR and TTS
ikegami-yukino/neologdn
Japanese text normalizer for mecab-neologd
snakers4/russian_stt_text_normalization
Russian text normalization pipeline for speech-to-text and other applications based on tagging s2s networks
greenlikeorange/knayi-myscript
Myanmar Language Script Library
cognibit/Text-Normalization-Demo
Demonstration of the results in "Text Normalization using Memory Augmented Neural Networks", Authors: Subhojeet Pramanik, Aman Hussain
Isminoula/TextNormSeq2Seq
Code and model files for the paper: I. Lourentzou et al., "Adapting Sequence to Sequence models for Text Normalization in Social Media", ICWSM'19
csebuetnlp/normalizer
This Python module is an easy-to-use port of the text normalization used in the paper "Not low-resource anymore: Aligner ensembling, batch filtering, and new datasets for Bengali-English machine translation". It is intended for normalizing and cleaning Bengali and English text.
tomaarsen/TTSTextNormalization
Convert English text from written expressions into spoken forms
kscanne/caighdean
The translation engine behind Caighdeánaitheoir na Gaeilge (the Irish-language standardizer), along with Scottish Gaelic/Manx→Irish translators
sugatagh/E-commerce-Text-Classification
Proper categorization of e-commerce products enhances the user experience and achieves better results with external search engines. The objective of the project is to classify a product into four given categories, based on its description available on an e-commerce platform.
ecomp-shONgit/text-normalisation
JS / Python3 / PHP Lib to work with UTF8 polytonic greek and latin
312shan/Text-Normalization-in-pyTorch
PyTorch implementation for the Text Normalization Challenge
seanghay/tha
📢 Tha (ថា) - A Khmer Text Normalization and Verbalization Toolkit
ducnt18121997/Viet-Text-Normalization
A Python library for text normalization, specifically designed for Vietnamese and English text processing. This library provides comprehensive text normalization capabilities including handling of special characters, numbers, dates, and various text formats.
esentis/string_extensions
Useful String extensions to save you time in production.
cewarman/NTPU_online_text_normalization
An online text normalization tool for Chinese-English mixed text-to-speech system
rafalposwiata/text-normalization
Repository for text normalization research.
ZRktty/accent-folding
A JavaScript library for accent-insensitive text processing, including accent folding and search term highlighting
cadia-lvl/althingi-asr
An ASR recipe and speech corpus of Icelandic parliamentary speeches
khanhtran2000/FPT.AI_2020
My work during internship at FPT.AI 2020
kscanne/droichead
Links between Foclóir Uí Dhónaill and DIL
vietbtx/ViTextnormASR
Our source code for the paper "Transformer-based Joint Learning Approach for Text Normalization in Vietnamese ASR"
Amir79Naziri/TextNormalization_Project
Implementing text normalization for the Farsi (Persian) language.
areeba0/English-to-French-Translation-using-NLTK-and-Hugging-Face-Transformers-MarianMTModel
This repository provides a complete workflow for text processing using Hugging Face Transformers and NLTK. It includes modules for sentence normalization, spelling correction, word embedding generation, positional encoding computation, and English-to-French translation.
JasperHG90/Phonorm
Phonetic normalization using Recurrent Neural Networks
pgolo/sic
Utility for string normalization
princ3od/VietnamNumber
A .NET (C#) library for converting numbers to Vietnamese.
techecosystem/parsify-php
A PHP library for Persian text conversion, including number translation, diacritics removal, and normalization with a fluent API.
neelpy/SMS-Text-Normalization-HMM-MEMM
Implementation of the paper on Text normalization by Choudhury et al.
rezasarkhosh/NLP-QA
Cryptocurrency Market Analysis and Question Answering System
sudheer1098/Spam-Classifier-NLP
A web app for Spam classification using Natural Language Processing.
vn33/Ecommerce-Product-Categorization
Accurate categorization of eCommerce products improves user experience and boosts search engine visibility. The project goal is to classify products into 14 predefined categories using their descriptions sourced from an eCommerce platform.