Pinned Repositories
apertium-tsx-lint
lint for Apertium tagger specification files
arboratrix
Arboratrix is a click-drag-and-drop graphical environment to create and maintain parse trees as XML or as LaTeX.
fnTBL
gramadanj
Java port of Gramadan
internostrum-to-lttoolbox
interNOSTRUM to lttoolbox dictionary converter prototype
lemonGAWN
WordNet Gaeilge in lemon; linked lexical data for Irish
ngramtool
tag-clusterer
tag-clusterer. It clusters tags. Generates .tsx.
tesseract-ocr
Tesseract clone
wolnelektury-speech-corpus
jimregan's Repositories
jimregan/wolnelektury-speech-corpus
jimregan/notes
jimregan/not-the-30pct-seminar
jimregan/OverFlow
Putting flows on top of neural transducers for better TTS
jimregan/tesseract-gle-uncial
Automatically exported from code.google.com/p/tesseract-gle-uncial
jimregan/cmudict
CMU US English Dictionary
jimregan/corpuscrawler
Crawler for linguistic corpora
jimregan/epitran
A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)
jimregan/g2p_correction
Web-application for grapheme-to-phoneme correction using user feedback
jimregan/irish-asr-data
jimregan/langdata
Source training data for Tesseract for lots of languages
jimregan/language-resources
Datasets and tools for basic natural language processing.
jimregan/magi-alignment-utils
jimregan/Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
jimregan/morfeusz_git
jimregan/NeMo
NeMo: a toolkit for conversational AI
jimregan/NeMo-text-processing
NeMo text processing for ASR and TTS
jimregan/Neural-HMM
Neural HMMs are all you need (for high-quality attention-free TTS)
jimregan/pygramadan
jimregan/rbg2p
Utilities for rule based, manually written, grapheme to phoneme rules
jimregan/sbtal_riksdag_asr
jimregan/sjoestedt-jonval-description
jimregan/sync-asr
jimregan/UD_Irish
Irish data
jimregan/UD_Irish-Cadhan
jimregan/uninum
A database of number names for 186 languages, locales, and scripts
jimregan/wav2vec2-riksdag-api-alignments
jimregan/waxholm
jimregan/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
jimregan/wordnet-gaeilge
Automatically exported from code.google.com/p/wordnet-gaeilge