mshakirDr
C#, R, Python, (Perl) coder, corpus linguist, post doctoral researcher at University of Münster
Pinned Repositories
whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
langdata_lstm
MFTE
MFTE (Multi Feature Tagger of English) Python is the Python version based on Le Foll's MFTE written in Perl. It is extended to include semantic tags from Biber (2006) and Biber et al. (1999), including other specific tags.
MultiFeatureTaggerEnglish
A Multi-Feature Tagger of English originally designed for multi-feature/multi-dimensional analysis (MDA) (Biber 1988; 1995) of situational variation in standard written and spoken English
stanza
Official Stanford NLP Python Library for Many Human Languages
UrduOCRRelated
Data and text files for training Tesseract 4 for Urdu language
UrduTessTrainingText
A simple CLI to generate training text for Tesseract 4
private-gpt
Interact with your documents using the power of GPT, 100% privately, no data leaks
mshakirDr's Repositories
mshakirDr/MFTE
MFTE (Multi Feature Tagger of English) Python is the Python version based on Le Foll's MFTE written in Perl. It is extended to include semantic tags from Biber (2006) and Biber et al. (1999), including other specific tags.
mshakirDr/UrduOCRRelated
Data and text files for training Tesseract 4 for Urdu language
mshakirDr/UrduTessTrainingText
A simple CLI to generate training text for Tesseract 4
mshakirDr/langdata_lstm
mshakirDr/MultiFeatureTaggerEnglish
A Multi-Feature Tagger of English originally designed for multi-feature/multi-dimensional analysis (MDA) (Biber 1988; 1995) of situational variation in standard written and spoken English
mshakirDr/stanza
Official Stanford NLP Python Library for Many Human Languages