jhnwnstd
MA in Linguistics with a specialization in character-level language modeling. Proficient in Python and R.
Pinned Repositories
DNEED
Files from DNEED
corpus_toolkit
Python toolkit for corpus analysis: tokenization, lexical diversity, vocabulary growth prediction, entropy measures, and Zipf/Heaps visualizations.
linguist_toolkit
Python tools for collecting text and audio data.
suxotin
Python script that distinguishes vowels from consonants using Suxotin's algorithm.
qgram
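For readers unfamiliar with Suxotin's algorithm, named in the suxotin description above, here is a minimal sketch of the textbook formulation. It is not taken from the suxotin repository; the function name `suxotin_vowels` and the tie-breaking by alphabetical order are assumptions made for illustration. The idea is that vowels and consonants tend to alternate, so the letter with the most neighbours overall is taken as a vowel, and its adjacencies are then discounted from every remaining letter.

```python
from collections import defaultdict


def suxotin_vowels(words):
    """Return the set of letters classified as vowels (hypothetical helper)."""
    # Symmetric adjacency counts between distinct neighbouring letters.
    # Doubled letters are skipped, which keeps the diagonal at zero.
    counts = defaultdict(lambda: defaultdict(int))
    for word in words:
        for x, y in zip(word, word[1:]):
            if x != y:
                counts[x][y] += 1
                counts[y][x] += 1

    # Every letter starts out as a consonant, scored by its row sum.
    sums = {letter: sum(row.values()) for letter, row in counts.items()}
    consonants = sorted(sums)  # sorted so that ties resolve deterministically
    vowels = set()

    while consonants:
        best = max(consonants, key=lambda c: sums[c])
        if sums[best] <= 0:
            break  # no remaining letter has a positive score
        vowels.add(best)
        consonants.remove(best)
        # Discount each remaining letter's adjacencies with the new vowel.
        for c in consonants:
            sums[c] -= 2 * counts[c][best]

    return vowels
```

For example, `suxotin_vowels(["baba", "dada"])` returns `{'a'}`. On real text the result depends on corpus size and orthography, so the output is best read as a heuristic classification.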