linguistics
There are 1291 repositories under the linguistics topic.
psychopy/psychopy
For running psychology and neuroscience experiments
nltk/nltk_data
NLTK Data
xiamx/awesome-sentiment-analysis
😀😄😂😭 A curated list of Sentiment Analysis methods, implementations and misc. 😥😟😱😤
Tatoeba/tatoeba2
Tatoeba is a platform whose purpose is to create a collaborative and open dataset of sentences and their translations.
LexPredict/lexpredict-lexnlp
LexNLP by LexPredict
BLKSerene/Wordless
An Integrated Corpus Tool With Multilingual Support for the Study of Language, Literature, and Translation
open-dict-data/ipa-dict
Monolingual wordlists with pronunciation information in IPA
rime/rime-cantonese
Rime Cantonese input schema | 粵語拼音輸入方案
proycon/pynlpl
PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language models. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).
theimpossibleastronaut/awesome-linguistics
A curated list of anything remotely related to linguistics
jacksonllee/pycantonese
Cantonese Linguistics and NLP
tshatrov/ichiran
Linguistic tools for texts in Japanese language
CUNY-CL/wikipron
Massively multilingual pronunciation mining
quadrismegistus/prosodic
Prosodic: a metrical-phonological parser, written in Python. For English and Finnish, with flexible language support.
OpenCorpora/opencorpora
A web-based engine for creating and annotating textual corpora
hangulize/hangulize
Hangulize transcribes non-Korean words into Hangul
MaxBittker/nyt-first-said
Tweets when words are published for the first time in the NYT
sublee/hangulize
Korean Alphabet Transcription
google/corpuscrawler
Crawler for linguistic corpora
what-studio/tossi
Chooses correct Korean particle morphs for arbitrary words.
glottolog/glottolog
Collaborative data curation for Glottolog
CoEDL/elpis
🙊 software for creating speech recognition models.
pyconll/pyconll
A minimal, pure Python library to interface with CoNLL-U format files.
TheOpenDictionary/odict
A blazingly-fast, offline-first format and toolchain for lexical data 📖
albirrkarim/react-speech-highlight-demo
React / Vanilla JS text-to-speech that highlights the words and sentences being spoken, using audio files, a text-to-speech API, and the Web Speech Synthesis API
phoible/dev
PHOIBLE data and development.
hbuschme/TextGridTools
Read, write, and manipulate Praat TextGrid files with Python
proycon/colibri-core
Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e. patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller``, which allows you to build, view, manipulate and query pattern models.
anebz/papers
Curated repository of notes from papers I'm reading, mostly NLP related. Updated regularly.
proycon/flat
FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.github.io/folia), a rich XML-based format for linguistic annotation. Flat allows users to view annotated FoLiA documents and enrich these documents with new annotations; a wide variety of linguistic annotation types is supported through the FoLiA paradigm.
yohasebe/rsyntaxtree
Syntax tree generator for linguistic research
nlposs/NLP-OSS
Democratizing NLP!
josecannete/spanish-corpora
Unannotated Spanish 3 Billion Words Corpora
eliranwong/OpenGNT
Open Greek New Testament Project; NA28 / NA27 Equivalent Text & Resources
vasishth/bayescogsci
Introduction to Bayesian Data Analysis for Cognitive Science by Nicenboim, Schad, Vasishth