linguistic-analysis
There are 145 repositories under linguistic-analysis topic.
DmitryRyumin/INTERSPEECH-2023-24-Papers
INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
brucewlee/lingfeat
[EMNLP 2021] LingFeat - A Comprehensive Linguistic Features Extraction ToolKit for Readability Assessment
jtanwk/nytcrossword
An exploration of New York Times crossword answers from 1994-2017, i.e. the Will Shortz era.
LSYS/LexicalRichness
:smile_cat: :speech_balloon: A module to compute textual lexical richness (aka lexical diversity).
THU-KEG/ChatLog
⏳ ChatLog: Recording and Analysing ChatGPT Across Time
sillsdev/FieldWorks
FieldWorks is a suite of software tools for language and cultural data, with support for complex scripts.
Halvani/Constituent-Treelib
A lightweight Python library for constructing, processing, and visualizing constituent trees.
nickduran/align-linguistic-alignment
Python library for extracting quantitative, reproducible metrics of multi-level alignment between speakers in naturalistic language corpora.
STRZGR/Natural-Language-Processing-with-Python-Analyzing-Text-with-the-Natural-Language-Toolkit
My solutions to selected exercises to "Natural Language Processing with Python – Analyzing Text with the Natural Language Toolkit" by Steven Bird, Ewan Klein, and Edward Loper.
jcvasquezc/phonet
Keras-based python framework to compute phonological posterior probabilities from audio files
livingtongues/living-dictionaries
Speeding the availability of language resources for endangered languages. Tools such as this have the power to shift how we think about endangered languages. Rather than perceiving them as being antiquated, difficult to learn and on the brink of vanishing, we see them as modern, easily accessible for learning online in text and audio formats.
fidelisrafael/esperanto-analyzer
Morphological and syntactic analysis of Esperanto sentences
NEU-DSG/dailp-encoding
Digital Archive of American Indian Languages Preservation and Perseverance
hoangsonww/Amazon-Reviews-Analysis
🧐 This project analyzes Amazon Fine Food Reviews to investigate whether negative reviews are more emotionally intense and lexically repetitive than positive ones. Using R, we apply sentiment analysis and lexical diversity metrics to uncover patterns in consumer review language.
hoangsonww/Malawian-CiTonga-Tone-Production
🇲🇼 A project analyzing how onset consonant type affects tone realization in Malawian CiTonga verb stems, using pitch (F₀) data from phonetic fieldwork. Includes two experiments comparing mean F₀ across tonal and consonantal contexts, with statistically significant findings and clear visualizations.
hoangsonww/Pokemon-Name-Physique-Analysis
🐱 A project exploring relationships between Pokémon names and physical traits using R, with string-based pattern detection, group comparisons based on consonant “heaviness,” and regression models predicting weight from height and Attack. Includes hypothesis-driven name analyses and statistical summaries for both English and Japanese name sets.
hoangsonww/Brazilian-Portuguese-Nonce-Accessbility
🇧🇷 A project for analyzing acceptability judgments of Brazilian Portuguese nonce words using R, focusing on syllable length and initial segment type. Includes mosaic plots and chi-square tests to assess structural effects on responses, with results suggesting no significant influence from either factor.
korpling/graphANNIS
This is a new backend implementation of the ANNIS linguistic search and visualization system.
n3a9/vera
Winner of LA Hack's Award Best Use of Wolfram Tech 🎉 An AI system to determine if a given statement is true or false.
katreparitosh/Discourse-Analytics-of-Political-Speech-Transcripts
Political Discourse Analysis (PDA) of Political Speech Transcripts using Natural Language Processing (NLP)
i-amritpal/Feature-based-fake-review-detection
This project related to one of my B.Tech final year project that investigates the influence of linguistic and sentiment analysis features on detecting fake reviews in e-commerce (Amazon).
TALP-UPC/saga
SAGA - Phonetic transcription software for all Spanish variants.
matthias-stemmler/annimate
Your Friendly ANNIS Match Exporter
public-law/readability
How readable is your text? Provide a text input and get its grade level. Validated against the source data.
fidelisrafael/esperanto-analyzer-react
Front-end application for 'Esperanto Grammar Analyzer' built with React.js.
jklu-jaipur/Political-Biasness-Detection
Our ML model calculates the biasness of a political article based on linguistic features and classifies them as biased towards the ruling government, bias towards the opposition, or neutral.
audreycs/ImpScore
A repository for paper ImpScore: A Learnable Metric For Quantifying The Implicitness Level of Sentences accepted to ICLR 2025.
Itabashi-don/Shiina
板橋在住の女子高生、しいちゃんですっ( ˙꒳˙ )
unrealtecellp/life
Linguistic Field Data Management and Analysis System [LiFE]
arjo129/LangCluster
A visuallization for cognates in various languages and how they spread
devSuchit/nlp-cky-PCFG
This repository contains an implementation of the CKY parsing for English. (NLP)
GiellaLT-Archive/giella-shared
Shared linguistic resources, like names, digits, fst filtering and dependency parsing.
bhalla98/LinguisticTagger
Segments natural language text and tags it with different parts of speech.
Abe-Alefew/LexiLink
The aim of this mini-project is to to analyze the text and phonemic similarities between the Afan Oromo and Somali languages by examining word frequency, overlap, and phonemic distribution.
jjordanoc/robust-english-speech-fluency-classification
Fluency level classifier of L2 English speech
mmmaurer/elfen
A python package to efficiently extract linguistic features for text/NLP datasets