smeylan
I'm a computational psycholinguist who works on language processing and language acquisition
@cpllabBerkeley, CA & Cambridge, MA
Pinned Repositories
cdl_asr
An automatic speech recognition system with (PLL) neural rescoring from a BERT model of child language
child-directed-listening
Codebase for "How adults understand what young children say" (ms under review), "Child-directed Listening:How Caregiver Inference Enables Children's Early Verbal Communication" (CogSci 2021), "Characterizing Child-Directed Listening with Corpus and Model-based Analyses" (ICIS 2020)
childes-db-derived
Django app for populating derived datasets in childes-db
determiner_learning
data preparation, model, and analysis for "The emergence of an abstract grammatical category in children’s early speech"
gibbs_lda
Gibbs Sampler for LDA topic modeling, using Numba
LMZoo
Language Modeling Zoo
opus
library for extracting 2013 subtitle corpus
pyCelex
A python module for reading and organizing data from CELEX2.
shannonGame
using probabilistic language resources to play the guessing game described in Shannon (1952)
swbd
Switchboard Lexical Divergence
smeylan's Repositories
smeylan/child-directed-listening
Codebase for "How adults understand what young children say" (ms under review), "Child-directed Listening:How Caregiver Inference Enables Children's Early Verbal Communication" (CogSci 2021), "Characterizing Child-Directed Listening with Corpus and Model-based Analyses" (ICIS 2020)
smeylan/LMZoo
Language Modeling Zoo
smeylan/cdl_asr
An automatic speech recognition system with (PLL) neural rescoring from a BERT model of child language
smeylan/childes-db-derived
Django app for populating derived datasets in childes-db
smeylan/lm-scorer
📃Language Model based sentences scoring library
smeylan/pic-analysis
Analyses for "Word forms - not just their lengths - are optimized for efficient communication"
smeylan/vizier
Longitudinal study management platform... for children
smeylan/wordsim_serverized
Word Similarity Graph Server
smeylan/articulationGAN
smeylan/big_lm
Automate setup and extend (Google) big_lm
smeylan/cdl_r21_powercalc
smeylan/childes_db_tutorial
Materials for childes-db tutorial
smeylan/ciwganfiwgan-pytorch
smeylan/extractPDF
Minimal Python utility for breaking a single PDF into multiple smaller PDF based on start and end page indices
smeylan/fiwGAN-ciwGAN
fiwGAN/ciwGAN (Featural and Categorical InfoWaveGAN): Generative Adversarial Phonology and Semantics
smeylan/frequency-vs-info-content
smeylan/heroku-buildpack-geolite
smeylan/LEXSIG
smeylan/measuring-grammatical-productivity
codebase for "Measuring Grammatical Productivity" workshop
smeylan/muybridge
Code to generate submission for 2019 Art in AI competition at Duke
smeylan/ngrawk2
codebase for building n-gram language models and computing lexical and sub lexical surprisal from {Google Books, Google 1T, British National Corpus, OPUS}
smeylan/PLEARN_analysis
Analysis of storybook and eye tracking task following morphological development
smeylan/PLEARN_lookit
Code to generate and process data from the LookIt version of the plural learning experiments
smeylan/pos-tagging-example
Toy example of using spaCy to tag CHILDES glosses
smeylan/RTexVars
Make variables in an R session into named commands in LaTeX
smeylan/smeylan.github.io
personal academic website
smeylan/surprisal
Computes surprisal and other measures from language models
smeylan/telephone-analysis-public
Repository for "Evaluating Models of Robust Word Recognition with Serial Reproduction"
smeylan/word_forms
Code and analyses for "Word forms reflect trade-offs between speaker effort and robust listener recognition"
smeylan/yobiyoba_eval
testing YobiYoba transcription vs. DiViMe tools