Pinned Repositories
acl-anthology
Data and software for building the ACL Anthology.
Akkademia
Translating Akkadian signs to transcriptions using NLP techniques such as HMM, MEMM and BiLSTM neural networks.
Assyrian-Dictionaries
Collection of Assyrian Dictionaries
ctc_sampling
Code for Sampling from Stochastic Finite Automata with Applications to CTC Decoding
finite_state
Read-only pre-release mirrors of OpenFst, OpenGrm N-Gram, OpenGrm Pynini, OpenGrm SFst and OpenGrm Thrax libraries.
google-research
Google Research
mozolm
MozoLM: A language model (LM) serving library
nisaba
Finite-state script normalization and processing utilities
eidos-audition
Collection of auditory models.
language-resources
Datasets and tools for basic natural language processing.
agutkin's Repositories
agutkin/acl-anthology
Data and software for building the ACL Anthology.
agutkin/finite_state
Read-only pre-release mirrors of OpenFst, OpenGrm N-Gram, OpenGrm Pynini, OpenGrm SFst and OpenGrm Thrax libraries.
agutkin/Akkademia
Translating Akkadian signs to transcriptions using NLP techniques such as HMM, MEMM and BiLSTM neural networks.
agutkin/awesome-nlp-resource
awesome nlp resource
agutkin/bible-corpus
A multilingual parallel corpus created from translations of the Bible.
agutkin/corpora
Public repository for Coptic SCRIPTORIUM Corpora Releases
agutkin/fancyimpute
Multivariate imputation and matrix completion algorithms implemented in Python
agutkin/google-research
Google AI Research
agutkin/grpc
The C-based gRPC (C++, Python, Ruby, Objective-C, PHP, C#)
agutkin/hgcn
Hyperbolic Graph Convolutional Networks in PyTorch.
agutkin/HornMorpho
Morphological processing for languages of the Horn of Africa
agutkin/hyperbolic-image-embeddings
Supplementary code for the paper "Hyperbolic Image Embeddings".
agutkin/ie-datasets
Training data for Tibetan NLP.
agutkin/indicnlp_catalog
A collaborative catalog of NLP resources for Indic languages
agutkin/nena_corpus
The NENA corpus in plain-text markup
agutkin/NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
agutkin/OpenNMT-py
Open Source Neural Machine Translation in PyTorch
agutkin/openslr
Repository for the web pages and scripts associated with OpenSLR: the open speech and language repository
agutkin/pandoc
Universal markup converter
agutkin/parti-pytorch
Implementation of Parti, Google's pure attention-based text-to-image neural network, in PyTorch
agutkin/poincare-embeddings
PyTorch implementation of the NIPS-17 paper "Poincaré Embeddings for Learning Hierarchical Representations"
agutkin/redash
Dasher text entry in HTML, CSS, JavaScript, and SVG
agutkin/SignalResampler
Signal Resampler for C++
agutkin/sigtyp.github.io
agutkin/slpat2022
Data for the experiments for the 9th Workshop on Speech and Language Processing for Assistive Technologies
agutkin/ST2022
SIGTYP 2022 Shared Task
agutkin/text-fabric
File format, model, API, and apps for manipulating text and its annotated features
agutkin/utfcpp
UTF-8 with C++ in a Portable Way
agutkin/wikipron
Massively multilingual pronunciation mining
agutkin/XLM
PyTorch original implementation of Cross-lingual Language Model Pretraining.