AlexJonesNLP
ML Engineer @ Ibotta, ex-intern @ Google Translate, Dartmouth College '23
Dartmouth College, Hanover, NH
Pinned Repositories
alt-bitexts
A set of notebooks showcasing methods for mining bitexts from parallel or comparable corpora
XLAnalysis5K
Code and data for EMNLP 2021 paper "A Massively Multilingual Analysis of Cross-linguality in Shared Embedding Space."
HMDM
Hierarchical & multilingual document mining via localized Poincaré projections
KALComp
A comparable corpus of Kalaallisut and Danish web-crawled sentences, along with some noisy aligned texts and code for MT finetuning experiments between Kalaallisut and English. Currently looking to improve the quality of pseudoparallel data. Final project for LING28/Computational Linguistics, Dartmouth College, Winter 2022.
acl-anthology
Data and software for building the ACL Anthology.
AlexJonesNLP
AlexJonesNLP.github.io
Amzn-Product-Quality-Prediction
DS-ML-Python-cheat-sheets
Interactive notebooks that walk through the basics of the most widely used Python data science and ML libraries.
examples
A set of examples around PyTorch in Vision, Text, Reinforcement Learning, etc.
AlexJonesNLP's Repositories
AlexJonesNLP/examples
A set of examples around PyTorch in Vision, Text, Reinforcement Learning, etc.
AlexJonesNLP/langchain
⚡ Building applications with LLMs through composability ⚡
AlexJonesNLP/url-nlp
AlexJonesNLP/homebrew-cask
🍻 A CLI workflow for the administration of macOS applications distributed as binaries
AlexJonesNLP/AlexJonesNLP
AlexJonesNLP/AlexJonesNLP.github.io
AlexJonesNLP/alt-bitexts
A set of notebooks showcasing methods for mining bitexts from parallel or comparable corpora
AlexJonesNLP/KALComp
A comparable corpus of Kalaallisut and Danish web-crawled sentences, along with some noisy aligned texts and code for MT finetuning experiments between Kalaallisut and English. Currently looking to improve the quality of pseudoparallel data. Final project for LING28/Computational Linguistics, Dartmouth College, Winter 2022.
AlexJonesNLP/XLAnalysis5K
Code and data for EMNLP 2021 paper "A Massively Multilingual Analysis of Cross-linguality in Shared Embedding Space."
AlexJonesNLP/DS-ML-Python-cheat-sheets
Interactive notebooks that walk through the basics of the most widely used Python data science and ML libraries.
AlexJonesNLP/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
AlexJonesNLP/acl-anthology
Data and software for building the ACL Anthology.
AlexJonesNLP/NL-Augmenter
NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations
AlexJonesNLP/text-autoaugment
Code for EMNLP 2021 main conference paper "Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification"
AlexJonesNLP/HyperbolicProcrustesAnalysis
AlexJonesNLP/SentimentMT
Repo associated with "Sentiment-based Candidate Selection for NMT." || Decoder-side sentiment-based translation selection.
AlexJonesNLP/sk-dist
Distributed scikit-learn meta-estimators in PySpark
AlexJonesNLP/Amzn-Product-Quality-Prediction
AlexJonesNLP/HMDM
Hierarchical & multilingual document mining via localized Poincaré projections
AlexJonesNLP/JoSH
[KDD 2020] Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding
AlexJonesNLP/poincare-embeddings
PyTorch implementation of the NIPS-17 paper "Poincaré Embeddings for Learning Hierarchical Representations"