Pinned Repositories
100-nlp-papers
100 Must-Read NLP Papers
africanlp-public-datasets
A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.
afromt
Code for the EMNLP 2021 Paper "AfroMT: Pretraining Strategies and Reproducible Benchmarks for Translation of 8 African Languages" by Machel Reid, Junjie Hu, Graham Neubig, Yutaka Matsuo
BangBei-APP
BangBei is an Android app designed for use on the UESTC campus that lets students help each other and earn money at the same time. It won the 2017 UESTC programming competition.
deepframeworks
Evaluation of Deep Learning Frameworks
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
KINNEWS-and-KIRNEWS-Corpus
Data, embeddings, stopword lists, code, and baselines for the COLING 2020 paper "KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text Classification for Kinyarwanda and Kirundi" by Rubungo Andre Niyongabo, Hong Qu, Julia Kreutzer, and Li Huang.
kkltk
The Kinyarwanda and Kirundi Languages Toolkit (KKLTK) is a Python package for processing the Kinyarwanda and Kirundi languages. KKLTK currently provides stopword sets for both languages; other preprocessing tools, such as Kinyarwanda and Kirundi tokenizers, will be added soon. KKLTK requires Python 3.0, 3.5, 3.6, 3.7, or 3.8.
nlp-datasets
Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)
UESTC_2016_Freshman_web
A website developed in the UESTC-IUSTU workshop for new members to learn about web development, mobile app development (Android and iOS), etc.
Andrews2017's Repositories
Andrews2017/africanlp-public-datasets
A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.
Andrews2017/KINNEWS-and-KIRNEWS-Corpus
Data, embeddings, stopword lists, code, and baselines for the COLING 2020 paper "KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text Classification for Kinyarwanda and Kirundi" by Rubungo Andre Niyongabo, Hong Qu, Julia Kreutzer, and Li Huang.
Andrews2017/nlp-datasets
Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)
Andrews2017/BangBei-APP
BangBei is an Android app designed for use on the UESTC campus that lets students help each other and earn money at the same time. It won the 2017 UESTC programming competition.
Andrews2017/UESTC_2016_Freshman_web
A website developed in the UESTC-IUSTU workshop for new members to learn about web development, mobile app development (Android and iOS), etc.
Andrews2017/afromt
Code for the EMNLP 2021 Paper "AfroMT: Pretraining Strategies and Reproducible Benchmarks for Translation of 8 African Languages" by Machel Reid, Junjie Hu, Graham Neubig, Yutaka Matsuo
Andrews2017/Andrews2017.github.io
Andrews2017/annotated_latex_equations
Examples of how to create colorful, annotated equations in LaTeX using TikZ.
Andrews2017/bert
TensorFlow code and pre-trained models for BERT
Andrews2017/bitextor
Bitextor generates translation memories from multilingual websites
Andrews2017/cgcnn
Crystal graph convolutional neural networks for predicting material properties.
Andrews2017/Data-Science-Articles
A collection of my data science articles published in Towards Data Science and Towards AI.
Andrews2017/GEM-benchmark.github.io
Andrews2017/lafand-mt
LAFAND-MT: Lacuna Anglo & Franco Africa News Dataset for low-resourced MT
Andrews2017/lit-gpt
Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
Andrews2017/Llama-2-notebooks
All the projects related to Llama
Andrews2017/LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
Andrews2017/masakhane-community
All our community docs! Start here! Let's put Africa on the NLP map.
Andrews2017/masakhane-preprocessing
Building an effective preprocessing tool for African languages
Andrews2017/ML-Papers-Explained
Explanations of key concepts in ML
Andrews2017/multi_gpu_training
Andrews2017/Neo4j-ParticleFiltering
A user-defined procedure based on Markov-chains to approximate the Personalized PageRank algorithm in Neo4j
Andrews2017/PLMpapers
Must-read Papers on pre-trained language models.
Andrews2017/pytorch-sentiment-analysis
Tutorials on getting started with PyTorch and TorchText for sentiment analysis.
Andrews2017/pytorch-seq2seq
Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.
Andrews2017/speechbrain
A PyTorch-based Speech Toolkit
Andrews2017/synthesis
Data synthesis by contextualizing glossary translations
Andrews2017/TAADpapers
Must-read Papers on Textual Adversarial Attack and Defense
Andrews2017/TextBlob
Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
Andrews2017/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.