leuchine
Machine Learning and Natural Language Processing Researcher
University of Hong Kong / Reka AI, Hong Kong
leuchine's Stars
openai/CLIP
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet for a given image
openai/openai-python
The official Python library for the OpenAI API
dair-ai/ml-visuals
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
openai/spinningup
An educational resource to help anyone learn deep reinforcement learning.
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
cleanlab/cleanlab
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
openai/jukebox
Code for the paper "Jukebox: A Generative Model for Music"
EleutherAI/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
snorkel-team/snorkel
A system for quickly generating training data with weak supervision
yandex/YaLM-100B
Pretrained language model with 100B parameters
facebookresearch/fairscale
PyTorch extensions for high-performance and large-scale training.
huggingface/huggingface_hub
The official Python client for the Huggingface Hub.
EleutherAI/the-pile
facebookresearch/diplomacy_cicero
Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.
microsoft/mup
maximal update parametrization (µP)
PKU-Alignment/safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
lucidrains/perceiver-pytorch
Implementation of Perceiver, General Perception with Iterative Attention, in PyTorch
bigscience-workshop/bigscience
Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.
GEM-benchmark/NL-Augmenter
NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations
LeoGrin/tabular-benchmark
bigscience-workshop/data-preparation
Code used for sourcing and cleaning the BigScience ROOTS corpus
huggingface/olm-datasets
Pipeline for pulling and processing online language model pretraining data from the web
lxuechen/private-transformers
A codebase that makes differentially private training of transformers easy.
reka-ai/reka-vibe-eval
Multimodal language model benchmark, featuring challenging examples
sebastianGehrmann/CausalMediationAnalysis
Code for the paper "Causal Mediation Analysis for Interpreting Neural NLP: The Case of Gender Bias"
yzpang/gold-off-policy-text-gen-iclr21
UKPLab/nessie
Automatically detect errors in annotated corpora.
EleutherAI/pile-cc
mlcommons/dataperf
Data Benchmarking