Pinned Repositories
arena-hard
Arena-Hard benchmark
CompilerGym
A reinforcement learning toolkit for compiler optimizations
composer
Train neural networks up to 7x faster
CS-148
FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Glooko
megablocks
mosaic_examples
Fast and flexible reference benchmarks
NeMo
NeMo: a toolkit for conversational AI
off-belief-learning
Implementation of the Off Belief Learning algorithm.
bcui19's Repositories
bcui19/arena-hard
Arena-Hard benchmark
bcui19/CompilerGym
A reinforcement learning toolkit for compiler optimizations
bcui19/composer
Train neural networks up to 7x faster
bcui19/CS-148
bcui19/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
bcui19/Glooko
bcui19/megablocks
bcui19/mosaic_examples
Fast and flexible reference benchmarks
bcui19/NeMo
NeMo: a toolkit for conversational AI
bcui19/off-belief-learning
Implementation of the Off Belief Learning algorithm.
bcui19/PCC-pytorch
A pytorch implementation of the paper "Prediction, Consistency, Curvature: Representation Learning for Locally-Linear Control"
bcui19/probability
Probabilistic reasoning and statistical analysis in TensorFlow
bcui19/reward-bench
RewardBench: the first evaluation tool for reward models.
bcui19/RL4LMs
A modular RL library to fine-tune language models to human preferences
bcui19/rlmeta
RLMeta is a light-weight flexible framework for Distributed Reinforcement Learning Research.
bcui19/scratch
scratch work
bcui19/streaming
A Data Streaming Library for Efficient Neural Network Training
bcui19/tensorflow
Computation using data flow graphs for scalable machine learning
bcui19/toolbox
Essential guides and programming tools in my toolbox (with focus on ML Training)