bcui19

Pinned Repositories

arena-hard
Arena-Hard benchmark
Language:Jupyter Notebook
CompilerGym
A reinforcement learning toolkit for compiler optimizations
Language:Python
composer
Train neural networks up to 7x faster
Language:Python
CS-148
FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language:Python
Glooko
megablocks
Language:Python
mosaic_examples
Fast and flexible reference benchmarks
Language:Python
NeMo
NeMo: a toolkit for conversational AI
Language:Python
off-belief-learning
Implementation of the Off Belief Learning algorithm.
Language:Python

bcui19's Repositories

bcui19/arena-hard
Arena-Hard benchmark
Language:Jupyter Notebook
bcui19/CompilerGym
A reinforcement learning toolkit for compiler optimizations
Language:Python
bcui19/composer
Train neural networks up to 7x faster
Language:Python
bcui19/CS-148
bcui19/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language:Python
bcui19/Glooko
bcui19/megablocks
Language:Python
bcui19/mosaic_examples
Fast and flexible reference benchmarks
Language:Python
bcui19/NeMo
NeMo: a toolkit for conversational AI
Language:Python
bcui19/off-belief-learning
Implementation of the Off Belief Learning algorithm.
Language:Python
bcui19/PCC-pytorch
A pytorch implementation of the paper "Prediction, Consistency, Curvature: Representation Learning for Locally-Linear Control"
Language:Python
bcui19/probability
Probabilistic reasoning and statistical analysis in TensorFlow
bcui19/reward-bench
RewardBench: the first evaluation tool for reward models.
bcui19/RL4LMs
A modular RL library to fine-tune language models to human preferences
bcui19/rlmeta
RLMeta is a light-weight flexible framework for Distributed Reinforcement Learning Research.
bcui19/scratch
scratch work
Language:Python
bcui19/streaming
A Data Streaming Library for Efficient Neural Network Training
bcui19/tensorflow
Computation using data flow graphs for scalable machine learning
Language:C++
bcui19/toolbox
Essential guides and programming tools in my toolbox (with focus on ML Training)