Pinned Repositories
alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
cameron-chen.github.io
flow-iar
A PyTorch implementation of the flow policy with invalid action rejection for large discrete (categorical) action space with constraints.
imitation
Clean PyTorch implementations of imitation and reward learning algorithms
IQ-Learn
(NeurIPS '21 Spotlight) IQ-Learn: Inverse Q-Learning for Imitation
mgm
A PyTorch implementation of Multiscale Generative Models.
SimPO
SimPO: Simple Preference Optimization with a Reference-Free Reward
dice
Official implementation of Bootstrapping Language Models via DPO Implicit Rewards
cameron-chen's Repositories
cameron-chen/mgm
A PyTorch implementation of Multiscale Generative Models.
cameron-chen/flow-iar
A PyTorch implementation of the flow policy with invalid action rejection for large discrete (categorical) action space with constraints.
cameron-chen/IQ-Learn
(NeurIPS '21 Spotlight) IQ-Learn: Inverse Q-Learning for Imitation
cameron-chen/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
cameron-chen/cameron-chen.github.io
cameron-chen/imitation
Clean PyTorch implementations of imitation and reward learning algorithms
cameron-chen/SimPO
SimPO: Simple Preference Optimization with a Reference-Free Reward