cameron-chen

Pinned Repositories

alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Language: Jupyter Notebook | 00
cameron-chen.github.io
Language: HTML | 0 1 00
flow-iar
A PyTorch implementation of the flow policy with invalid action rejection for large discrete (categorical) action space with constraints.
Language: Python | 2 1 01
imitation
Clean PyTorch implementations of imitation and reward learning algorithms
Language: Python | 0 0 00
IQ-Learn
(NeurIPS '21 Spotlight) IQ-Learn: Inverse Q-Learning for Imitation
Language: Python | 1 0 00
mgm
A PyTorch implementation of Multiscale Generative Models.
Language: Python | 5 3 00
SimPO
SimPO: Simple Preference Optimization with a Reference-Free Reward
Language: Python | 0 0 00
dice
Official implementation of Bootstrapping Language Models via DPO Implicit Rewards
Language: Python | 36 3 01

cameron-chen's Repositories

cameron-chen/mgm
A PyTorch implementation of Multiscale Generative Models.
Language: Python | 5 3 00
cameron-chen/flow-iar
A PyTorch implementation of the flow policy with invalid action rejection for large discrete (categorical) action space with constraints.
Language: Python | 2 1 01
cameron-chen/IQ-Learn
(NeurIPS '21 Spotlight) IQ-Learn: Inverse Q-Learning for Imitation
Language: Python | 1 0 00
cameron-chen/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Language: Jupyter Notebook | 00
cameron-chen/cameron-chen.github.io
Language: HTML | 0 1 00
cameron-chen/imitation
Clean PyTorch implementations of imitation and reward learning algorithms
Language: Python | 0 0 00
cameron-chen/SimPO
SimPO: Simple Preference Optimization with a Reference-Free Reward
Language: Python | 0 0 00