forhaoliu

San Francisco Bay Area

Pinned Repositories

chain-of-hindsight
Chain-of-Hindsight, A Scalable RLHF Method
Language:Python215 4 1717
hybrid-discriminative-generative
Hybrid Discriminative-Generative Training via Contrastive Learning
Language:Python75 5 47
instructrl
Instruction Following Agents with Multimodal Transforemrs
Language:Python50 1 45
language-quantized-autoencoders
Language Quantized AutoEncoders
Language:Python93 1 35
mini_apt
Language:Python7 1 02
ringattention
Transformers with Arbitrarily Large Context
Language:Python625 6 1648
taming-maml
Taming MAML: efficient unbiased meta-reinforcement learning
Language:Python29 1 23
tux
Tools and Utils for Experiments (TUX)
Language:Python13 2 14
LWM
Large World Model With 1M Context
Language:Python7.1k 66 71551
open_llama
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
7.4k 121 91378

forhaoliu's Repositories

forhaoliu/ringattention
Transformers with Arbitrarily Large Context
Language:Python625 6 1648
forhaoliu/chain-of-hindsight
Chain-of-Hindsight, A Scalable RLHF Method
Language:Python215 4 1717
forhaoliu/language-quantized-autoencoders
Language Quantized AutoEncoders
Language:Python93 1 35
forhaoliu/hybrid-discriminative-generative
Hybrid Discriminative-Generative Training via Contrastive Learning
Language:Python75 5 47
forhaoliu/instructrl
Instruction Following Agents with Multimodal Transforemrs
Language:Python50 1 45
forhaoliu/taming-maml
Taming MAML: efficient unbiased meta-reinforcement learning
Language:Python29 1 23
forhaoliu/tux
Tools and Utils for Experiments (TUX)
Language:Python13 2 14
forhaoliu/mini_apt
Language:Python7 1 02
forhaoliu/jax_sac
Language:Python4 3 0