lhao499

UC BerkeleySan Francisco Bay Area

Pinned Repositories

LWM
Large World Model -- Modeling Text and Video with Millions Context
Language:Python7.2k 66 71554
chain-of-hindsight
Chain-of-Hindsight, A Scalable RLHF Method
Language:Python210 4 1717
hybrid-discriminative-generative
Hybrid Discriminative-Generative Training via Contrastive Learning
Language:Python75 5 47
instructrl
Instruction Following Agents with Multimodal Transforemrs
Language:Python49 1 34
language-quantized-autoencoders
Language Quantized AutoEncoders
Language:Python92 1 35
mini_apt
Language:Python7 1 02
ringattention
Transformers with Arbitrarily Large Context
Language:Python571 5 1543
taming-maml
Taming MAML: efficient unbiased meta-reinforcement learning
Language:Python28 1 23
tux
Tools and Utils for Experiments (TUX)
Language:Python13 2 14
open_llama
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
7.4k 122 91382

lhao499's Repositories

lhao499/ringattention
Transformers with Arbitrarily Large Context
Language:Python571 5 1543
lhao499/chain-of-hindsight
Chain-of-Hindsight, A Scalable RLHF Method
Language:Python210 4 1717
lhao499/language-quantized-autoencoders
Language Quantized AutoEncoders
Language:Python92 1 35
lhao499/hybrid-discriminative-generative
Hybrid Discriminative-Generative Training via Contrastive Learning
Language:Python75 5 47
lhao499/instructrl
Instruction Following Agents with Multimodal Transforemrs
Language:Python49 1 34
lhao499/taming-maml
Taming MAML: efficient unbiased meta-reinforcement learning
Language:Python28 1 23
lhao499/tux
Tools and Utils for Experiments (TUX)
Language:Python13 2 14
lhao499/mini_apt
Language:Python7 1 02
lhao499/jax_sac
Language:Python4 3 00
lhao499/jax_apt
Language:Python0 2 00