Pinned Repositories
LWM
Large World Model -- Modeling Text and Video with Millions Context
chain-of-hindsight
Chain-of-Hindsight, A Scalable RLHF Method
hybrid-discriminative-generative
Hybrid Discriminative-Generative Training via Contrastive Learning
instructrl
Instruction Following Agents with Multimodal Transformers
language-quantized-autoencoders
Language Quantized AutoEncoders
mini_apt
ringattention
Transformers with Arbitrarily Large Context
taming-maml
Taming MAML: efficient unbiased meta-reinforcement learning
tux
Tools and Utils for Experiments (TUX)
open_llama
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
lhao499's Repositories
lhao499/ringattention
Transformers with Arbitrarily Large Context
lhao499/chain-of-hindsight
Chain-of-Hindsight, A Scalable RLHF Method
lhao499/language-quantized-autoencoders
Language Quantized AutoEncoders
lhao499/hybrid-discriminative-generative
Hybrid Discriminative-Generative Training via Contrastive Learning
lhao499/instructrl
Instruction Following Agents with Multimodal Transformers
lhao499/taming-maml
Taming MAML: efficient unbiased meta-reinforcement learning
lhao499/tux
Tools and Utils for Experiments (TUX)
lhao499/mini_apt
lhao499/jax_sac
lhao499/jax_apt