Pinned Repositories
DiGIT
[NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective
Janus
Janus-Series: Unified Multimodal Understanding and Generation Models
lingua
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
VAR
[NeurIPS 2024 Oral] [GPT beats diffusion] [scaling laws in visual generation] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
torchtune
PyTorch native finetuning library
DiGIT
GPST
[ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer
Open-VideoPoet
SimVQ
SimVQ: Addressing Representation Collapse in Vector Quantized Models with One Linear Layer
youngsheen.github.io
AcadHomepage: A Modern and Responsive Academic Personal Homepage
youngsheen's Repositories
youngsheen/SimVQ
SimVQ: Addressing Representation Collapse in Vector Quantized Models with One Linear Layer
youngsheen/GPST
[ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer
youngsheen/DiGIT
youngsheen/Open-VideoPoet
youngsheen/youngsheen.github.io
AcadHomepage: A Modern and Responsive Academic Personal Homepage