whlzy

I like training transformers.

Shanghai Jiao Tong University, Shanghai AI LaboratoryShanghai

whlzy's Stars

karpathy/LLM101n
LLM101n: Let's build a Storyteller
9k353
LTH14/rcg
PyTorch implementation of RCG https://arxiv.org/abs/2312.03701
Language:Python76733
FoundationVision/LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Language:Python82129
FoundationVision/OmniTokenizer
OmniTokenizer: one model and one weight for image-video joint tokenization.
Language:Python1563
deepseek-ai/DeepSeek-Coder-V2
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
1k51
feifeibear/long-context-attention
Sequence Parallel Attention for Long Context LLM Model Training and Inference
Language:Python1887
zhaoyue-zephyrus/bsq-vit
[BSQ-ViT] Image and Video Tokenization with Binary Spherical Quantization
Language:Python46
lucidrains/titok-pytorch
Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation"
Language:Python1312
Q-Future/CMC-Bench
[LMM + codec] A new paradigm of visual signal compression!
Language:Python16
hsiehjackson/RULER
This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
Language:Python30216
mit-han-lab/distrifuser
[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
Language:Python47612
OpenGVLab/InternVideo2
1741
leptonai/search_with_lepton
Building a quick conversation-based search demo with Lepton AI.
Language:TypeScript7.3k934
idootop/mi-gpt
🏠 将小爱音箱接入 ChatGPT 和豆包，改造成你的专属语音助手。
Language:TypeScript5.5k464
HaoyiZhu/PointCloudMatters
Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning
Language:Python14
hjq133/piccolo-embedding
code for piccolo embedding model from SenseTime
Language:Python401
gojasper/flash-diffusion
Official implementation of ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation
Language:Python26514
QwenLM/Qwen2
Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.
Language:Shell5.5k301
Vchitect/Latte
Latte: Latent Diffusion Transformer for Video Generation.
Language:Python1.4k146
Doraemonzzz/vector-quantize
Language:Python4
tianweiy/DMD2
Language:Python27918
lllyasviel/Omost
Your image is almost there!
Language:Python6.6k394
jzhang38/EasyContext
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
Language:Python51632
discus0434/aesthetic-predictor-v2-5
SigLIP-based Aesthetic Score Predictor
Language:Python76
christophschuhmann/improved-aesthetic-predictor
CLIP+MLP Aesthetic Score Predictor
Language:Python76786
FoundationVision/VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
Language:Python3.7k282
sony/ctm
Language:Python1939
borisdayma/dalle-mini
DALL·E Mini - Generate images from a text prompt
Language:Python14.7k1.2k
HigherOrderCO/Bend
A massively parallel, high-level programming language
Language:Rust16.5k409
allenai/unified-io-2
Language:Python52825

whlzy

whlzy's Stars

karpathy/LLM101n

LTH14/rcg

FoundationVision/LlamaGen

FoundationVision/OmniTokenizer

deepseek-ai/DeepSeek-Coder-V2

feifeibear/long-context-attention

zhaoyue-zephyrus/bsq-vit

lucidrains/titok-pytorch

Q-Future/CMC-Bench

hsiehjackson/RULER

mit-han-lab/distrifuser

OpenGVLab/InternVideo2

leptonai/search_with_lepton

idootop/mi-gpt

HaoyiZhu/PointCloudMatters

hjq133/piccolo-embedding

gojasper/flash-diffusion

QwenLM/Qwen2

Vchitect/Latte

Doraemonzzz/vector-quantize

tianweiy/DMD2

lllyasviel/Omost

jzhang38/EasyContext

discus0434/aesthetic-predictor-v2-5

christophschuhmann/improved-aesthetic-predictor

FoundationVision/VAR

sony/ctm

borisdayma/dalle-mini

HigherOrderCO/Bend

allenai/unified-io-2