Czi24

Czi24's Stars

PKU-Alignment/align-anything
Align Anything: Training All-modality Model with Feedback
Language:Python2.8k368
Deep-Agent/R1-V
Witness the aha moment of VLM with less than $3.
Language:Python3.3k255
EvolvingLMMs-Lab/open-r1-multimodal
A fork to add multimodal model training to open-r1
Language:Python1.1k54
om-ai-lab/VLM-R1
Solve Visual Understanding with Reinforced VLMs
Language:Python4.1k255
getAsterisk/deepclaude
A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's CoT reasoning traces with Anthropic Claude models.
Language:Rust4.8k380
Jiayi-Pan/TinyZero
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
Language:Python11.2k1.4k
huggingface/open-r1
Fully open reproduction of DeepSeek-R1
Language:Python22.8k2.1k
modelscope/awesome-deep-reasoning
Collect every awesome work about r1!
Language:Python2808
agentica-project/deepscaler
Democratizing Reinforcement Learning for LLMs
Language:Python2k176
schuy1er/EWF_official
An official code for "Endpoints Weight Fusion for Class Incremental Semantic Segmentation"
Language:Python325
MrGiovanni/ContinualLearning
[MICCAI 2023] Continual Learning for Abdominal Multi-Organ and Tumor Segmentation
Language:Python689
arthurdouillard/CVPR2021_PLOP
Official code of CVPR 2021's PLOP: Learning without Forgetting for Continual Semantic Segmentation
Language:Python15122
simplescaling/s1
s1: Simple test-time scaling
Language:Python6k690
shawnricecake/Heima
Code for Heima
Language:Python343
DAMO-NLP-SG/DiGIT
[NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective
Language:Python642
The-AI-Alliance/GEO-Bench-VLM
GEOBench-VLM: Benchmarking Vision-Language Models for Geospatial Tasks
331
SegmentationBLWX/cssegmentation
CSSegmentation: An Open Source Continual Semantic Segmentation Toolbox Based on PyTorch.
Language:Python334
LMM101/Awesome-Multimodal-Next-Token-Prediction
[Survey] Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey
3869
meta-llama/llama
Inference code for Llama models
Language:Python57.9k9.7k
mbzuai-oryx/LlamaV-o1
Rethinking Step-by-step Visual Reasoning in LLMs
Language:Python27517
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Language:Python8.3k514
lucidrains/transfusion-pytorch
Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI
Language:Python96942
FoundationVision/LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Language:Python1.6k72
FoundationVision/Infinity
Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
Language:Python1k42
PKU-YuanGroup/Next-Patch-Prediction
Language:Python1333
AILab-CVC/SEED-X
Multimodal Models in Real World
Language:Jupyter Notebook44420
mit-han-lab/vila-u
[ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
Language:Python2467
FoundationVision/VAR
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
Language:Jupyter Notebook6.9k445
ByteFlow-AI/TokenFlow
[CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".
Language:Python2871
deepcs233/Visual-CoT
[Neurips'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning
Language:Python26912