cheng221

Shanghai

cheng221's Stars

Tencent/MimicMotion
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Language:Python2.1k181
deepseek-ai/DeepSeek-R1
29.9k2.5k
km1994/LLMs_interview_notes
该仓库主要记录大模型（LLMs）算法工程师相关的面试题
1.6k116
deepseek-ai/DeepSeek-VL2
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
Language:Python1.2k274
LTH14/mar
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
Language:Python1.3k68
ikatyang/emoji-cheat-sheet
A markdown version emoji cheat sheet
Language:TypeScript12.7k4.5k
NVlabs/Sana
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
Language:Python3.2k188
showlab/Show-o
[ICLR 2025] Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
Language:Python1.1k49
baaivision/Emu3
Next-Token Prediction is All You Need
Language:Python2k79
instantX-research/Regional-Prompting-FLUX
Training-free Regional Prompting for Diffusion Transformers 🔥
Language:Python53120
gradio-app/gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Language:Python35.5k2.7k
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python21.2k2.3k
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
13.7k876
kvablack/ddpo-pytorch
DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support
Language:Python46943
PaddlePaddle/PaddleNLP
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.
Language:Python12.3k3k
deepseek-ai/Janus
Janus-Series: Unified Multimodal Understanding and Generation Models
Language:Python3.5k319
sihyun-yu/REPA
Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think (ICLR 2025)
Language:Python81340
PaddlePaddle/Paddle
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice （『飞桨』核心框架，深度学习&机器学习高性能单机、分布式训练和跨平台部署）
Language:C++22.4k5.7k
Xnhyacinth/Awesome-LLM-Long-Context-Modeling
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
1.2k43
shallowdream204/DreamClear
[NeurIPS 2024🔥] DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation
Language:Python88247
DAMO-NLP-SG/VideoLLaMA2
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
Language:Python1k69
DAMO-NLP-SG/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Language:Python2.9k264
PaddlePaddle/PaddleMIX
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high performance and flexibility.
Language:Python475174
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Language:Python5.3k406
QwenLM/Qwen2.5-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Language:Jupyter Notebook4.4k270
gerdm/prml
Repository of notes, code and notebooks in Python for the book Pattern Recognition and Machine Learning by Christopher Bishop
Language:Jupyter Notebook2.2k503
cure-lab/PnPInversion
[ICLR2024] Official repo for paper "PnP Inversion: Boosting Diffusion-based Editing with 3 Lines of Code"
Language:Jupyter Notebook28113
ohayonguy/PMRF
Official implementation of Posterior-Mean Rectified Flow: Towards Minimum MSE Photo-Realistic Image Restoration
Language:Python58933
EternalEvan/FlowIE
This repository contains the official implementation of "FlowIE: Efficient Image Enhancement via Rectified Flow"
Language:Python942
IDKiro/sdxs
Official repo of our paper "SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions"
Language:Python59523

cheng221

cheng221's Stars

Tencent/MimicMotion

deepseek-ai/DeepSeek-R1

km1994/LLMs_interview_notes

deepseek-ai/DeepSeek-VL2

LTH14/mar

ikatyang/emoji-cheat-sheet

NVlabs/Sana

showlab/Show-o

baaivision/Emu3

instantX-research/Regional-Prompting-FLUX

gradio-app/gradio

haotian-liu/LLaVA

BradyFU/Awesome-Multimodal-Large-Language-Models

kvablack/ddpo-pytorch

PaddlePaddle/PaddleNLP

deepseek-ai/Janus

sihyun-yu/REPA

PaddlePaddle/Paddle

Xnhyacinth/Awesome-LLM-Long-Context-Modeling

shallowdream204/DreamClear

DAMO-NLP-SG/VideoLLaMA2

DAMO-NLP-SG/Video-LLaMA

PaddlePaddle/PaddleMIX

QwenLM/Qwen-VL

QwenLM/Qwen2.5-VL

gerdm/prml

cure-lab/PnPInversion

ohayonguy/PMRF

EternalEvan/FlowIE

IDKiro/sdxs