Yulv-git's Stars
deepseek-ai/DeepSeek-V3
meta-llama/llama3
The official Meta Llama 3 GitHub site
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
KwaiVGI/LivePortrait
Bring portraits to life!
state-spaces/mamba
Mamba SSM architecture
microsoft/BitNet
Official inference framework for 1-bit LLMs
PKU-YuanGroup/Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
THUDM/CogVideo
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
openai/DALL-E
PyTorch package for the discrete VAE used for DALL·E.
Tencent/HunyuanVideo
HunyuanVideo: A Systematic Framework For Large Video Generation Model
QwenLM/Qwen2.5-VL
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
IDEA-Research/GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
LargeWorldModel/LWM
Large World Model -- Modeling Text and Video with Millions Context
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
ChaoningZhang/MobileSAM
This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!
xialeiliu/Awesome-Incremental-Learning
Awesome Incremental Learning
SysCV/sam-hq
Segment Anything in High Quality [NeurIPS 2023]
gpt-omni/mini-omni
Open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output for conversation.
pytorch/executorch
On-device AI across mobile, embedded and edge for PyTorch
baaivision/Emu
Emu Series: Generative Multimodal Models from BAAI
Tencent/Tencent-Hunyuan-Large
XueFuzhao/awesome-mixture-of-experts
A collection of AWESOME things about mixture-of-experts
ermongroup/SDEdit
PyTorch implementation for SDEdit: Image Synthesis and Editing with Stochastic Differential Equations
SysCV/sam-pt
SAM-PT: Extending SAM to zero-shot video segmentation with point-based tracking.
willisma/SiT
Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"
sail-sg/MDT
Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)
TiankaiHang/Min-SNR-Diffusion-Training
[ICCV 2023] Efficient Diffusion Training via Min-SNR Weighting Strategy
iflytek/VLE
VLE: Vision-Language Encoder (a vision-language multimodal pre-trained model)