Yulv-git

Computer Vision, Medical Image Computing, LLM

PFCC, iFLYTEK, Guangzhou, China

Yulv-git's Stars

deepseek-ai/DeepSeek-V3
Language: Python 94.1k 734 484 15.2k
meta-llama/llama3
The official Meta Llama 3 GitHub site
Language: Python 28.6k 243 287 3.3k
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Language: Python 25.8k 203 573 2.5k
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Language: Python 21k 304 1.4k 2.6k
KwaiVGI/LivePortrait
Bring portraits to life!
Language: Python 14.5k 126 418 1.6k
state-spaces/mamba
Mamba SSM architecture
Language: Python 14.4k 105 628 1.3k
microsoft/BitNet
Official inference framework for 1-bit LLMs
Language: C++ 12.8k 134 107 909
PKU-YuanGroup/Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.
Language: Python 11.9k 155 373 1.1k
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
Language: Jupyter Notebook 11.4k 146 381 1.1k
THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Language: Python 11k 135 582 1.1k
openai/DALL-E
PyTorch package for the discrete VAE used for DALL·E.
Language: Python 10.8k 228 90 1.9k
Tencent/HunyuanVideo
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Language: Python 9.4k 111 218 791
QwenLM/Qwen2.5-VL
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Language: Jupyter Notebook 9.2k 56 781 642
IDEA-Research/GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Language: Python 7.7k 46 330 778
LargeWorldModel/LWM
Large World Model -- Modeling Text and Video with Millions Context
Language: Python 7.3k 66 73557
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Language: Python 7k 46 88634
ChaoningZhang/MobileSAM
This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!
Language: Jupyter Notebook 5.1k 44 131 524
xialeiliu/Awesome-Incremental-Learning
Awesome Incremental Learning
4k 134 49590
SysCV/sam-hq
Segment Anything in High Quality [NeurIPS 2023]
Language: Jupyter Notebook 3.8k 80 145 231
gpt-omni/mini-omni
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
Language: Python 3.2k 78 127 278
pytorch/executorch
On-device AI across mobile, embedded and edge for PyTorch
Language: C++ 2.6k 66 981 495
baaivision/Emu
Emu Series: Generative Multimodal Models from BAAI
Language: Python 1.7k 22 8986
Tencent/Tencent-Hunyuan-Large
Language: Python 1.5k 26 1795
XueFuzhao/awesome-mixture-of-experts
A collection of AWESOME things about mixture-of-experts
1.1k 25 278
ermongroup/SDEdit
PyTorch implementation for SDEdit: Image Synthesis and Editing with Stochastic Differential Equations
Language: Python 1.1k 22 3089
SysCV/sam-pt
SAM-PT: Extending SAM to zero-shot video segmentation with point-based tracking.
Language: Python 988 41 3463
willisma/SiT
Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"
Language: Python 793 11 3349
sail-sg/MDT
Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)
Language: Python 553 16 5240
TiankaiHang/Min-SNR-Diffusion-Training
[ICCV 2023] Efficient Diffusion Training via Min-SNR Weighting Strategy
Language: Python 243 2 98
iflytek/VLE
VLE: Vision-Language Encoder (VLE: a vision-language multimodal pre-trained model)
Language: Python 188 6 914
