GSK666's Stars
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
PaddlePaddle/PaddleDetection
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
WongKinYiu/yolov9
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
HumanAIGC/EMO
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
LiheYoung/Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
google/automl
Google Brain AutoML
CVHub520/X-AnyLabeling
Effortless data labeling with AI support from Segment Anything and other awesome models.
showlab/Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
google-research/multinerf
A Code Release for Mip-NeRF 360, Ref-NeRF, and RawNeRF
lucidrains/vector-quantize-pytorch
Vector (and Scalar) Quantization, in Pytorch
lucidrains/lion-pytorch
🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch
ChenHsing/Awesome-Video-Diffusion-Models
[CSUR] A Survey on Video Diffusion Models
Vchitect/Latte
Latte: Latent Diffusion Transformer for Video Generation.
Sense-X/Co-DETR
[ICCV 2023] DETRs with Collaborative Hybrid Assignments Training
omerbt/MultiDiffusion
Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" (ICML 2023)
google-research/magvit
Official JAX implementation of MAGVIT: Masked Generative Video Transformer
alibaba/animate-anything
Fine-Grained Open Domain Image Animation with Motion Guidance
G-U-N/AnimateLCM
[SIGGRAPH ASIA 2024 TCS] AnimateLCM: Computation-Efficient Personalized Style Video Generation without Personalized Video Data
baaivision/tokenize-anything
[ECCV 2024] Tokenize Anything via Prompting
dome272/VQGAN-pytorch
Pytorch implementation of VQGAN (Taming Transformers for High-Resolution Image Synthesis) (https://arxiv.org/pdf/2012.09841.pdf)
AnythingInAnyScene/anything_in_anyscene
ViTAE-Transformer/QFormer
The official repo for [TPAMI'23] "Vision Transformer with Quadrangle Attention"
HeliosZhao/Animate124
Animate124: Animating One Image to 4D Dynamic Scene
xdxie/WordArt
The official code of CornerTransformer (ECCV 2022, Oral) on top of MMOCR.
JCZ404/Semi-DETR
[CVPR 2023] Official implementation of the paper "Semi-DETR: Semi-Supervised Object Detection with Detection Transformers"
Czm369/MixPL
Mixed Pseudo Labels for Semi-Supervised Object Detection
alanzty/KCR-Official
This is official implementation of KCR.
ali-vilab/i2vgen-xl
replaceanything3d/replaceanything3d.github.io