aulaywang's Stars
michuanhaohao/reid-strong-baseline
Bag of Tricks and A Strong Baseline for Deep Person Re-identification
sczhou/ProPainter
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
Tencent/HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Bujiazi/MotionClone
Official implementation of MotionClone: Training-Free Motion Cloning for Controllable Video Generation
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
tyfeld/train-StreamingT2V
Train/Finetune StreamingT2V
Vchitect/VBench
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
Alpha-VLLM/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
ifzhang/FairMOT
[IJCV-2021] FairMOT: On the Fairness of Detection and Re-Identification in Multi-Object Tracking
ali-vilab/UniAnimate
Code for Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".
YuzheZhang-1999/DiffTSR
[CVPR2024] Diffusion-based Blind Text Image Super-Resolution (Official)
FoundationVision/VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
yeungchenwa/Recommendations-Diffusion-Text-Image
A paper collection of recent diffusion models for text-image generation tasks, e,g., visual text generation, font generation, text removal, text image super resolution, text editing, handwritten generation, scene text recognition and scene text detection.
michuanhaohao/AlignedReID
Alignedreid++: Dynamically Matching Local Information for Person Re-Identification.
Vchitect/Latte
Latte: Latent Diffusion Transformer for Video Generation.
ultralytics/ultralytics
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
XPixelGroup/DiffBIR
Official codes of DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior
THUDM/Inf-DiT
Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer
gitmehrdad/SFSORT
SFSORT: Scene Features-based Simple Online Real-Time Tracker
Sharpiless/Yolov5-Deepsort
最新版本yolov5+deepsort目标检测和追踪,能够显示目标类别,支持5.0版本可训练自己数据集
ultralytics/yolov5
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
XPixelGroup/HAT
CVPR2023 - Activating More Pixels in Image Super-Resolution Transformer Arxiv - HAT: Hybrid Attention Transformer for Image Restoration
XPixelGroup/BasicSR
Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also support StyleGAN2, DFDNet.
guptapraful/niqe
NIQE for IQA in python..
zhengchen1999/RGT
PyTorch code for our ICLR 2024 paper "Recursive Generalization Transformer for Image Super-Resolution"
nwojke/deep_sort
Simple Online Realtime Tracking with a Deep Association Metric
megvii-research/HiDiffusion
[ECCV 2024] HiDiffusion: Increases the resolution and speed of your diffusion model by only adding a single line of code!
mseitzer/pytorch-fid
Compute FID scores with PyTorch.
openai/guided-diffusion
zoomin-lee/SemCity
[CVPR 2024] The official implementation for "SemCity: Semantic Scene Generation with Triplane Diffusion"