codylcs's Stars
lansinuote/Diffusion_Training_Examples
MooreThreads/Moore-AnimateAnyone
Character Animation (AnimateAnyone, Face Reenactment)
GuijiAI/duix.ai
firework8/Awesome-Skeleton-based-Action-Recognition
A curated paper list of awesome skeleton-based action recognition.
OpenGVLab/PIIP
NeurIPS 2024 Spotlight ⭐️ Parameter-Inverted Image Pyramid Networks (PIIP)
modelscope/DiffSynth-Studio
Enjoy the magic of Diffusion models!
apple/ml-4m
4M: Massively Multimodal Masked Modeling
mbzuai-oryx/VideoGPT-plus
Official Repository of paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding
dusty-nv/clip_trt
CLIP and SigLIP models optimized with TensorRT with a Transformers-like API
qinzheng2000/GeneralTrack
HZAI-ZJNU/Mamba-YOLO
the official pytorch implementation of “Mamba-YOLO:SSMs-based for Object Detection”
yunlongdong/Awesome-Embodied-AI
haoranD/Awesome-Embodied-AI
A curated list of awesome papers on Embodied AI and related research/industry-driven resources.
huangzongmou/yolov8_Distillation
Kudos12th/kudos_yolov5_knowledge_distillation
Implementation of Distilling Object Detectors with Fine-grained Feature Imitation on yolov5
AILab-CVC/YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
winycg/MCL
[AAAI-2022 Oral] Official implementations of MCL: Mutual Contrastive Learning for Visual Representation Learning
lllyasviel/Fooocus
Focus on prompting and generating
dingmyu/davit
[ECCV 2022]Code for paper "DaViT: Dual Attention Vision Transformer"
microsoft/FocalNet
[NeurIPS 2022] Official code for "Focal Modulation Networks"
CAMMA-public/SelfPose3d
Official code for "SelfPose3d: Self-Supervised Multi-Person Multi-View 3d Pose Estimation"
HKUST-LongGroup/Awesome-Open-Vocabulary-Detection-and-Segmentation
Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future
lxtGH/OMG-Seg
OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]
chongzhou96/EdgeSAM
Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"
THU-MIG/torch-model-compression
针对pytorch模型的自动化模型结构分析和修改工具集,包含自动分析模型结构的模型压缩算法库
SoraWebui/SoraWebui
SoraWebui is an open-source Sora web client, enabling users to easily create videos from text with OpenAI's Sora model.
microsoft/nni
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
LiheYoung/Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
paTRICK-swk/P-STMO
[ECCV2022] The PyTorch implementation for "P-STMO: Pre-Trained Spatial Temporal Many-to-One Model for 3D Human Pose Estimation"
Arthur151/ROMP
Monocular, One-stage, Regression of Multiple 3D People and their 3D positions & trajectories in camera & global coordinates. ROMP[ICCV21], BEV[CVPR22], TRACE[CVPR2023]