codylcs

codylcs's Stars

lansinuote/Diffusion_Training_Examples
Language:Jupyter Notebook7016
MooreThreads/Moore-AnimateAnyone
Character Animation (AnimateAnyone, Face Reenactment)
Language: Python · 3.1k · 240
GuijiAI/duix.ai
Language: C++ · 4.5k · 647
firework8/Awesome-Skeleton-based-Action-Recognition
A curated paper list of awesome skeleton-based action recognition.
39656
OpenGVLab/PIIP
NeurIPS 2024 Spotlight ⭐️ Parameter-Inverted Image Pyramid Networks (PIIP)
Language:Python462
modelscope/DiffSynth-Studio
Enjoy the magic of Diffusion models!
Language: Python · 6.4k · 574
apple/ml-4m
4M: Massively Multimodal Masked Modeling
Language: Python · 1.6k · 90
mbzuai-oryx/VideoGPT-plus
Official Repository of paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding
Language:Python19315
dusty-nv/clip_trt
CLIP and SigLIP models optimized with TensorRT with a Transformers-like API
Language:Python131
qinzheng2000/GeneralTrack
Language:Python183
HZAI-ZJNU/Mamba-YOLO
The official PyTorch implementation of "Mamba-YOLO: SSMs-based for Object Detection"
Language:Python22426
yunlongdong/Awesome-Embodied-AI
24919
haoranD/Awesome-Embodied-AI
A curated list of awesome papers on Embodied AI and related research/industry-driven resources.
2535
huangzongmou/yolov8_Distillation
Language:Python7913
Kudos12th/kudos_yolov5_knowledge_distillation
Implementation of Distilling Object Detectors with Fine-grained Feature Imitation on yolov5
Language:Python72
AILab-CVC/YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Language: Python · 4.4k · 427
winycg/MCL
[AAAI-2022 Oral] Official implementations of MCL: Mutual Contrastive Learning for Visual Representation Learning
Language:Python694
lllyasviel/Fooocus
Focus on prompting and generating
Language: Python · 40.4k · 5.6k
dingmyu/davit
[ECCV 2022] Code for paper "DaViT: Dual Attention Vision Transformer"
Language:Python32233
microsoft/FocalNet
[NeurIPS 2022] Official code for "Focal Modulation Networks"
Language:Python68462
CAMMA-public/SelfPose3d
Official code for "SelfPose3d: Self-Supervised Multi-Person Multi-View 3d Pose Estimation"
Language:Python252
HKUST-LongGroup/Awesome-Open-Vocabulary-Detection-and-Segmentation
Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future
946
lxtGH/OMG-Seg
OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]
Language: Python · 1.2k · 47
chongzhou96/EdgeSAM
Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"
Language:Jupyter Notebook90741
THU-MIG/torch-model-compression
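An automated toolkit for analyzing and modifying the structure of PyTorch models, including a model compression algorithm library that analyzes model structure automatically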
Language:Python23640
SoraWebui/SoraWebui
SoraWebui is an open-source Sora web client, enabling users to easily create videos from text with OpenAI's Sora model.
Language: TypeScript · 2.3k · 509
microsoft/nni
An open-source AutoML toolkit for automating the machine learning lifecycle, including feature engineering, neural architecture search, model compression, and hyper-parameter tuning.
Language: Python · 14k · 1.8k
LiheYoung/Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
Language: Python · 6.8k · 522
paTRICK-swk/P-STMO
[ECCV2022] The PyTorch implementation for "P-STMO: Pre-Trained Spatial Temporal Many-to-One Model for 3D Human Pose Estimation"
Language:Python14710
Arthur151/ROMP
Monocular, One-stage, Regression of Multiple 3D People and their 3D positions & trajectories in camera & global coordinates. ROMP[ICCV21], BEV[CVPR22], TRACE[CVPR2023]
Language: Python · 1.3k · 229