Pinned Repositories
100-Days-Of-ML-Code
100 Days of ML Coding
2020AICITY_Code_From_Top_Teams
The code from the top teams in the 2020 AI City Challenge
2021-CV-Surveys
2021 年,计算机视觉相关综述。包括目标检测、跟踪........
20bn-realtimenet
20bn-realtimenet: Enhance your application with the ability to see and interact with humans using any RGB camera.
3DDFA_V2
The official PyTorch implementation of Towards Fast, Accurate and Stable 3D Dense Face Alignment, ECCV 2020.
3DI
EAGRNet
Edge-aware Graph Representation Learning and Reasoning for Face Parsing (ECCV 2020)
Local-Crowd-Counting
Adaptive Mixture Regression Network with Local Counting Map for Crowd Counting
MGCNet
Self-Supervised Monocular 3D Face Reconstruction by Occlusion-Aware Multi-view Geometry Consistency[ECCV 2020]
Optimized_SoundTouch
Optimized version of SoundTouch
mc261670164's Repositories
mc261670164/awesome-conditional-content-generation
Update-to-data resources for conditional content generation, including human motion generation, image or video generation and editing.
mc261670164/Awesome-Knowledge-Distillation
Awesome Knowledge-Distillation. 分类整理的知识蒸馏paper(2014-2021)。
mc261670164/ComfyUI-PhotoMaker
Unofficial implementation of PhotoMaker for ComfyUI
mc261670164/daily-paper-computer-vision
记录每天整理的计算机视觉/深度学习/机器学习相关方向的论文
mc261670164/demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
mc261670164/dust3r
mc261670164/DWPose
"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)
mc261670164/flowmap
Code for "FlowMap: High-Quality Camera Poses, Intrinsics, and Depth via Gradient Descent" by Cameron Smith*, David Charatan*, Ayush Tewari, and Vincent Sitzmann
mc261670164/Fooocus
Focus on prompting and generating
mc261670164/getIntoGameDev
mc261670164/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
mc261670164/HeadphoneSurroundVirtualization
mc261670164/InstantMesh
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
mc261670164/litepose
[CVPR'22] Lite Pose: Efficient Architecture Design for 2D Human Pose Estimation
mc261670164/LLaMA-Factory
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
mc261670164/mmpose
OpenMMLab Pose Estimation Toolbox and Benchmark.
mc261670164/PhotoMaker
PhotoMaker
mc261670164/plasticity
mc261670164/pyskl
A toolbox for skeleton-based action recognition.
mc261670164/PyTorch-Tutorial-2nd
《Pytorch实用教程》(第二版)无论是零基础入门,还是CV、NLP、LLM项目应用,或是进阶工程化部署落地,在这里都有。相信在本书的帮助下,读者将能够轻松掌握 PyTorch 的使用,成为一名优秀的深度学习工程师。
mc261670164/roop
one-click deepfake (face swap)
mc261670164/sapiens
High-resolution models for human tasks.
mc261670164/T-Rex
T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
mc261670164/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
mc261670164/tinyrenderer
A brief computer graphics / rendering course
mc261670164/ultralytics
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
mc261670164/VidToMe
Official Pytorch Implementation for "VidToMe: Video Token Merging for Zero-Shot Video Editing" (CVPR 2024)
mc261670164/VulkanTutorial
Tutorial for the Vulkan graphics and compute API
mc261670164/Wave-U-Net
Implementation of the Wave-U-Net for audio source separation
mc261670164/yolov5-flask-new
Official implementation at https://github.com/ultralytics/yolov5/tree/master/utils/flask_rest_api