Pinned Repositories
Awesome-Depth-Estimation
Awesome List for Depth Estimation
Awesome-Fine-grained-Visual-Classification
Awesome Fine-grained Visual Classification
awesome-yolo-object-detection
🚀🚀🚀 A collection of some awesome public YOLO object detection series projects.
CAL
[ICCV 2021] Counterfactual Attention Learning for Fine-Grained Visual Categorization and Re-identification
CLIP4Clip
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
coco-caption
A Python 3 version of coco-caption with SPICE.
DiffSketcher
[NeurIPS 2023] Official implementation for "DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models" https://arxiv.org/abs/2306.14685
finetune-anything
Fine-tune SAM (Segment Anything Model) for computer vision tasks such as semantic segmentation, matting, detection ... in specific scenarios
GeoWizard
[arXiv'24] GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image
HMN
[CVPR 2022] Official code for Hierarchical Modular Network for Video Captioning. Our proposed HMN is implemented with PyTorch.
Yhc-777's Repositories
Yhc-777/Awesome-Depth-Estimation
Awesome List for Depth Estimation
Yhc-777/Awesome-Fine-grained-Visual-Classification
Awesome Fine-grained Visual Classification
Yhc-777/awesome-yolo-object-detection
🚀🚀🚀 A collection of some awesome public YOLO object detection series projects.
Yhc-777/CAL
[ICCV 2021] Counterfactual Attention Learning for Fine-Grained Visual Categorization and Re-identification
Yhc-777/CLIP4Clip
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
Yhc-777/coco-caption
A Python 3 version of coco-caption with SPICE.
Yhc-777/DiffSketcher
[NeurIPS 2023] Official implementation for "DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models" https://arxiv.org/abs/2306.14685
Yhc-777/finetune-anything
Fine-tune SAM (Segment Anything Model) for computer vision tasks such as semantic segmentation, matting, detection ... in specific scenarios
Yhc-777/GeoWizard
[arXiv'24] GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image
Yhc-777/HMN
[CVPR 2022] Official code for Hierarchical Modular Network for Video Captioning. Our proposed HMN is implemented with PyTorch.
Yhc-777/LeYOLO
Yhc-777/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
Yhc-777/MiniCPM-V
MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone
Yhc-777/object_feature_extraction
Extracts object features using yolov516.pth and fasterrcnn_resnet50_fpn_coco-258fb6c6.pth.
Yhc-777/PromptIR
PromptIR: Prompting for All-in-One Blind Image Restoration [NeurIPS 2023]
Yhc-777/Semantic-Segment-Anything
Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).
Yhc-777/V3D
V3D: Video Diffusion Models are Effective 3D Generators
Yhc-777/video_features
Extract video features from raw videos using multiple GPUs. We support RAFT and PWC flow frames as well as I3D, R(2+1)D, VGGish, ResNet, CLIP features.
Yhc-777/Wonder3D
Single Image to 3D using Cross-Domain Diffusion for 3D Generation
Yhc-777/X-AnyLabeling
Effortless data labeling with AI support from Segment Anything and other awesome models.
Yhc-777/yolov10
YOLOv10: Real-Time End-to-End Object Detection
Yhc-777/yolov9
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
Yhc-777/yolov9-onnx
Python Implementation for Performing Object Detection Using YOLOv9 with ONNX & ONNXRuntime