Pinned Repositories
3D-Box-Segment-Anything
We extend Segment Anything to 3D perception by combining it with VoxelNeXt.
AdaFace
APTM
The official code of "Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark"
ARKitTrack
PyTorch implementation of ARKitTrack for CVPR'2023 paper "ARKitTrack: A New Diverse Dataset for Tracking Using Mobile RGB-D Data", by Haojie Zhao, Junsong Chen, Lijun Wang, Huchuan Lu. Code will be released here.
AutoPrompt
A framework for prompt tuning using Intent-based Prompt Calibration
awesome-virtual-try-on
A curated list of awesome research papers, projects, code, dataset, workshops etc. related to virtual try-on.
FacePose_pytorch
🔥🔥The pytorch implement of the head pose estimation(yaw,roll,pitch) and emotion detection with SOTA performance in real time.Easy to deploy, easy to use, and high accuracy.Solve all problems of face detection at one time.(极简,极快,高效是我们的宗旨)
HairMapper
HairMapper: Removing Hair from Portraits Using GANs
TextLogoLayout
[CVPR 2022] Aesthetic Text Logo Synthesis via Content-aware Layout Inferring
yolov7
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
sssssshf's Repositories
sssssshf/3D-Box-Segment-Anything
We extend Segment Anything to 3D perception by combining it with VoxelNeXt.
sssssshf/AdaFace
sssssshf/APTM
The official code of "Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark"
sssssshf/ARKitTrack
PyTorch implementation of ARKitTrack for CVPR'2023 paper "ARKitTrack: A New Diverse Dataset for Tracking Using Mobile RGB-D Data", by Haojie Zhao, Junsong Chen, Lijun Wang, Huchuan Lu. Code will be released here.
sssssshf/AutoPrompt
A framework for prompt tuning using Intent-based Prompt Calibration
sssssshf/awesome-virtual-try-on
A curated list of awesome research papers, projects, code, dataset, workshops etc. related to virtual try-on.
sssssshf/chat2KnowL
知识文档问答,用大模型与文档对话,提供Al分析、阅读、问答工具,助你快速了解文档内容。
sssssshf/chineseocr_lite
超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M
sssssshf/CoDet
(NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection
sssssshf/DINOv
[CVPR 2024] Official implementation of the paper "Visual In-context Learning"
sssssshf/fc-clip
[NeurIPS 2023] This repo contains the code for our paper Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP
sssssshf/Fracture_Detection_Improved_YOLOv8
ICONIP 2024
sssssshf/Inpaint-Anything
Inpaint anything using Segment Anything and inpainting models.
sssssshf/InternLM-XComposer
InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.
sssssshf/KuiperInfer
带你从零实现一个高性能的深度学习推理库,支持大模型 llama2 、Unet、Yolov5、Resnet等模型的推理。Implement a high-performance deep learning inference library step by step
sssssshf/MaskFaceTool
This project aims to add masks to the facial dataset, which is based on FMA-3D and constructs a effective, easy to operate, and efficient pipeline for facial detection, alignment, and mask wearing.
sssssshf/MMDU
Official repository of MMDU dataset
sssssshf/mPLUG-Owl
mPLUG-Owl & mPLUG-Owl2: Modularized Multimodal Large Language Model
sssssshf/MR-GDINO
sssssshf/onnx2tflite
Tool for onnx->keras or onnx->tflite. If tool is useful for you, please star it.
sssssshf/OPERA
[CVPR 2024 Highlight] OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation
sssssshf/PromptAgent
This is the official repo for "PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization". PromptAgent is a novel automatic prompt optimization method that autonomously crafts prompts equivalent in quality to those handcrafted by experts, i.e., expert-level prompts.
sssssshf/revTongYi
阿里云 通义千问、通义万相 逆向工程 Python API
sssssshf/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
sssssshf/SHIKE
Long-Tailed Visual Recognition via Self-Heterogeneous Integration with Knowledge Excavation (CVPR 2023)
sssssshf/Simple-TensorRT
Secondary encapsulation of NVIDIA TensorRT interface to simplify the calling process
sssssshf/tensorRT_Pro
C++ library based on tensorrt integration
sssssshf/ViP-LLaVA
[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts
sssssshf/VisualGLM-6B
Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型
sssssshf/YoloV7-ncnn-Raspberry-Pi-4
YoloV7 for a bare Raspberry Pi using ncnn.