zhn6818's Stars
tesseract-ocr/tesseract
Tesseract Open Source OCR Engine (main repository)
xingyizhou/CenterNet
Object detection, 3D detection, and pose estimation using center point detection
gaomingqi/Track-Anything
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
codertimo/BERT-pytorch
Google AI 2018 BERT pytorch implementation
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Belval/TextRecognitionDataGenerator
A synthetic data generator for text recognition
cvg/Hierarchical-Localization
Visual localization made easy with hloc
jingyaogong/minimind
[Large models] Train a small 26M-parameter GPT entirely from scratch in 3 hours; inference and training both run on a personal GPU!
vietanhdev/anylabeling
Effortless data labeling with AI support from YOLO, Segment Anything (SAM+SAM2), and MobileSAM!
zju3dv/LoFTR
Code for "LoFTR: Detector-Free Local Feature Matching with Transformers", CVPR 2021, T-PAMI 2022
fudan-zvg/Semantic-Segment-Anything
Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).
facebookresearch/Detic
Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".
om-ai-lab/OmDet
Real-time and accurate open-vocabulary end-to-end object detection
mihaidusmanu/d2-net
D2-Net: A Trainable CNN for Joint Description and Detection of Local Features
ziqi-jin/finetune-anything
Fine-tune SAM (Segment Anything Model) for computer vision tasks such as semantic segmentation, matting, detection ... in specific scenarios
erikbern/deep-fonts
Generate fonts using deep learning
datvuthanh/HybridNets
HybridNets: End-to-End Perception Network
tobybreckon/fire-detection-cnn
Real-time fire detection in video imagery using a convolutional neural network (deep learning), from our ICIP 2018 paper (Dunnings / Breckon) and ICMLA 2019 paper (Samarth / Bhowmik / Breckon)
JeffersonQin/YuzuMarker.FontDetection
✨ First-ever CJK (Chinese, Japanese, Korean) font recognition and style extraction model, a side project of YuzuMarker
aim-uofa/Matcher
[ICLR'24] Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching
Parskatt/DeDoDe
[3DV 2024 Oral] DeDoDe 🎶 Detect, Don't Describe --- Describe, Don't Detect, for Local Feature Matching
fabio-sim/LightGlue-ONNX
ONNX-compatible LightGlue: Local Feature Matching at Light Speed. Supports TensorRT, OpenVINO
Curt-Park/segment-anything-with-clip
Segment Anything combined with CLIP
RockeyCoss/Prompt-Segment-Anything
This is an implementation of zero-shot instance segmentation using Segment Anything.
robinreni96/Font_Recognition-DeepFont
A Keras implementation of "DeepFont: Identify Your Font from An Image"
PacktPublishing/Qt-5-and-OpenCV-4-Computer-Vision-Projects
Qt 5 and OpenCV 4 Computer Vision Projects, published by Packt
Enzo-MiMan/cv_related_collections
MaybeShewill-CV/segment-anything-u-specify
Uses CLIP and SAM to segment any instance you specify with a text prompt naming it
fabio-sim/DeDoDe-ONNX-TensorRT
ONNX-compatible DeDoDe 🎶 Detect, Don't Describe - Describe, Don't Detect, for Local Feature Matching. Supports TensorRT 🚀
HemingwayLee/deepfont-implement