tanphan07's Stars
autodistill/autodistill
Images to inference with no labeling (use foundation models to train supervised models).
vietanhdev/samexporter
Export Segment Anything Models to ONNX
z-x-yang/Segment-and-Track-Anything
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.
HYOJINPARK/TTVOS
gaomingqi/Track-Anything
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
CASIA-IVA-Lab/FastSAM
Fast Segment Anything
detectRecog/PointTrack
PointTrack (ECCV2020 ORAL): Segment as Points for Efficient Online Multi-Object Tracking and Segmentation
vietanhdev/anylabeling
Effortless AI-assisted data labeling, with support from YOLO, Segment Anything, and MobileSAM.
facebookresearch/ijepa
Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive architecture."
qianqianwang68/omnimotion
hustvl/YOLOP
You Only Look Once for Panoptic Driving Perception (MIR 2022).
cmhungsteve/Awesome-Transformer-Attention
A comprehensive paper list on Vision Transformer/Attention, including papers, code, and related websites
AlanLuSun/Few-shot-keypoint-detection
A novel few-shot keypoint detector with uncertainty learning for unseen species (CVPR2022).
thuml/Transfer-Learning-Library
Transfer Learning Library for Domain Adaptation, Task Adaptation, and Domain Generalization
dvlab-research/DecoupleNet
Official implementation for our ECCV 2022 paper "DecoupleNet: Decoupled Network for Domain Adaptive Semantic Segmentation"
mindee/doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
lh9171338/Line-Segment-Detection-Papers
Line segment detection papers
dz-id/fb_get_token_from_cookie
Retrieve a Facebook access token easily using only cookies, with no checkpoints
beacandler/EATEN
EATEN: Entity-aware Attention for Single Shot Visual Text Extraction
ricardobnjunior/Brazilian-Identity-Document-Dataset
Brazilian Identity Document Dataset (BID Dataset): The first public dataset of Brazilian identification documents.
mit-han-lab/efficientvit
EfficientViT is a new family of vision models for efficient high-resolution vision.
lhwcv/mlsd_pytorch
PyTorch implementation of "M-LSD: Towards Light-weight and Real-time Line Segment Detection"
HCIILAB/Scene-Text-Recognition
dhlab-epfl/dhSegment
Generic framework for historical document processing
microsoft/table-transformer
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.
fh2019ustc/Awesome-Document-Image-Rectification
A comprehensive list of awesome document image rectification papers.
cv-small-snails/Awesome-Table-Recognition
A curated list of resources dedicated to table recognition
MarkMoHR/Awesome-Edge-Detection-Papers
A collection of edge/contour/boundary detection papers and toolboxes.
samylee/Towards-Realtime-MOT-Cpp
A C++ codebase implementation of Towards-Realtime-MOT
triple-Mu/YOLOv8-TensorRT
YOLOv8 accelerated with TensorRT.