tanphan07

tanphan07's Stars

autodistill/autodistill
Images to inference with no labeling (use foundation models to train supervised models).
Language:Python1.7k131
vietanhdev/samexporter
Export Segment Anything Models to ONNX
Language:Python18323
z-x-yang/Segment-and-Track-Anything
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.
Language:Jupyter Notebook2.6k320
HYOJINPARK/TTVOS
Language:Python11
gaomingqi/Track-Anything
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
Language:Python6.3k462
CASIA-IVA-Lab/FastSAM
Fast Segment Anything
Language:Python7.1k665
detectRecog/PointTrack
PointTrack (ECCV2020 ORAL): Segment as Points for Efficient Online Multi-Object Tracking and Segmentation
Language:Python26045
vietanhdev/anylabeling
Effortless AI-assisted data labeling with AI support from YOLO, Segment Anything, MobileSAM!!
Language:Python2k218
facebookresearch/ijepa
Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive architecture."
Language:Python2.7k334
qianqianwang68/omnimotion
Language:Python2.1k119
hustvl/YOLOP
You Only Look Once for Panopitic Driving Perception.（MIR2022）
Language:Python1.9k403
cmhungsteve/Awesome-Transformer-Attention
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
4.4k481
AlanLuSun/Few-shot-keypoint-detection
A novel few-shot keypoint detector with uncertainty learning for unseen species (CVPR2022).
Language:Python292
thuml/Transfer-Learning-Library
Transfer Learning Library for Domain Adaptation, Task Adaptation, and Domain Generalization
Language:Python3.2k544
dvlab-research/DecoupleNet
Official implementation for our ECCV 2022 paper "DecoupleNet: Decoupled Network for Domain Adaptive Semantic Segmentation"
Language:Python372
mindee/doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
Language:Python3.3k397
lh9171338/Line-Segment-Detection-Papers
Line segment detection papers
23527
dz-id/fb_get_token_from_cookie
Only with cookies you can take the fb access token very easily and no checkpoints
Language:Python3823
beacandler/EATEN
EATEN: Entity-aware Attention for Single Shot Visual Text Extraction
17217
ricardobnjunior/Brazilian-Identity-Document-Dataset
Brazilian Identity Document Dataset (BID Dataset): The first public dataset of Brazilian identification documents.
5810
mit-han-lab/efficientvit
EfficientViT is a new family of vision models for efficient high-resolution vision.
Language:Python1.6k141
lhwcv/mlsd_pytorch
Pytorch implementation of "M-LSD: Towards Light-weight and Real-time Line Segment Detection"
Language:Python18336
HCIILAB/Scene-Text-Recognition
602117
dhlab-epfl/dhSegment
Generic framework for historical document processing
Language:Python370116
microsoft/table-transformer
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.
Language:Python2k231
fh2019ustc/Awesome-Document-Image-Rectification
A comprehensive list of awesome document image rectification papers.
33630
cv-small-snails/Awesome-Table-Recognition
A curated list of resources dedicated to table recognition
33647
MarkMoHR/Awesome-Edge-Detection-Papers
:books: A collection of edge/contour/boundary detection papers and toolbox.
1.3k251
samylee/Towards-Realtime-MOT-Cpp
A C++ codebase implementation of Towards-Realtime-MOT
Language:C++11824
triple-Mu/YOLOv8-TensorRT
YOLOv8 using TensorRT accelerate !
Language:C++1.2k209