ir2718's Stars
jlevy/the-art-of-command-line
Master the command line, in one page
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
dair-ai/ML-YouTube-Courses
📺 Discover the latest machine learning / AI courses on YouTube.
stas00/ml-engineering
Machine Learning Engineering Open Book
Ucas-HaoranWei/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
KevinMusgrave/pytorch-metric-learning
The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
clovaai/deep-text-recognition-benchmark
Text recognition (optical character recognition) with deep learning methods, ICCV 2019
Peterande/D-FINE
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement [ICLR 2025 Spotlight]
AnswerDotAI/ModernBERT
Bringing BERT into modernity via both architecture changes and scaling
MultimediaTechLab/YOLO
An MIT License of YOLOv9, YOLOv7, YOLO-RD
speedyapply/2025-AI-College-Jobs
2025 AI/ML internship & new graduate job list updated daily
OML-Team/open-metric-learning
Metric learning and retrieval pipelines, models and zoo.
kodestan/tank-ops
Topdu/OpenOCR
OpenOCR: A general OCR system with accuracy and efficiency. Supporting 24 Scene Text Recognition methods trained from scratch on large-scale real datasets, and will continue to add the latest methods.
skeskinen/bert.cpp
ggml implementation of BERT
FudanVI/FudanOCR
A toolbox of scene text super-resolution and recognition
ENSTA-U2IS-AI/torch-uncertainty
Open-source framework for uncertainty and deep learning models in PyTorch :seedling:
m-bain/frozen-in-time
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]
yardenfren1996/B-LoRA
Implicit Style-Content Separation using B-LoRA
pystiche/pystiche
Framework for Neural Style Transfer (NST) built upon PyTorch
mindspore-lab/mindocr
A toolbox of ocr models and algorithms based on MindSpore
Mountchicken/Union14M
[ICCV 2023] Code base for Revisiting Scene Text Recognition: A Data Perspective
line/lighthouse
[EMNLP2024 Demo], [ICASSP 2025] A user-friendly library for reproducible video moment retrieval and highlight detection. It also supports audio moment retrieval.
cv-small-snails/Text-Recognition-Material
Papers, Datasets, Algorithms, SOTA for STR. Long-time Maintaining
mxin262/Bridging-Text-Spotting
(CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.
herobd/NAF_dataset
Form images from U.S. National Archives annotated with text bounding boxes, classes, relationships, and transcription.
josipjukic/alanno
Rong-Zou/Retrieval-Robust-to-Object-Motion-Blur
Pytorch code for the ECCV 2024 paper: Retrieval Robust to Object Motion Blur
Martinsos/Rijecalica
Program for winning the mobile app "Rijecalica"
kuznetsoffandrey/SportLogo
A new sport teams logo dataset for detection tasks