icecream-Tnak's Stars
pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
binary-husky/gpt_academic
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
opendatalab/PDF-Extract-Kit
A Comprehensive Toolkit for High-Quality PDF Content Extraction
OFA-Sys/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
lixin4ever/Conference-Acceptance-Rate
Acceptance rates for the major AI conferences
bighuang624/AI-research-tools
:hammer:AI 方向好用的科研工具
keyu-tian/SparK
[ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; Pytorch impl. of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling"
TorchSSL/TorchSSL
A PyTorch-based library for semi-supervised learning (NeurIPS'21)
baudm/parseq
Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)
yeungchenwa/OCR-SAM
Combining MMOCR with Segment Anything & Stable Diffusion. Automatically detect, recognize and segment text instances, with serval downstream tasks, e.g., Text Removal and Text Inpainting
IDEA-Research/DAB-DETR
[ICLR 2022] Official implementation of the paper "DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR"
whai362/pan_pp.pytorch
Official implementations of PSENet, PAN and PAN++.
FudanVI/FudanOCR
A toolbox of scene text super-resolution and recognition
HCIILAB/Scene-Text-Recognition-Recommendations
Papers, Datasets, Algorithms, SOTA for STR. Long-time Maintaining
fei-aiart/courses
课件:数字图像处理,深度学习,计算机视觉,机器学习
wenwenyu/MASTER-pytorch
Code for the paper "MASTER: Multi-Aspect Non-local Network for Scene Text Recognition" (Pattern Recognition 2021)
houbb/nlp-hanzi-similar
The hanzi similar tool.(汉字相似度计算工具,中文形近字算法。可用于手写汉字识别纠正,文本混淆等。)
csxmli2016/MARCONet
Learning Generative Structure Prior for Blind Text Image Super-resolution [CVPR 2023]
bupt-ai-cz/Meta-SelfLearning
Meta Self-learning for Multi-Source Domain Adaptation: A Benchmark
clin1223/VLDet
[ICLR 2023] PyTorch implementation of VLDet (https://arxiv.org/abs/2211.14843)
yflv-yanxia/scene_text
TongkunGuan/SIGA
[CVPR2023] Self-supervised Implicit Glyph Attention for Text Recognition
xiaoachen98/DALN
[CVPR2022] Official implementation of DALN.
AprilYapingZhang/Seq2SeqAdapt
Adversarial Sequence-to-sequence Domain Adaptation Network dubbed ASSDA for robust text image recognition
BADBADBADBOY/OCR-TextRecog
useful text recognition algorithms, CRNN and SVTR text recognition
comojin1994/DFformer
Towards Domain Free Transformer for Generalized EEG Pre-training
BOYang-pro/LFDT-Fusion
The code of "LFDT-Fusion: A Latent Feature-guided Diffusion Transformer Model for General Image Fusion"
wenwenyu/AudioOCR
Looking and Listening: Audio Guided Text Recognition
ML-HDU/LBL_LBLSig
ML-HDU/MRAE