shengyudingli's Stars
kaixindelele/ChatPaper
Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
espnet/espnet
End-to-End Speech Processing Toolkit
open-mmlab/mmocr
OpenMMLab Text Detection, Recognition and Understanding Toolbox
Ruzim/NSFC-application-template-latex
国家自然科学基金申请书正文(面上项目)LaTeX 模板(非官方)
huawei-noah/VanillaNet
qinzheng93/GeoTransformer
[CVPR2022] Geometric Transformer for Fast and Robust Point Cloud Registration
z-bingo/awesome-image-denoising-state-of-the-art
awesome image and video denoising, state of the art networks
ying09/TextFuseNet
A PyTorch implementation of "TextFuseNet: Scene Text Detection with Richer Fused Features".
FudanVI/benchmarking-chinese-text-recognition
This repository contains datasets and baselines for benchmarking Chinese text recognition.
fh2019ustc/Awesome-Document-Image-Rectification
A comprehensive list of awesome document image rectification papers.
opconty/Transformer_STR
PyTorch implementation of my new method for Scene Text Recognition (STR) based on Transformer,Equipped with Transformer, this method outperforms the best model of the aforementioned deep-text-recognition-benchmark by 7.6% on CUTE80.
aasharma90/RetinexNet_PyTorch
Unofficial PyTorch code for the paper - Deep Retinex Decomposition for Low-Light Enhancement, BMVC'18
fh2019ustc/DocScanner
The official repo for “DocScanner: Robust Document Image Rectification with Progressive Learning”, IJCV, 2025.
LMMMEng/TransXNet
[TNNLS 2025] TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition
gwxie/Document-Dewarping-with-Control-Points
Document Dewarping with Control Points
bytedance/SPTSv2
The official implementation of SPTS v2: Single-Point Text Spotting
fh2019ustc/DocGeoNet
The official code for “Geometric Representation Learning for Document Image Rectification”, ECCV, 2022.
MingLunHan/CIF-PyTorch
[ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-Fire mechanism).
vectorobject/faceswap
A GUI for roop, supports replacing faces specified in videos
bytedance/E2STR
The official code for the CVPR 2024 paper: Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer
gwxie/Synthesize-Distorted-Image-and-Its-Control-Points
Synthesize distorted document image and control points.
seungjun45/Water-Filling
My second year project under advisor Prof. Changick Kim (2015.09~2016.03). The technique is for removing illumination distortions for camera captured document images. Source Code Available
George0828Zhang/torch_cif
A fast parallel PyTorch implementation of the "CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition" https://arxiv.org/abs/1905.11235.
onealwj/MVLT
PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition
crazycloud/Handwritten-text-Detection-Detectron2
Handwritten text detection in document images using Detectron2
Joran1101/YOLOv5-CBAM-Seal-Detection
This project uses YOLOv5 with integrated CBAM attention mechanism to perform seals object detection tasks, which can quickly detect circular and elliptical seals.
markytools/strexp
STRExp is a framework that provides Explainability (XAI) to Scene Text Recognition (STR) models.
alwc/synthetic_Chinese_OCR_dataset
Synthetic Chinese OCR dataset, mainly for bill photo recognition taken by mobile phone.
tataganesh/Query-Efficient-Approx-to-improve-OCR