YRQ66's Stars
olibridge01/TeXOCR
Optical Character Recognition (OCR) model for Image-to-LaTeX conversion
RQLuo/MixTeX-Latex-OCR
MixTeX: multimodal LaTeX, ZhEn, and table OCR. It performs efficient CPU-based inference locally and offline on Windows.
Eleanoreee/latex-OCR
An attempt to revise existing methods
ZN1010/PEaCE
[LREC-COLING 2024] PEaCE: A Chemistry-Oriented Dataset for Optical Character Recognition on Scientific Documents. Boost OCR Performance on Scientific Documents.
haandfeng/transfomer-restnet-for-latex-and-Chinese
Text and math formula recognition based on Transformer and ResNet
iFLYTEK-CV/EDU-CHEMC
EDU-CHEMC, a handwritten chemical structure image dataset consisting of 52,987 handwritten molecular structure images collected in educational scenarios.
longchentian/Pix2Text-nougat-texify-GUI
GUI for an offline LaTeX OCR tool supporting three models: Pix2Text, nougat, and texify
MrSingh-bytes/MathOCR
Math Formula (MathML) Complexity Analyzer and LaTeX formula Comparison
chaodreaming/doc2x
Convert documents to Markdown, LaTeX, etc. with detection and OCR models
KyuDan1/TeX2Image
OleehyO/TexTeller
TexTeller converts images to LaTeX formulas (image2latex, LaTeX OCR) with higher accuracy and superior generalization ability, enabling it to cover most usage scenarios.
longxiaofei/spider-BaiduIndex
Data SDK for Baidu Index
ZL-ZedeL/NeteaseSearchCommentAnalysis
NetEase Cloud Music comment crawler and comment visualization
mnt-ltd/moredoc
moredoc (魔豆文库), an open-source document library system similar to Baidu Wenku, developed in Go; a refactored version of the dochub library.
facebookresearch/nougat
Implementation of Nougat Neural Optical Understanding for Academic Documents
liuhuanyong/ChainKnowledgeGraph
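ChainKnowledgeGraph, an industry-chain knowledge graph covering three entity types (A-share listed companies, industries, and products) and six relation types: the industry a listed company belongs to, parent industry, upstream raw materials of a product, downstream products of a product, a company's main products, and product subcategories. It contains 4,654 listed companies, 511 industries, 95,559 products, 56,824 upstream materials, 480 parent industries, 390 downstream products, 52,937 product subcategories, and 3,946 industry memberships.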
wenhwu/awesome-remote-sensing-change-detection
List of datasets, code, and contests related to remote sensing change detection
mrpositron/paper2tex
Extracting LaTeX equations from PDF
RapidAI/RapidLaTeXOCR
Formula recognition based on LaTeX-OCR and ONNXRuntime.
bcaitech1/p4-fr-ocr-oriental-chicken-curry
p4-fr-ocr-oriental-chicken-curry created by GitHub Classroom
justinzm/gopup
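Data interfaces: Baidu, Google, Toutiao, and Weibo indices; macroeconomic data; interest rate data; currency exchange rates; Qianlima (千里马) and unicorn companies; Xinwen Lianbo transcripts; film and TV box office data; university lists; epidemic data…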
tianchiguaixia/medical_ocr_streamlit
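This project recognizes tabular data in images, extracts and processes it, and exports it as a CSV file. The whole project is deployed and demonstrated with streamlit. Technologies used: paddleocr, PPStructure, streamlit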
hasibzunair/cv-pytorch-tutorials
Tutorials on deep learning in computer vision with PyTorch.
MODCT/Celery-LaTex-OCR
Another LaTeX formula OCR tool
tuiiitendinh/LaTeX-ConvNeXt
MODCT/CeleryMath
Another LaTeX equation OCR tool based on ConvNeXt and Transformer
PaddlePaddle/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (a practical, ultra-lightweight OCR system that supports recognition of 80+ languages, provides data annotation and synthesis tools, and supports training and deployment on server, mobile, embedded, and IoT devices)
karpathy/nn-zero-to-hero
Neural Networks: Zero to Hero
wagtail/wagtail
A Django content management system focused on flexibility and user experience
tal-tech/SAN
Syntax-Aware Network for Handwritten Mathematical Expression Recognition