forrestr91

forrestr91's Stars

jlegewie/zotfile
Zotero plugin to manage your attachments: automatically rename, move, and attach PDFs (or other files) to Zotero items, sync PDFs from your Zotero library to your (mobile) PDF reader (e.g. an iPad, Android tablet, etc.), and extract PDF annotations.
Language:Java4k280
MuiseDestiny/zotero-reference
PDF references add-on for Zotero.
Language:JavaScript2k57
datawhalechina/so-large-lm
大模型基础: 一文了解大模型基础知识
2.6k230
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Language:Jupyter Notebook37.3k3.9k
OleehyO/TexTeller
TexTeller can convert image to latex formulas (image2latex, latex OCR) with higher accuracy and exhibits superior generalization ability, enabling it to cover most usage scenarios.
Language:Python30532
RapidAI/RapidLayout
Analysis of Chinese and English layouts 中英文版面分析
Language:Python967
doocs/leetcode
🔥LeetCode solutions in any programming language | 多种编程语言实现 LeetCode、《剑指 Offer（第 2 版）》、《程序员面试金典（第 6 版）》题解
Language:Java31k6.8k
breezedeus/Pix2Text
An Open-Source Python3 tool for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported.
Language:Jupyter Notebook1.8k175
run-llama/llama_parse
Parse files for optimal RAG
Language:Python2.5k251
breezedeus/CnOCR
CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scenarios and can be used directly after installation. 【基于 PyTorch/MXNet 的中文/英文 OCR Python 包。】
Language:Python3.2k497
lukas-blecher/LaTeX-OCR
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Language:Python12.1k993
breezedeus/CnSTD
CnSTD: 基于 PyTorch/MXNet 的中文/英文场景文字检测（Scene Text Detection）、数学公式检测（Mathematical Formula Detection, MFD）、篇章分析（Layout Analysis）的Python3 包
Language:Python672105
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Language:Python19.6k2.5k
VikParuchuri/surya
OCR, layout analysis, reading order, line detection in 90+ languages
Language:Python9.9k644
ZZZHANG-jx/DocRes
[CVPR 2024] DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks
Language:Python29128
xiaotudui/pytorch-tutorial
PyTorch深度学习快速入门教程（绝对通俗易懂！）
Language:Python2.6k610
Gmgge/TrOCR-Seal-Recognition
基于transformer的ocr识别，在公章(印章识别, seal recognition）拓展应用
Language:Python12924
Layout-Parser/layout-parser
A Unified Toolkit for Deep Learning Based Document Image Analysis
Language:Python4.8k461
meta-llama/llama
Inference code for Llama models
Language:Python55.6k9.5k
Tongjilibo/bert4torch
An elegent pytorch implement of transformers
Language:Python1.2k153
Tongjilibo/build_MiniLLM_from_scratch
从0到1构建一个MiniLLM (pretrain+sft+dpo实践中)
Language:Python29836
modelscope/ms-swift
Use PEFT or Full-parameter to finetune 350+ LLMs or 90+ MLLMs. (Qwen2.5, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V-2.6, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)
Language:Python3.5k299
CompVis/stable-diffusion
A latent text-to-image diffusion model
Language:Jupyter Notebook67.6k10.1k
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Language:Python11.3k1k
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
Language:Python31.6k4.7k
clovaai/deep-text-recognition-benchmark
Text recognition (optical character recognition) with deep learning methods, ICCV 2019
Language:Jupyter Notebook3.7k1.1k
hikopensource/DAVAR-Lab-OCR
OCR toolbox from Davar-Lab
Language:Python732156
open-mmlab/mmdetection
OpenMMLab Detection Toolbox and Benchmark
Language:Python29.1k9.4k
PaddlePaddle/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
Language:Python42.8k7.7k
WenmuZhou/PytorchOCR
基于Pytorch的OCR工具库，支持常用的文字检测和识别算法
Language:Python1.4k305