forrestr91's Stars
jlegewie/zotfile
Zotero plugin to manage your attachments: automatically rename, move, and attach PDFs (or other files) to Zotero items, sync PDFs from your Zotero library to your (mobile) PDF reader (e.g. an iPad, Android tablet, etc.), and extract PDF annotations.
MuiseDestiny/zotero-reference
PDF references add-on for Zotero.
datawhalechina/so-large-lm
大模型基础: 一文了解大模型基础知识
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
OleehyO/TexTeller
TexTeller can convert image to latex formulas (image2latex, latex OCR) with higher accuracy and exhibits superior generalization ability, enabling it to cover most usage scenarios.
RapidAI/RapidLayout
Analysis of Chinese and English layouts 中英文版面分析
doocs/leetcode
🔥LeetCode solutions in any programming language | 多种编程语言实现 LeetCode、《剑指 Offer(第 2 版)》、《程序员面试金典(第 6 版)》题解
breezedeus/Pix2Text
An Open-Source Python3 tool for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported.
run-llama/llama_parse
Parse files for optimal RAG
breezedeus/CnOCR
CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scenarios and can be used directly after installation. 【基于 PyTorch/MXNet 的中文/英文 OCR Python 包。】
lukas-blecher/LaTeX-OCR
pix2tex: Using a ViT to convert images of equations into LaTeX code.
breezedeus/CnSTD
CnSTD: 基于 PyTorch/MXNet 的 中文/英文 场景文字检测(Scene Text Detection)、数学公式检测(Mathematical Formula Detection, MFD)、篇章分析(Layout Analysis)的Python3 包
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
VikParuchuri/surya
OCR, layout analysis, reading order, line detection in 90+ languages
ZZZHANG-jx/DocRes
[CVPR 2024] DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks
xiaotudui/pytorch-tutorial
PyTorch深度学习快速入门教程(绝对通俗易懂!)
Gmgge/TrOCR-Seal-Recognition
基于transformer的ocr识别,在公章(印章识别, seal recognition)拓展应用
Layout-Parser/layout-parser
A Unified Toolkit for Deep Learning Based Document Image Analysis
meta-llama/llama
Inference code for Llama models
Tongjilibo/bert4torch
An elegent pytorch implement of transformers
Tongjilibo/build_MiniLLM_from_scratch
从0到1构建一个MiniLLM (pretrain+sft+dpo实践中)
modelscope/ms-swift
Use PEFT or Full-parameter to finetune 350+ LLMs or 90+ MLLMs. (Qwen2.5, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V-2.6, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)
CompVis/stable-diffusion
A latent text-to-image diffusion model
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
clovaai/deep-text-recognition-benchmark
Text recognition (optical character recognition) with deep learning methods, ICCV 2019
hikopensource/DAVAR-Lab-OCR
OCR toolbox from Davar-Lab
open-mmlab/mmdetection
OpenMMLab Detection Toolbox and Benchmark
PaddlePaddle/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
WenmuZhou/PytorchOCR
基于Pytorch的OCR工具库,支持常用的文字检测和识别算法