GGaoYipeng's Stars
SpursGoZmy/Tabular-LLM
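This project aims to collect open-source datasets for table intelligence tasks (such as table question answering and table-to-text generation), convert the raw data into instruction-tuning format, and fine-tune LLMs on it, thereby strengthening LLMs' understanding of tabular data and ultimately building large language models specialized for table intelligence tasks.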
Ucas-HaoranWei/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
VamosC/CLIP4STR
An implementation of "CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model".
modelscope/modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
modelscope/3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
oh-my-ocr/text_renderer
Belval/TextRecognitionDataGenerator
A synthetic data generator for text recognition
shibing624/pycorrector
pycorrector is a toolkit for text error correction. It applies models such as Kenlm, T5, MacBERT, ChatGLM3, and Qwen2.5 to error-correction scenarios and works out of the box.
WenmuZhou/PytorchOCR
A PyTorch-based OCR toolkit that supports commonly used text detection and recognition algorithms.
frotms/PaddleOCR2Pytorch
PaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR)
MhLiao/DB
A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".
tesseract-ocr/tesseract
Tesseract Open Source OCR Engine (main repository)
open-mmlab/mmocr
OpenMMLab Text Detection, Recognition and Understanding Toolbox
PaddlePaddle/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
JaidedAI/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
IDEA-Research/DINO
[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
AlibabaResearch/AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
kuleshov-group/llmtools
Finetuning Large Language Models on One Consumer GPU in 2 Bits
WeOpenML/PandaLM
open-mmlab/Multimodal-GPT
Multimodal-GPT
ylsung/Ladder-Side-Tuning
PyTorch code for "LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning"
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
maximilianigl/DVRL
Deep Variational Reinforcement Learning
DerrickXuNu/dvrl_pytorch
Data Valuation using Reinforcement Learning Pytorch Version
cloneofsimo/lora
Using Low-rank adaptation to quickly fine-tune diffusion models.
microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
mczhuge/Kaleido-BERT
💐Kaleido-BERT: Vision-Language Pre-training on Fashion Domain
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
google-research/google-research
Google Research