GGaoYipeng's Stars

SpursGoZmy/Tabular-LLM
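This project aims to collect open-source datasets for table intelligence tasks (such as table question answering and table-to-text generation), convert the raw data into instruction-tuning format, and fine-tune LLMs on it, thereby strengthening LLMs' understanding of tabular data and ultimately building large language models specialized for table intelligence tasks.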
48238
Ucas-HaoranWei/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Language: Python · Stars: 6.2k · Forks: 533
VamosC/CLIP4STR
An implementation of "CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model".
Language:Python12515
modelscope/modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
Language: Python · Stars: 7k · Forks: 723
modelscope/3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Language: Python · Stars: 1.3k · Forks: 104
oh-my-ocr/text_renderer
Language:Python803161
Belval/TextRecognitionDataGenerator
A synthetic data generator for text recognition
Language: Python · Stars: 3.3k · Forks: 982
shibing624/pycorrector
pycorrector is a toolkit for text error correction. It applies models such as Kenlm, T5, MacBERT, ChatGLM3, and Qwen2.5 to error-correction scenarios and works out of the box.
Language: Python · Stars: 5.6k · Forks: 1.1k
WenmuZhou/PytorchOCR
A PyTorch-based OCR toolkit that supports common text detection and recognition algorithms.
Language: Python · Stars: 1.4k · Forks: 306
frotms/PaddleOCR2Pytorch
PaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR)
Language:Python877174
MhLiao/DB
A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".
Language: Python · Stars: 2.1k · Forks: 481
tesseract-ocr/tesseract
Tesseract Open Source OCR Engine (main repository)
Language: C++ · Stars: 62.7k · Forks: 9.5k
open-mmlab/mmocr
OpenMMLab Text Detection, Recognition and Understanding Toolbox
Language: Python · Stars: 4.4k · Forks: 754
PaddlePaddle/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
Language: Python · Stars: 44.6k · Forks: 7.9k
JaidedAI/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
Language: Python · Stars: 24.7k · Forks: 3.2k
IDEA-Research/DINO
[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
Language: Python · Stars: 2.3k · Forks: 257
AlibabaResearch/AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
Language: C++ · Stars: 1.5k · Forks: 176
kuleshov-group/llmtools
Finetuning Large Language Models on One Consumer GPU in 2 Bits
Language:Python70976
WeOpenML/PandaLM
Language:Python89167
open-mmlab/Multimodal-GPT
Multimodal-GPT
Language: Python · Stars: 1.5k · Forks: 126
ylsung/Ladder-Side-Tuning
PyTorch codes for "LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning"
Language:Python23210
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Language: Python · Stars: 16.6k · Forks: 1.6k
maximilianigl/DVRL
Deep Variational Reinforcement Learning
Language:Jupyter Notebook13425
DerrickXuNu/dvrl_pytorch
Data Valuation using Reinforcement Learning Pytorch Version
Language:Python15
cloneofsimo/lora
Using Low-rank adaptation to quickly fine-tune diffusion models.
Language: Jupyter Notebook · Stars: 7.1k · Forks: 482
microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Language: Python · Stars: 10.8k · Forks: 691
mczhuge/Kaleido-BERT
💐Kaleido-BERT: Vision-Language Pre-training on Fashion Domain
Language:Python26219
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Language: Python · Stars: 169k · Forks: 44.5k
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language: Jupyter Notebook · Stars: 47.9k · Forks: 5.7k
google-research/google-research
Google Research
Language: Jupyter Notebook · Stars: 34.4k · Forks: 7.9k