lyhh123

computer technology master student.

ChengDu, China

lyhh123's Stars

UniModal4Reasoning/DocGenome
DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models
Language:Jupyter Notebook1304
Ucas-HaoranWei/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Language:Python5.6k461
dailenson/One-DM
Official Code for ECCV 2024 paper — One-Shot Diffusion Mimicker for Handwritten Text Generation
Language:Python26322
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Language:Python2.7k156
Topdu/OpenOCR
Language:Python17518
hiroi-sora/GapTree_Sort_Algorithm
【间隙·树·排序算法】对OCR结果或PDF提取的文本进行版面分析，按人类阅读顺序进行排序。
Language:Python9713
CVCUDA/CV-CUDA
CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.
Language:C++2.4k216
NaiboWang/EasySpider
A visual no-code/code-free web crawler/spider易采集：一个可视化浏览器自动化测试/数据采集/爬虫软件，可以无代码图形化的设计和执行爬虫任务。别名：ServiceWrapper面向Web应用的智能化服务封装系统。
Language:JavaScript35.1k4.3k
lyhh123/MTF-110K
A Comprehensive Dataset for Mixed Text and Formula Recognition in Educational and Scientific Documents
1
dailenson/SDT
This repository is the official implementation of Disentangling Writer and Character Styles for Handwriting Generation (CVPR 2023)
Language:Python1k85
lllyasviel/IC-Light
More relighting!
Language:Python5k342
whai362/PVT
Official implementation of PVT series
Language:Python1.7k244
PaddlePaddle/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
Language:Python43.6k7.8k
binary-husky/gpt_academic
为GPT/GLM等LLM大语言模型提供实用化交互接口，特别优化论文阅读/润色/写作体验，模块化设计，支持自定义快捷按钮&函数插件，支持Python和C++等项目剖析&自译解功能，PDF/LaTex论文翻译&总结功能，支持并行问询多种LLM模型，支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
Language:Python65k8k
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Language:Python5.8k453
AlibabaResearch/AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
Language:C++1.4k170
LiuHC0428/LAW-GPT
中文法律对话语言模型
Language:Python1k116
Ucas-HaoranWei/Vary-toy
Official code implementation of Vary-toy (Small Language Model Meets with Reinforced Vision Vocabulary)
Language:Python59845
facebookresearch/nougat
Implementation of Nougat Neural Optical Understanding for Academic Documents
Language:Python8.9k562
Yuliang-Liu/MultimodalOCR
On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)
Language:Python46129
meta-llama/llama3
The official Meta Llama 3 GitHub site
Language:Python26.8k3k
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Language:Python13.8k1.3k
Yuliang-Liu/Monkey
【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
Language:Python1.8k126
Pythagora-io/gpt-pilot
The first real AI developer
Language:Python31.5k3.2k
chineseocr/trocr-chinese
transformers ocr for chinese
Language:Python35154
mamba-org/mamba
The Fast Cross-Platform Package Manager
Language:C++6.9k353
dair-ai/ml-visuals
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
13.4k1.4k
thuml/iTransformer
Official implementation for "iTransformer: Inverted Transformers Are Effective for Time Series Forecasting" (ICLR 2024 Spotlight), https://openreview.net/forum?id=JePfAI8fah
Language:Python1.2k215
megvii-research/NAFNet
The state-of-the-art image restoration model without nonlinear activation functions.
Language:Python2.2k270
HCIILAB/Scene-Text-Recognition
604118

lyhh123

lyhh123's Stars

UniModal4Reasoning/DocGenome

Ucas-HaoranWei/GOT-OCR2.0

dailenson/One-DM

QwenLM/Qwen2-VL

Topdu/OpenOCR

hiroi-sora/GapTree_Sort_Algorithm

CVCUDA/CV-CUDA

NaiboWang/EasySpider

lyhh123/MTF-110K

dailenson/SDT

lllyasviel/IC-Light

whai362/PVT

PaddlePaddle/PaddleOCR

binary-husky/gpt_academic

OpenGVLab/InternVL

AlibabaResearch/AdvancedLiterateMachinery

LiuHC0428/LAW-GPT

Ucas-HaoranWei/Vary-toy

facebookresearch/nougat

Yuliang-Liu/MultimodalOCR

meta-llama/llama3

Dao-AILab/flash-attention

Yuliang-Liu/Monkey

Pythagora-io/gpt-pilot

chineseocr/trocr-chinese

mamba-org/mamba

dair-ai/ml-visuals

thuml/iTransformer

megvii-research/NAFNet

HCIILAB/Scene-Text-Recognition