zhangjx123

Shanghai

zhangjx123's Stars

abi/screenshot-to-code
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
Language:Python66.6k 397 3548.1k
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Language:Python23k 190 5242.3k
pyecharts/pyecharts
🎨 Python Echarts Plotting Library
Language:Python15k 380 1.9k2.9k
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
13.4k 259 129850
lukas-blecher/LaTeX-OCR
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Language:Python13.2k 74 2751.1k
facebookresearch/nougat
Implementation of Nougat Neural Optical Understanding for Academic Documents
Language:Python9.1k 63 215582
LargeWorldModel/LWM
Large World Model -- Modeling Text and Video with Millions Context
Language:Python7.2k 66 71554
mit-han-lab/efficientvit
Efficient vision foundation models for high-resolution generation and perception.
Language:Python2.5k 40 151201
MhLiao/DB
A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".
Language:Python2.1k 44 366480
Gsllchb/Handright
A lightweight Python library for simulating Chinese handwriting
Language:Python2.1k 18 42250
X-PLUG/mPLUG-DocOwl
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
Language:Python2k 33 125116
Yuliang-Liu/Monkey
【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
Language:Python1.9k 22 149132
pyecharts/pyecharts-gallery
Just use pyecharts to imitate Echarts official example.
Language:HTML1.2k 29 97589
KangLiao929/Awesome-Deep-Camera-Calibration
Deep Learning for Camera Calibration and Beyond: A Survey
625 35 562
hsfzxjy/handwriter.ttf
Handwriting synthesis with Harfbuzz WASM.
Language:Rust438 3 112
fh2019ustc/Awesome-Document-Image-Rectification
A comprehensive list of awesome document image rectification papers.
382 13 429
alvinwan/TexSoup
fault-tolerant Python3 package for searching, navigating, and modifying LaTeX documents
Language:Python294 9 10843
ymy-k/Hi-SAM
[TPAMI'24] Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation
Language:Python229 12 2116
xinke-wang/Awesome-Text-VQA
189 10 311
UniModal4Reasoning/DocGenome
DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models
Language:Jupyter Notebook146 5 85
pdf-association/pdf-corpora
An index of PDF-centric corpora
116 12 28
Kamino666/watermark-tracer
一个基于可视水印检测识别的数字媒体溯源应用系统，是我的大作业项目，包含这个系统以及一个开源的大规模常见水印图像数据集（Large-scale Common Watermark Dataset, LCWD）。输入一个带有可视水印的图片或视频，系统会检测定位到水印所在的区域，然后将其提取出来，然后借助百度AI开放平台的OCR和logo识别以及Bing搜索引擎，溯源到这个图片或视频的源头。
Language:Python108 4 714
mxin262/ESTextSpotter
(ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer
Language:Python72 3 227
mxin262/Bridging-Text-Spotting
(CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.
Language:Python51 2 121
bytedance/E2STR
The official code for the CVPR 2024 paper: Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer
Language:Python47 6 44
shannanyinxiang/UPOCR
Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)
Language:Python43 2 44
Yuliang-Liu/Open-Oracle
AI-assisted Deciphering Oracle Bone Script
41 3 00
bzluan/TextCoT
The official repo for “TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding”.
Language:Python38 3 54
ZZZHANG-jx/Awesome-Document-Image-Rectification
A comprehensive list of awesome document image rectification papers.
2 0 00
mxin262/Monkey
【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
Language:Python1 0 00

zhangjx123

zhangjx123's Stars

abi/screenshot-to-code

hpcaitech/Open-Sora

pyecharts/pyecharts

BradyFU/Awesome-Multimodal-Large-Language-Models

lukas-blecher/LaTeX-OCR

facebookresearch/nougat

LargeWorldModel/LWM

mit-han-lab/efficientvit

MhLiao/DB

Gsllchb/Handright

X-PLUG/mPLUG-DocOwl

Yuliang-Liu/Monkey

pyecharts/pyecharts-gallery

KangLiao929/Awesome-Deep-Camera-Calibration

hsfzxjy/handwriter.ttf

fh2019ustc/Awesome-Document-Image-Rectification

alvinwan/TexSoup

ymy-k/Hi-SAM

xinke-wang/Awesome-Text-VQA

UniModal4Reasoning/DocGenome

pdf-association/pdf-corpora

Kamino666/watermark-tracer

mxin262/ESTextSpotter

mxin262/Bridging-Text-Spotting

bytedance/E2STR

shannanyinxiang/UPOCR

Yuliang-Liu/Open-Oracle

bzluan/TextCoT

ZZZHANG-jx/Awesome-Document-Image-Rectification

mxin262/Monkey