yichen2017's Stars
datawhalechina/leedl-tutorial
《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases
obss/sahi
Framework agnostic sliced/tiled inference + interactive ui + error analysis plots
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
philschmid/document-ai-transformers
clovaai/donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
RapidAI/RapidStructure
版面分析 | 表格识别 | 文档方向分类
breezedeus/Pix2Text
An Open-Source Python3 tool for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported.
bin123apple/AutoCoder
We introduced a new model designed for the Code generation task. Its test accuracy on the HumanEval base dataset surpasses that of GPT-4 Turbo (April 2024) and GPT-4o.
TMElyralab/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
VikParuchuri/marker
Convert PDF to markdown quickly with high accuracy
polyaxon/haupt
Lineage metadata API, artifacts streams, sandbox, API, and spaces for Polyaxon
ShusenTang/Dive-into-DL-PyTorch
本项目将《动手学深度学习》(Dive into Deep Learning)原书中的MXNet实现改为PyTorch实现。
lansinuote/Simple_PyTorch2
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
tabulapdf/tabula
Tabula is a tool for liberating data tables trapped inside PDF files
wenwenyu/MASTER-pytorch
Code for the paper "MASTER: Multi-Aspect Non-local Network for Scene Text Recognition" (Pattern Recognition 2021)
ZZR8066/SEMv2
MathamPollard/awesome-table-structure-recognition
A Curated List of Awesome Table Structure Recognition (TSR) Research. Including models, papers, datasets and codes. Continuously updating.
unit-mesh/auto-dev-vscode
AutoDev - 🧙the AI-powered coding wizard . Put the most loved AutoDev AI assistant into your VSCode, and have things done quickly
DevashishPrasad/CascadeTabNet
This repository contains the code and implementation details of the CascadeTabNet paper "CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents"
PaddlePaddle/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
JiaquanYe/TableMASTER-mmocr
2nd solution of ICDAR 2021 Competition on Scientific Literature Parsing, Task B.
lahoud/3d-vision-transformers
A list of 3D computer vision papers with Transformers
blackinkkkxi/RAG_langchain
一个基于langchain实现RAG的简单示例
Filimoa/open-parse
Improved file parsing for LLM’s
carlrobertoh/CodeGPT
The leading open-source AI copilot for JetBrains. Connect to any model in any environment, and customize your coding experience in any way you like.
AlibabaResearch/AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
CASIA-IVA-Lab/AnomalyGPT
[AAAI 2024 Oral] AnomalyGPT: Detecting Industrial Anomalies Using Large Vision-Language Models
opendatalab/WanJuan1.0
万卷1.0多模态语料
xiaomabenten/system-analysts
💯2024年系统分析师(软考高级)备考资源库。