chinaphilip's Stars
facebookresearch/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
open-mmlab/mmdetection
OpenMMLab Detection Toolbox and Benchmark
opendatalab/MinerU
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
Skyvern-AI/skyvern
Automate browser-based workflows with LLMs and Computer Vision
NirDiamant/RAG_Techniques
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and contextually rich responses.
bentoml/OpenLLM
Run any open-source LLMs, such as Llama, Mistral, as OpenAI compatible API endpoint in the cloud.
jina-ai/reader
Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Ucas-HaoranWei/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
THUDM/CogVLM
a state-of-the-art-level open visual language model | 多模态预训练模型
wdndev/llm_interview_note
主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题
wainshine/Chinese-Names-Corpus
中文人名语料库。人名生成器。中文姓名,姓氏,名字,称呼,日本人名,翻译人名,英文人名。可用于中文分词、人名实体识别。
InternLM/InternLM-XComposer
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
automeris-io/WebPlotDigitizer
Computer vision assisted tool to extract numerical data from plot images.
median-research-group/LibMTL
A PyTorch Library for Multi-Task Learning
Ucas-HaoranWei/Vary
[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.
noamgat/lm-format-enforcer
Enforce the output format (JSON Schema, Regex etc) of a language model
google-research/pix2seq
Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)
quqxui/Awesome-LLM4IE-Papers
Awesome papers about generative Information Extraction (IE) using Large Language Models (LLMs)
opendatalab/DocLayout-YOLO
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
google-research/pix2struct
teacherpeterpan/self-correction-llm-papers
This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback.
hitz-zentroa/GoLLIE
Guideline following Large Language Model for Information Extraction
opendatalab/UniMERNet
UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition
LingyvKong/OneChart
[ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"
shannanyinxiang/SPTS
Official implementation of SPTS: Single-Point Text Spotting (ACM MM 2022 Oral)
1694439208/GOT-OCR-Inference
研究GOT-OCR-项目落地加速,不限语言
Ucas-HaoranWei/Vary-family
alan-tsang/overleaf-latex-chinese-english-general-template
一个overleaf latex中英文通用模板,元素丰富,适合latex入门;An Overleaf LaTeX Chinese and English universal template, rich in elements, suitable for getting started with LaTeX;
shannanyinxiang/UPOCR
Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)