pai-sr's Stars
FareedKhan-dev/Building-llama3-from-scratch
LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
NomaDamas/awesome-korean-llm
Awesome list of Korean Large Language Models.
docker/genai-stack
Langchain + Docker + Neo4j + Ollama
Filimoa/open-parse
Improved file parsing for LLM’s
ai2-ner-project/pytorch-ko-ner
PLM 기반 한국어 개체명 인식 (NER)
Marker-Inc-Korea/AutoRAG
AutoML tool for RAG
teddylee777/langchain-kr
LangChain 공식 Document, Cookbook, 그 밖의 실용 예제를 바탕으로 작성한 한국어 튜토리얼입니다. 본 튜토리얼을 통해 LangChain을 더 쉽고 효과적으로 사용하는 방법을 배울 수 있습니다.
HeegyuKim/open-korean-instructions
언어모델을 학습하기 위한 공개 한국어 instruction dataset들을 모아두었습니다.
NaverCloudPlatform/android-ai-sample
microsoft/OCR-Form-Tools
A set of tools to use in Microsoft Azure Form Recognizer and OCR services.
hoya012/deep_learning_object_detection
A paper list of object detection using deep learning.
JaidedAI/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
JiaquanYe/TableMASTER-mmocr
2nd solution of ICDAR 2021 Competition on Scientific Literature Parsing, Task B.
PaddleOCR-Community/Dive-into-OCR
“Dive Into OCR” is a textbook developed by the PaddleOCR community that integrates OCR theory and practice.
philschmid/document-ai-transformers
katanaml/sparrow
Data processing with ML and LLM
clovaai/donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
clovaai/deep-text-recognition-benchmark
Text recognition (optical character recognition) with deep learning methods, ICCV 2019
Wongi-Choi1014/Korean-OCR-Model-Design-based-on-Keras-CNN
Korean OCR Model Design(한글 OCR 모델 설계)
doc-analysis/TableBank
TableBank: A Benchmark Dataset for Table Detection and Recognition
open-mmlab/mmdetection
OpenMMLab Detection Toolbox and Benchmark
DevashishPrasad/CascadeTabNet
This repository contains the code and implementation details of the CascadeTabNet paper "CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents"
Roll-Face/table_extraction
extract information from tubular data
jsvine/pdfplumber
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
microsoft/table-transformer
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.
zacharywhitley/awesome-ocr
mindee/doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
google/winops
Small scripts and libraries for managing Windows in a corporate environment.
Justin-Lund/PS-Remote-Support
PowerShell Script for Remote Support & Administration