pai-sr

pai-sr's Stars

FareedKhan-dev/Building-llama3-from-scratch
LLaMA 3 is one of the most promising open-source models after Mistral; we will recreate its architecture in a simpler manner.
Language:Jupyter Notebook9328
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
Language:Jupyter Notebook13.6k1.1k
NomaDamas/awesome-korean-llm
Awesome list of Korean Large Language Models.
43927
docker/genai-stack
Langchain + Docker + Neo4j + Ollama
Language:Python4k856
Filimoa/open-parse
Improved file parsing for LLMs
Language:Python2.5k98
ai2-ner-project/pytorch-ko-ner
PLM-based Korean named entity recognition (NER)
Language:Python283
Marker-Inc-Korea/AutoRAG
AutoML tool for RAG
Language:Python2.2k172
teddylee777/langchain-kr
A Korean-language tutorial based on the official LangChain documentation, the Cookbook, and other practical examples. Through this tutorial you can learn how to use LangChain more easily and effectively.
Language:Jupyter Notebook1.1k273
HeegyuKim/open-korean-instructions
A collection of publicly available Korean instruction datasets for training language models.
Language:Python35026
NaverCloudPlatform/android-ai-sample
Language:Java1213
microsoft/OCR-Form-Tools
A set of tools to use in Microsoft Azure Form Recognizer and OCR services.
Language:TypeScript518175
hoya012/deep_learning_object_detection
A paper list of object detection using deep learning.
Language:Python11.3k2.8k
JaidedAI/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, and Cyrillic.
Language:Python24.3k3.1k
JiaquanYe/TableMASTER-mmocr
2nd solution of ICDAR 2021 Competition on Scientific Literature Parsing, Task B.
Language:Python434104
PaddleOCR-Community/Dive-into-OCR
“Dive Into OCR” is a textbook developed by the PaddleOCR community that integrates OCR theory and practice.
Language:Jupyter Notebook21759
philschmid/document-ai-transformers
Language:Jupyter Notebook32749
katanaml/sparrow
Data processing with ML and LLM
Language:Python3.6k371
clovaai/donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
Language:Python5.8k472
clovaai/deep-text-recognition-benchmark
Text recognition (optical character recognition) with deep learning methods, ICCV 2019
Language:Jupyter Notebook3.7k1.1k
Wongi-Choi1014/Korean-OCR-Model-Design-based-on-Keras-CNN
Korean OCR Model Design
Language:Python7625
doc-analysis/TableBank
TableBank: A Benchmark Dataset for Table Detection and Recognition
1k141
open-mmlab/mmdetection
OpenMMLab Detection Toolbox and Benchmark
Language:Python29.5k9.4k
DevashishPrasad/CascadeTabNet
This repository contains the code and implementation details of the CascadeTabNet paper "CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents"
Language:Python1.5k427
Roll-Face/table_extraction
Extract information from tabular data
Language:Python7
jsvine/pdfplumber
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
Language:Python6.7k663
microsoft/table-transformer
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.
Language:Python2.3k253
zacharywhitley/awesome-ocr
878108
mindee/doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
Language:Python3.8k433
google/winops
Small scripts and libraries for managing Windows in a corporate environment.
Language:Go10913
Justin-Lund/PS-Remote-Support
PowerShell Script for Remote Support & Administration
Language:PowerShell437