absoluteml's Stars
ggerganov/llama.cpp
LLM inference in C/C++
xai-org/grok-1
Grok open release
Stirling-Tools/Stirling-PDF
#1 Locally hosted web application that allows you to perform various operations on PDF files
paperless-ngx/paperless-ngx
A community-supported supercharged version of paperless: scan, index and archive all your physical documents
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
mayooear/gpt4-pdf-chatbot-langchain
GPT4 & LangChain Chatbot for large PDF docs
ocrmypdf/OCRmyPDF
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
h2oai/h2ogpt
Private chat with local GPT with documents, images, video, etc. 100% private, Apache 2.0. Supports Ollama, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/
voxel51/fiftyone
Refine high-quality datasets and visual AI models
py-pdf/pypdf
A pure-Python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
flxzt/rnote
Sketch and take handwritten notes.
jessevig/bertviz
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
jsvine/pdfplumber
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
IDEA-Research/GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Wandmalfarbe/pandoc-latex-template
A pandoc LaTeX template to convert markdown files to PDF or LaTeX.
pdfminer/pdfminer.six
Community-maintained fork of pdfminer - we fathom PDF
lucidrains/x-transformers
A concise but complete full-attention transformer with a set of promising experimental features from various papers
arcee-ai/mergekit
Tools for merging pretrained large language models.
yizhongw/self-instruct
Aligning pretrained language models with instruction data generated by themselves.
atlanhq/camelot
Camelot: PDF Table Extraction for Humans
jorisschellekens/borb
borb is a library for reading, creating and manipulating PDF files in Python.
Docta-ai/docta
A Doctor for your data
xinyu1205/recognize-anything
Open-source and strong foundation image recognition models.
cuda-mode/lectures
Material for cuda-mode lectures
amazon-science/chronos-forecasting
Chronos: Pretrained (Language) Models for Probabilistic Time Series Forecasting
microsoft/table-transformer
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.
visual-layer/fastdup
fastdup is a powerful, free tool designed to rapidly generate valuable insights from image and video datasets. It helps enhance the quality of both images and labels, while significantly reducing data operation costs, all with unmatched scalability.
camelot-dev/excalibur
A web interface to extract tabular data from PDFs
cuda-mode/resource-stream
CUDA-related news and material links
getAsterisk/blockoli
Blockoli is a high-performance tool for code indexing, embedding generation, and semantic search for use with LLMs.