AbsoluteML

AbsoluteML's Stars

stitionai/blockoli
Blockoli is a high-performance tool for code indexing, embedding generation and semantic search tool for use with LLMs.
Language:Rust7612
yizhongw/self-instruct
Aligning pretrained language models with instruction data generated by themselves.
Language:Python4k463
pdfminer/pdfminer.six
Community maintained fork of pdfminer - we fathom PDF
Language:Python5.7k906
camelot-dev/excalibur
A web interface to extract tabular data from PDFs
Language:HTML1.5k221
atlanhq/camelot
Camelot: PDF Table Extraction for Humans
Language:Python3.6k350
jorisschellekens/borb
borb is a library for reading, creating and manipulating PDF files in python.
Language:Python3.3k148
flxzt/rnote
Sketch and take handwritten notes.
Language:Rust6.5k222
Wandmalfarbe/pandoc-latex-template
A pandoc LaTeX template to convert markdown files to PDF or LaTeX.
Language:TeX6k955
h2oai/h2ogpt
Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://codellama.h2o.ai/
Language:Python11k1.2k
ocrmypdf/OCRmyPDF
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Language:Python13.1k965
mayooear/gpt4-pdf-chatbot-langchain
GPT4 & LangChain Chatbot for large PDF docs
Language:TypeScript14.8k3k
paperless-ngx/paperless-ngx
A community-supported supercharged version of paperless: scan, index and archive all your physical documents
Language:Python18.2k988
Stirling-Tools/Stirling-PDF
#1 Locally hosted web application that allows you to perform various operations on PDF files
Language:Java30.8k2.3k
py-pdf/pypdf
A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
Language:Python7.8k1.4k
microsoft/table-transformer
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.
Language:Python2.1k235
jsvine/pdfplumber
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
Language:Python6.1k623
ggerganov/llama.cpp
LLM inference in C/C++
Language:C++61.9k8.9k
arcee-ai/mergekit
Tools for merging pretrained large language models.
Language:Python4.1k359
jessevig/bertviz
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
Language:Python6.6k759
amazon-science/chronos-forecasting
Chronos: Pretrained (Language) Models for Probabilistic Time Series Forecasting
Language:Python2.1k246
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Language:Jupyter Notebook14.2k1.3k
xinyu1205/recognize-anything
Open-source and strong foundation image recognition models.
Language:Jupyter Notebook2.6k247
IDEA-Research/GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Language:Python5.7k604
visual-layer/fastdup
fastdup is a powerful free tool designed to rapidly extract valuable insights from your image & video datasets. Assisting you to increase your dataset images & labels quality and reduce your data operations costs at an unparalleled scale.
Language:Python1.5k74
Docta-ai/docta
A Doctor for your data
Language:Python3.1k189
voxel51/fiftyone
The open-source tool for building high-quality datasets and computer vision models
Language:Python7.9k521
xai-org/grok-1
Grok open release
Language:Python49.2k8.3k
cuda-mode/resource-stream
CUDA related news and material links
97360
cuda-mode/lectures
Material for cuda-mode lectures
Language:Jupyter Notebook1.9k179
lucidrains/x-transformers
A simple but complete full-attention transformer with a set of promising experimental features from various papers
Language:Python4.4k373