AbsoluteML's Stars
stitionai/blockoli
Blockoli is a high-performance tool for code indexing, embedding generation and semantic search tool for use with LLMs.
yizhongw/self-instruct
Aligning pretrained language models with instruction data generated by themselves.
pdfminer/pdfminer.six
Community maintained fork of pdfminer - we fathom PDF
camelot-dev/excalibur
A web interface to extract tabular data from PDFs
atlanhq/camelot
Camelot: PDF Table Extraction for Humans
jorisschellekens/borb
borb is a library for reading, creating and manipulating PDF files in python.
flxzt/rnote
Sketch and take handwritten notes.
Wandmalfarbe/pandoc-latex-template
A pandoc LaTeX template to convert markdown files to PDF or LaTeX.
h2oai/h2ogpt
Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://codellama.h2o.ai/
ocrmypdf/OCRmyPDF
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
mayooear/gpt4-pdf-chatbot-langchain
GPT4 & LangChain Chatbot for large PDF docs
paperless-ngx/paperless-ngx
A community-supported supercharged version of paperless: scan, index and archive all your physical documents
Stirling-Tools/Stirling-PDF
#1 Locally hosted web application that allows you to perform various operations on PDF files
py-pdf/pypdf
A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
microsoft/table-transformer
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.
jsvine/pdfplumber
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
ggerganov/llama.cpp
LLM inference in C/C++
arcee-ai/mergekit
Tools for merging pretrained large language models.
jessevig/bertviz
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
amazon-science/chronos-forecasting
Chronos: Pretrained (Language) Models for Probabilistic Time Series Forecasting
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
xinyu1205/recognize-anything
Open-source and strong foundation image recognition models.
IDEA-Research/GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
visual-layer/fastdup
fastdup is a powerful free tool designed to rapidly extract valuable insights from your image & video datasets. Assisting you to increase your dataset images & labels quality and reduce your data operations costs at an unparalleled scale.
Docta-ai/docta
A Doctor for your data
voxel51/fiftyone
The open-source tool for building high-quality datasets and computer vision models
xai-org/grok-1
Grok open release
cuda-mode/resource-stream
CUDA related news and material links
cuda-mode/lectures
Material for cuda-mode lectures
lucidrains/x-transformers
A simple but complete full-attention transformer with a set of promising experimental features from various papers