mindofsteel's Stars
yigitkonur/swift-ocr-llm-powered-pdf-to-markdown
An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing and batching to deliver high-quality text extraction from complex PDF documents. Ideal for businesses seeking efficient document digitization and data extraction solutions.
langwatch/langwatch
The ultimate LLM Ops platform - Monitoring, Analytics, Evaluations, Datasets and Prompt Optimization ✨
orasik/parsevision
Parse vision is an open source tool to visualise what OCR is parsing in a PDF document to help developers and product teams identify if the parsing has missed some vital information from the document.
sindresorhus/awesome
😎 Awesome lists about all kinds of interesting topics
prakhar1989/awesome-courses
:books: List of awesome university courses for learning Computer Science!
hay-kot/homebox
Homebox is the inventory and organization system built for the Home User
DigitalPhonetics/IMS-Toucan
Controllable and fast Text-to-Speech for over 7000 languages!
r2d4/osrs-ocr
topoteretes/cognee
Reliable LLM Memory for AI Applications and AI Agents
nerority/Prompt-Engineering-Mastery
anthropics/anthropic-cookbook
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.
intellectronica/text-clustering-embedding-vs-prompting
Demo of text clustering using two techniques: learning clusters from their embeddings, and prompting an LLM to do the work for us
tunaflsh/summarizer
Summarizes texts, videos and audios recursively. Allows custom prompts.
VikParuchuri/surya
OCR, layout analysis, reading order, table recognition in 90+ languages
VikParuchuri/marker
Convert PDF to markdown + JSON quickly with high accuracy
aurelio-labs/semantic-router
Superfast AI decision making and intelligent processing of multi-modal data.
for-ai/aya-annotations-ui
Web UI & Backend for Data Annotations in Aya
Marker-Inc-Korea/AutoRAG
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
shigapov/wikibase-knowledge-graphs
A collection of open source tools and resources related to Wikibase knowledge graphs
nateburley/WikiKnowledgeGraphs
Knowledge Graph constructed from Wikipedia
embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark
argmaxinc/WhisperKit
On-device Speech Recognition for Apple Silicon
espnet/espnet
End-to-End Speech Processing Toolkit
chentong0/factoid-wiki
Dense X Retrieval: What Retrieval Granularity Should We Use?
JayZeeDesign/research-agents-3.0
Autogen + GPTs - build a swarm AI researchers
kadirnar/whisper-plus
WhisperPlus: Faster, Smarter, and More Capable 🚀
microsoft/LLMLingua
[EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.
vidstack/captions
Modern media captions parser and renderer (~5kB). Supports VTT, SRT, and SSA. Works server side, supports text streams, rollup captions via VTT regions, customization via CSS, and more.
linto-ai/whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
VikParuchuri/texify
Math OCR model that outputs LaTeX and markdown