jaykabra's Stars
tesseract-ocr/tesseract
Tesseract Open Source OCR Engine (main repository)
deepset-ai/haystack
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
LonamiWebs/Telethon
Pure Python 3 MTProto API Telegram client library, for bots too!
tabulapdf/tabula
Tabula is a tool for liberating data tables trapped inside PDF files
bradtraversy/react-crash-2021
Task tracking application from the React crash course
dataprofessor/code
Compilation of R and Python programming codes on the Data Professor YouTube channel.
AlessandroCorradini/University-of-California-San-Diego-Big-Data-Specialization
Repository for the Big Data Specialization from University of California San Diego on Coursera
kstathou/vector_engine
Build a semantic search engine with Transformers and Faiss
larrybotha/sql-for-data-science
Notes, annotations, and exercises from Coursera's SQL for Data Science course: https://www.coursera.org/learn/sql-for-data-science