cimentadaj
Build AI tools freelancing. Previously @ryanair, @odigeoteam and @MPIDR
Senior Data ScientistMadrid
cimentadaj's Stars
Mintplex-Labs/anything-llm
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
mindsdb/mindsdb
AGI's query engine - Platform for building AI that can learn and answer questions over federated data.
unclecode/crawl4ai
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper
mendableai/firecrawl
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Cinnamon/kotaemon
An open-source RAG-based tool for chatting with your documents.
ScrapeGraphAI/Scrapegraph-ai
Python scraper based on AI
pgvector/pgvector
Open-source vector similarity search for Postgres
pathwaycom/pathway
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
pathwaycom/llm-app
Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳Docker-friendly.⚡Always in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.
NirDiamant/RAG_Techniques
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and contextually rich responses.
Doriandarko/claude-engineer
Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks.This framework enables Claude to generate and manage its own tools, continuously expanding its capabilities through conversation. Available both as a CLI and a modern web interface
subzeroid/instagrapi
🔥 The fastest and powerful Python library for Instagram Private API 2025 with HikerAPI SaaS
SciPhi-AI/R2R
The most advanced AI retrieval system. Containerized, Retrieval-Augmented Generation (RAG) with a RESTful API.
briefercloud/briefer
Dashboards and notebooks in a single place. Create powerful and flexible dashboards using code, or build beautiful Notion-like notebooks and share them with your team.
midday-ai/v1
An open-source starter kit based on Midday.
Marker-Inc-Korea/AutoRAG
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
mishushakov/llm-scraper
Turn any webpage into structured data using LLMs
omkarcloud/botasaurus
The All in One Framework to build Awesome Scrapers.
KruxAI/ragbuilder
A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data
oxylabs/google-maps-scraper
Google Maps Scraper for collecting data from various Google Maps listings, including business profiles.
ulixee/hero
The web browser built for scraping
daijro/camoufox
🦊 Anti-detect browser
2captcha/2captcha-python
Python 3 package for easy integration with the API of 2captcha captcha solving service to bypass recaptcha, сloudflare turnstile, funcaptcha, geetest and solve any other captchas.
AI-Commandos/RAGMeUp
Generic rag framework to apply the power of LLMs on any given dataset
daijro/browserforge
🎭 Intelligent browser header & fingerprint generator
Aavache/LLMWebCrawler
A Web Crawler based on LLMs implemented with Ray and Huggingface. The embeddings are saved into a vector database for fast clustering and retrieval. Use it for your RAG.
AnswerDotAI/bert24
ispras/scrapy-puppeteer
Library that helps use puppeteer in scrapy.
khalilbenkhaled/yaraa
Yaraa (Yet Another Rag Automation Attempt) is a library that tackles the boring aspects of managing Rag pipelines, so you don't have to.