ai-scraping

There are 14 repositories under ai-scraping topic.

  • mendableai/firecrawl

    🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

    Language:TypeScript35k1865603.1k
  • Scrapling

    D4Vinci/Scrapling

    🕷️ An undetectable, powerful, flexible, high-performance Python library to make Web Scraping Easy and Effortless as it should be!

    Language:Python7.3k4333410
  • itsOwen/CyberScraper-2077

    A Powerful web scraper powered by LLM | OpenAI, Gemini & Ollama

    Language:Python1.8k1225162
  • raznem/parsera

    Lightweight library for scraping web-sites with LLMs

    Language:Python1.2k161264
  • devflowinc/firecrawl-simple

    ➖ Stripped down, stable version of firecrawl optimized for self-hosting and ease of contribution. Billing logic and AI features are completely removed. Crawl and convert any website into LLM-ready markdown.

    Language:TypeScript51902145
  • mendableai/firecrawl-app-examples

    🔥 This repository contains complete application examples, including websites and other projects, developed using Firecrawl.

    Language:TypeScript2833064
  • ArchiveBox/abx-dl

    ⬇️ A simple all-in-one CLI tool to download EVERYTHING from a URL (like youtube-dl/yt-dlp, forum-dl, gallery-dl, simpler ArchiveBox). 🎭 Uses headless Chrome to get HTML, JS, CSS, images/video/audio/subtitles, PDFs, screenshots, article text, git repos, and more...

    Language:JavaScript83514
  • kaymen99/ai-web-scraper

    AI web scraper built with Crawl4AI for extracting structured leads data from websites.

    Language:Python461011
  • spider-rs/web-crawling-guides

    How to guides on web-crawling or scraping

  • spider-rs/spider-clients

    Python, Javascript, and Rust libraries for the Spider Cloud API.

    Language:Python19398
  • nathabonfim59/md-fetch

    A CLI tool and REST API that converts web content to clean Markdown, bypassing anti-scraping measures using headless browsers. Perfect for AI/LLM applications

    Language:Go2100
  • nsourlos/AI_tools

    AI tools to enhance productivity and automate web-scraping

    Language:Jupyter Notebook110
  • jenslys/skrape-js

    TypeScript/Node.js SDK to easily interact with the skrape.ai API

    Language:TypeScript0100
  • skrapeai/examples

    This repository contains complete application examples, developed using Skrape.ai