web-scraper

There are 1170 repositories under web-scraper topic.

firecrawl/firecrawl
🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data
Language:TypeScript67k 257 7535.2k
ScrapeGraphAI/Scrapegraph-ai
Python scraper based on AI
Language:Python21.7k 135 4151.9k
getmaxun/maxun
⚡ Easiest no code web data extraction platform • Instantly turn any website into API or spreadsheet ⚡
Language:TypeScript13.8k 78 2741.1k
D4Vinci/Scrapling
🕷️ An undetectable, powerful, flexible, high-performance Python library to make Web Scraping Easy and Effortless as it should be!
Language:Python8.1k 50 44463
BruceDone/awesome-crawler
A collection of awesome web crawler,spider in different languages
7k 201 19733
jaypyles/Scraperr
Self-hosted webscraper.
Language:TypeScript4.7k 9 53239
arpit-omprakash/100ProjectsOfCode
A list of practical knowledge-building projects.
3.5k 103 5319
php-curl-class/php-curl-class
PHP Curl Class makes it easy to send HTTP requests and integrate with web APIs
Language:PHP3.3k 159 407819
gosom/google-maps-scraper
scrape data data from Google Maps. Extracts data such as the name, address, phone number, website URL, rating, reviews number, latitude and longitude, reviews,email and more for each place
Language:Go2.4k 20 109314
anaskhan96/soup
Web Scraper in Go, similar to BeautifulSoup
Language:Go2.2k 34 44169
dipu-bd/lightnovel-crawler
Generate and download e-books from online sources.
Language:Python1.9k 39 1.7k378
itsOwen/CyberScraper-2077
A Powerful web scraper powered by LLM | OpenAI, Gemini & Ollama
Language:Python1.9k 11 29176
oxylabs/google-ai-mode-scraper
Scrape Google AI Mode responses without blocks on a large scale.
Language:Java1.5k 2 014
oxylabs/how-to-scrape-amazon-product-data
The process of extracting product data from Amazon using Python, including titles, ratings, prices, images, and descriptions.
1.3k 1 03
juancarlospaco/faster-than-requests
Faster requests on Python 3
Language:Nim1.1k 18 14792
tholian-network/stealth
:rocket: Stealth - Secure, Peer-to-Peer, Private and Automateable Web Browser/Scraper/Proxy
Language:JavaScript1.1k 38 76323
gildas-lormeau/single-file-cli
CLI tool for saving a faithful copy of a complete web page in a single HTML file (based on SingleFile)
Language:JavaScript1k 10 14299
Oshan96/monkey-dl
Bulk download your favourite anime episodes from your favourite anime websites
Language:Python861 22 5874
je-suis-tm/web-scraping
Detailed web scraping tutorials for dummies with financial data crawlers on Reddit WallStreetBets, CME (both options and futures), US Treasury, CFTC, LME, MacroTrends, SHFE and alternative data crawlers on Tomtom, BBC, Wall Street Journal, Al Jazeera, Reuters, Financial Times, Bloomberg, CNN, Fortune, The Economist
Language:Python833 27 11185
postmodern/spidr
A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.
Language:Ruby827 23 65109
k0rnh0li0/onlyfans-dl
OnlyFans content downloader
Language:Python797 56 201227
cassidoo/scrapers
A list of scrapers from around the web.
695 23 3107
oxylabs/how-to-scrape-google-scholar
A guide for extracting titles, authors, and citations from Google Scholar using Python and Oxylabs SERP Scraper API.
Language:Python587 12 18
spekulatius/PHPScraper
A universal web-util for PHP.
Language:PHP572 16 7175
oxylabs/how-to-scrape-amazon-prices
A code for extracting best-selling items, search results, and currently available deals from Amazon using Python and Oxylabs E-Commerce Scraper API.
Language:Python532 7 06
jaebradley/basketball_reference_web_scraper
NBA Stats API via Basketball Reference
Language:HTML517 20 86125
oxylabs/quick-start-guide
Python quick start guides to get the most out of Oxylabs' Web Scraper API free trial.
516 1 13
0x676e67/wreq
An ergonomic Rust HTTP Client with TLS fingerprint
Language:Rust508 3 14063
austinoboyle/scrape-linkedin-selenium
`scrape_linkedin` is a python package that allows you to scrape personal LinkedIn profiles & company pages - turning the data into structured json.
Language:HTML508 25 90167
AlexMathew/scrapple
A framework for creating semi-automatic web content extractors
Language:Python503 23 1741
shaikhsajid1111/social-media-profile-scrapers
Fetch user's data across social media
Language:Python496 17 1781
paulpierre/markdown-crawler
A multithreaded 🕸️ web crawler that recursively crawls a website and creates a 🔽 markdown file for each page, designed for LLM RAG
Language:Python414 6 1150
crwlrsoft/crawler
Library for Rapid (Web) Crawler and Scraper Development
Language:PHP366 4 2213
passivebot/facebook-marketplace-scraper
This repository contains a script to scrape Facebook Marketplace data using Playwright, BeautifulSoup and Streamlit.
Language:Python353 6 4103
lewisdonovan/google-news-scraper
Lightweight scraper for Google News
Language:TypeScript348 9 4268
oxylabs/web-unblocker
Free trial Web Unblocker - an AI-powered proxy solution that can bypass even the most sophisticated anti-bot systems.
Language:Python325 4 050

web-scraper

firecrawl/firecrawl

ScrapeGraphAI/Scrapegraph-ai

getmaxun/maxun

D4Vinci/Scrapling

BruceDone/awesome-crawler

jaypyles/Scraperr

arpit-omprakash/100ProjectsOfCode

php-curl-class/php-curl-class

gosom/google-maps-scraper

anaskhan96/soup

dipu-bd/lightnovel-crawler

itsOwen/CyberScraper-2077

oxylabs/google-ai-mode-scraper

oxylabs/how-to-scrape-amazon-product-data

juancarlospaco/faster-than-requests

tholian-network/stealth

gildas-lormeau/single-file-cli

Oshan96/monkey-dl

je-suis-tm/web-scraping

postmodern/spidr

k0rnh0li0/onlyfans-dl

cassidoo/scrapers

oxylabs/how-to-scrape-google-scholar

spekulatius/PHPScraper

oxylabs/how-to-scrape-amazon-prices

jaebradley/basketball_reference_web_scraper

oxylabs/quick-start-guide

0x676e67/wreq

austinoboyle/scrape-linkedin-selenium

AlexMathew/scrapple

shaikhsajid1111/social-media-profile-scrapers

paulpierre/markdown-crawler

crwlrsoft/crawler

passivebot/facebook-marketplace-scraper

lewisdonovan/google-news-scraper

oxylabs/web-unblocker