web-scraping-python

There are 315 repositories under web-scraping-python topic.

scrapy/scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
Language:Python58.9k 1.8k 3.2k11.1k
seleniumbase/SeleniumBase
Python APIs for web automation, testing, and bypassing bot-detection with ease.
Language:Python11.9k 163 1.9k1.4k
D4Vinci/Scrapling
🕷️ An undetectable, powerful, flexible, high-performance Python library to make Web Scraping Easy and Effortless as it should be!
Language:Python8.1k 50 44464
omkarcloud/botasaurus
The All in One Framework to Build Undefeatable Scrapers
Language:Python3.2k 26 204264
oxylabs/how-to-scrape-amazon-product-data
The process of extracting product data from Amazon using Python, including titles, ratings, prices, images, and descriptions.
1.4k 1 03
oxylabs/oxylabs-ai-studio-py
Structured data gathering from any website using AI-powered scraper, crawler, and browser automation. Scraping and crawling with natural language prompts. Equip your LLM agents with fresh data. AI Studio python SDK for intelligent web data gathering.
Language:Python1.3k 4 09
tinyfish-io/agentql
AgentQL is a suite of tools for connecting your AI to the web. Featuring a query language and Playwright integrations for interacting with elements and extracting data quickly, precisely, and at scale. Includes REST API, Python and JavaScript SDKs, browser debugger.
Language:Python1k 20 8127
0x676e67/rnet
An ergonomic Python HTTP Client with TLS fingerprint
Language:Rust997 9 10879
scrapfly/scrapfly-scrapers
Scalable Python web scraping scripts for +40 popular domains
Language:Python749 15 23160
davidteather/everything-web-scraping
Learn everything web scraping with David Teather Codes on YouTube
Language:HTML433 4 586
oxylabs/Python-Web-Scraping-Tutorial
In this Python Web Scraping Tutorial, we will outline everything needed to get started with web scraping. We will begin with simple examples and move on to relatively more complex.
Language:Python298 1 032
drshahizan/python-web
This topic explains how to implement web scraping and python web development. Web scraping topics such as scrapy, beautiful soup, and others will be covered. A case study based on a Malaysian website.
Language:Jupyter Notebook128 3 065
thewebscraping/tls-requests
TLS Requests is a powerful Python library for secure HTTP requests, offering browser-like TLS client, fingerprinting, anti-bot page bypass, and high performance.
Language:Python108 2 219
oxylabs/web-scraping-google-sheets
Guide to Using Google Sheets for Basic Web Scraping
83 1 03
DataCrawl-AI/datacrawl
A simple and easy to use web crawler for Python
Language:Python64 8 2211
vishwajeetdabholkar/eGet-Crawler-for-ai
Web scraping framework built for AI applications. Extract clean, structured content from any website with dynamic content handling, markdown conversion, and intelligent crawling capabilities. Perfect for RAG applications and AI training data pipelines. Features async processing, browser management, and Prometheus monitoring.
Language:Python47 2 015
mike-gee/webtranspose
Web scraping API for building AI applications.
Language:Python40 1 42
GoncaloMark/CobWeb-lnx
CobWeb is a Python library for web scraping. The library consists of two classes: Spider and Scraper.
Language:Python38 2 12
mhmdkardosha/CAT-Reloaded-2025-Data-Science-Roadmap
Roadmap for Data Science circle associated with CAT Reloaded.
342
Decodo/Python-scraper-tutorial
A short introduction to scraping with Python with given steps and an example scraper script.
Language:Python32 2 16
raymelon/tagalog-dictionary-scraper
Builds a Tagalog dictionary by collecting Tagalog words from tagalog.pinoydictionary.com
Language:Python30 0 216
kvcops/Deep-Research-using-Gemini-api
AI-powered deep research tool leveraging web scraping for cost-effective, comprehensive analysis. Open-source and API-cost free!
Language:HTML21 1 14
Narenpradhan/WatchTower
WatchTower - A platform to save your valuable time while staying updated in the Cyber realm.
Language:Python18 1 01
PB2204/Covid-19
This Is A Web Scraping Projects With Covid-19 Data From 2 Very Popular & Authentic Websites
Language:Jupyter Notebook18 1 0
Elmehdi9/web-scraping-projects
This repository provides various web scraping projects in Jupyter notebooks for both learning and data-related workshopes
Language:Jupyter Notebook13 1 02
FirasKahlaoui/news-headlines-tracker
The News Headlines Tracker application collects the latest news headlines from major news sources such as CNN, BBC, and The New York Times.
Language:Python12 0 0
e-hengirmen/metu-NTE-scraper
Language:Python11 1 42
irfanalidv/trustpilot_scraper
A Python library for scraping Trustpilot reviews.
Language:Python11 1 09
sarperavci/kick-unofficial-api
🛡️ Unofficial Kick.com API wrapper with automatic bypass protection.
Language:Python10 1 14
boo283/Facebook_comment_crawler
The Facebook Comments Crawler is an unofficial tool for extracting comments from Facebook posts using Selenium in Python. It's designed to aid in academic and personal research. #Facebook comments scaper #Facebook comments crawler
Language:Python9 1 01
JaydeepAgravat/SmartCode
Scraping LeetCode data, analyzing for insights, crafting a user-friendly dashboard, and building a problem recommender for optimized problem-solving.
Language:Jupyter Notebook8 1 02
lombardo-luca/LePrAn
Letterboxd Profile Analyzer (LePrAn) is a simple tool to see statistics about your letterboxd.com profile.
Language:Python8 1 01
gayanukabulegoda/Web-Scraping-Starter-Kit
Repository designed to help freshers easily grasp the basics of web scripting, offering simple guides and examples to build a strong foundation.
Language:Python7 1 0
Mindful-AI-Assistants/SP2024-Election-Analysis
📊 An analysis of voting patterns in São Paulo's 2024 elections, focusing on voter behavior, absenteeism, and geographic trends.
Language:HTML7 1 233
odevjorge/instagram-post-fetcher
"instagram-post-fetcher" is a Python module leveraging Selenium to extract Instagram post details, including account username, descriptions, media URLs, and post timestamps. Simplifying access to Instagram data for analytics and research.
Language:Python7 1 10
oxylabs/asynchronous-web-scraping-python
A comparison of asynchronous and synchronous web scraping methods with practical examples.
Language:Python7 1 0

web-scraping-python

scrapy/scrapy

seleniumbase/SeleniumBase

D4Vinci/Scrapling

omkarcloud/botasaurus

oxylabs/how-to-scrape-amazon-product-data

oxylabs/oxylabs-ai-studio-py

tinyfish-io/agentql

0x676e67/rnet

scrapfly/scrapfly-scrapers

davidteather/everything-web-scraping

oxylabs/Python-Web-Scraping-Tutorial

drshahizan/python-web

thewebscraping/tls-requests

oxylabs/web-scraping-google-sheets

DataCrawl-AI/datacrawl

vishwajeetdabholkar/eGet-Crawler-for-ai

mike-gee/webtranspose

GoncaloMark/CobWeb-lnx

mhmdkardosha/CAT-Reloaded-2025-Data-Science-Roadmap

Decodo/Python-scraper-tutorial

raymelon/tagalog-dictionary-scraper

kvcops/Deep-Research-using-Gemini-api

Narenpradhan/WatchTower

PB2204/Covid-19

Elmehdi9/web-scraping-projects

FirasKahlaoui/news-headlines-tracker

e-hengirmen/metu-NTE-scraper

irfanalidv/trustpilot_scraper

sarperavci/kick-unofficial-api

boo283/Facebook_comment_crawler

JaydeepAgravat/SmartCode

lombardo-luca/LePrAn

gayanukabulegoda/Web-Scraping-Starter-Kit

Mindful-AI-Assistants/SP2024-Election-Analysis

odevjorge/instagram-post-fetcher

oxylabs/asynchronous-web-scraping-python