web-scraping-python

There are 203 repositories under web-scraping-python topic.

  • metu-NTE-scraper

    Language:Python10
  • zameen-com-scrapper

    Language:Python4
  • SeleniumBase

    SeleniumBase

    Python APIs for web automation, testing, and bypassing bot-detection.

    Language:Python8.1k
  • Scrapling

    Scrapling

    Undetectable, Lightning-Fast, and Adaptive Web Scraping for Python

    Language:Python1.8k
  • botasaurus

    botasaurus

    The All in One Framework to build Awesome Scrapers.

    Language:Python1.6k
  • agentql

    AgentQL is an AI-powered query language for web scraping and automation. It uses natural language selectors to find data on any page, including authenticated content. AgentQL queries are self-healing as UI changes and work across similar sites. Users can define structured data output, making AgentQL versatile for developers and data scientists.

    Language:Python376
  • scrapfly-scrapers

    Scalable Python web scraping scripts for +40 popular domains

    Language:Python349
  • everything-web-scraping

    Learn everything web scraping with David Teather Codes on YouTube

    Language:HTML346
  • Python-Web-Scraping-Tutorial

    Python-Web-Scraping-Tutorial

    In this Python Web Scraping Tutorial, we will outline everything needed to get started with web scraping. We will begin with simple examples and move on to relatively more complex.

    Language:Python276
  • python-web

    This topic explains how to implement web scraping and python web development. Web scraping topics such as scrapy, beautiful soup, and others will be covered. A case study based on a Malaysian website.

    Language:Jupyter Notebook108
  • datacrawl

    A simple and easy to use web crawler for Python

    Language:Python60
  • webtranspose

    Web scraping API for building AI applications.

    Language:Python41
  • CobWeb-lnx

    CobWeb is a Python library for web scraping. The library consists of two classes: Spider and Scraper.

    Language:Python38
  • eGet-Crawler-for-ai

    Web scraping framework built for AI applications. Extract clean, structured content from any website with dynamic content handling, markdown conversion, and intelligent crawling capabilities. Perfect for RAG applications and AI training data pipelines. Features async processing, browser management, and Prometheus monitoring.

    Language:Python29
  • Python-scraper-tutorial

    A short introduction to scraping with Python with given steps and an example scraper script.

    Language:Python27
  • CAT-Reloaded-2025-Data-Science-Roadmap

    Roadmap for Data Science circle associated with CAT Reloaded.

  • WatchTower

    WatchTower - A platform to save your valuable time while staying updated in the Cyber realm.

    Language:Python19
  • Covid-19

    This Is A Web Scraping Projects With Covid-19 Data From 2 Very Popular & Authentic Websites

    Language:Jupyter Notebook18
  • web-scraping-projects

    This repository provides various web scraping projects in Jupyter notebooks for both learning and data-related workshopes

    Language:Jupyter Notebook13
  • news-headlines-tracker

    The News Headlines Tracker application collects the latest news headlines from major news sources such as CNN, BBC, and The New York Times.

    Language:Python11
  • compodio

    compodio

    Putting the podcast in community radio

    Language:Python7
  • Web-Scraping-Starter-Kit

    Web-Scraping-Starter-Kit

    Repository designed to help freshers easily grasp the basics of web scripting, offering simple guides and examples to build a strong foundation.

    Language:Python6
  • Facebook_comment_crawler

    The Facebook Comments Crawler is an unofficial tool for extracting comments from Facebook posts using Selenium in Python. It's designed to aid in academic and personal research. #Facebook comments scaper #Facebook comments crawler

    Language:Python6
  • instagram-post-fetcher

    "instagram-post-fetcher" is a Python module leveraging Selenium to extract Instagram post details, including account username, descriptions, media URLs, and post timestamps. Simplifying access to Instagram data for analytics and research.

    Language:Python6
  • web_scrap_and_analysis

    Extracted text from blogs by "insights.blackcoffer.com" using BeautifulSoup and sentiment is analyzed using pandas module.

    Language:Jupyter Notebook6
  • asynchronous-web-scraping-python

    A comparison of asynchronous and synchronous web scraping methods with practical examples.

    Language:Python6
  • Web-Scraping

    All about scraping domains from the 'World Wide Web'

    Language:Python6
  • gitpod-selenium

    Run Python Selenium in GitPod

    Language:Dockerfile5
  • LePrAn

    Letterboxd Profile Analyzer (LePrAn) is a simple tool to see statistics about your letterboxd.com profile.

    Language:Python5
  • web-scraping-google-sheets

    web-scraping-google-sheets

    Guide to Using Google Sheets for Basic Web Scraping

  • Web-Scraping-Projects

    Drinking coffee is my second favorite thing to do, web scraping will always be first.

    Language:Python4
  • mobilehouse

    Scraped news data using Scrapy and make a pipeline to push data in PostgreSQL database.

    Language:Python4
  • desktop-webpage-text-crawler

    This desktop GUI will index, format and create .txt files from the text content from webpages you request, so long as HTML or JSON is sent as a response. You can crawl sites as single pages, crawl all internal links on a page, or crawl all links within the page's <nav> tag(s). You can also decide to extract only page titles, the main text content, or all text content from the page. The crawler has some built-in basic error logging.

    Language:Python4
  • SP2024-Election-Analysis

    📊 An analysis of voting patterns in São Paulo's 2024 elections, focusing on voter behavior, absenteeism, and geographic trends.

    Language:Python3
  • trustpilot_scraper

    A Python library for scraping Trustpilot reviews.

    Language:Python3
  • SmartCode

    Scraping LeetCode data, analyzing for insights, crafting a user-friendly dashboard, and building a problem recommender for optimized problem-solving.

    Language:Jupyter Notebook3