scrapping-python

There are 275 repositories under the scrapping-python topic.

  • shaikhsajid1111/social-media-profile-scrapers

    Fetch a user's data across social media

    Language:Python496171781
  • lkuffo/web-scraping

    More than 50 web scraping examples using: Requests | Scrapy | Selenium | LXML | BeautifulSoup

    Language:Python378200216
  • MLArtist/WebScraper

    Python-based web crawling script with randomized intervals, user-agent rotation, and proxy server IP rotation to outsmart website bots and prevent blocking.

    Language:Python841019
  • odaysec/NewsCrap

    NewsCrap is a Command Line Interface (CLI) tool for scraping Google news, designed for research, investigation, and OSINT data collection. With advanced features such as proxy rotation, automatic scheduling, and multi-format export, it makes collecting news data efficient and reliable.

    Language:Python521013
  • vasusharma7/flipkart-grid

    Fashion Intelligence system by Team CodePhreaks26 for Flipkart Grid 2.0

    Language:HTML24104
  • SREEHARI1994/InstagramScraper

    Download photos, reels, and stories from any Instagram account, public or private (that you have access to), to folders on your PC

    Language:Python23244
  • harsh4870/Justdail-scrapper

    A 100% working Justdial scraper: just enter the URL and it will extract the business info from it

    Language:Python211520
  • joaroque/gigabot-plus

    A bot that automatically answers giga unitel questions

    Language:Python21328
  • luismr/the-pudim-hunter

    The Pudim Hunter 🍮 is a Proof of Concept (PoC) tool to scrape job listings from SimplyHired, analyze them against your resume, and assign a relevance score. Get insights into how well each job matches your skills. Automate your job search smarter! 🚀

    Language:Python212
  • edrisranjbar/AparatDownloader

    A simple Aparat Video Downloader Script

    Language:Python141124
  • alvaroarcelus/Sentiment-Analysis-Pipeline-for-Call-Center-Calls

    End-to-end Sentiment Analysis Pipeline for Call Center Conversations

    Language:Python12110
  • suman-kr/facebook-automation

    💎 Facebook login automation using Selenium WebDriver

    Language:Python12401
  • egypy/egy_best

    Unofficial API for egybest with all the features of the official site

    Language:Python11344
  • hmshb/scraping-agent-ai

    AI-powered web scraping agent built with LangGraph, LangSmith, Firecrawl, and Anthropic AI. Automates intelligent crawling, structured data extraction, and LLM-powered content formatting. Efficiently handles anti-bot mechanisms, error recovery, and batch processing. 🚀

    Language:Python11104
  • fediazgon/mayoclinic-scrapper

    Scraping disease information from Mayo Clinic and saving it in Neo4j

    Language:Python9213
  • zahidadeel/yad2scrapper

    Real estate scraper for scraping data from the YAD2 site (yad2.co.il).

    Language:Python9119
  • abougouffa/arabic-fonts-scraper

    A simple script to download all Arabic fonts from the arfonts.net website

    Language:Python8404
  • adityajn105/cricket_data_extracter

    A set of Python scripts to extract cricket data from https://cricbuzz.com for analytics purposes.

    Language:Python8110
  • 3amory99/Amazon-Product-Scrapping-with-Selenium

    Gather essential product data from Amazon with ease using this Python web scraper and Selenium. Extract product descriptions, prices, ratings, and more for insightful market research and analysis.

    Language:Jupyter Notebook7100
  • Gdi87/Webscrapper

    Web scraper in Python

    Language:Python7200
  • OsamaM0/MCQ_Webscrapping_Telegram_bot

    A Telegram bot that scrapes a website for MCQs and creates quizzes from them

    Language:Python7013
  • LashaGoch/Selenium-Python-Web-Scraping-Project

    Web scraping of www.openaq.org to collect open-source air pollution data. Tools used: Selenium with Python.

    Language:Python6103
  • rexshijaku/FacebookPageAboutScrapper

    Scrapes the About section of any Facebook Page

    Language:Python6210
  • SebaPansecchi/Disney-Web-Scrapping

    List of Disney movies from its beginnings through 2022

    Language:Jupyter Notebook6102
  • VolkanSah/The_Extractor.py

    The Extractor is a Python script that extracts Google dorks from the official Google Hacking Database (GHDB) XML file and saves them in a CSV file. The script only extracts dorks that contain the "inurl:" operator because they are more specific and useful for targeted web scanning.

    Language:Python6103
  • Anwarvic/GutenbergScrapper

    This repo contains a scraper for the Project Gutenberg website, which hosts 56,019 books that are free to read and download. The repo also includes a text file with all the book data up to April 2018, containing only the 'id', 'title' and 'authors' for every book in the dataset.

    Language:Python5103
  • DragonflyRobotics/MAGIST-Algorithm

    Multi-Agent Generally Intelligent Simultaneous Training Algorithm for Project Zeta

    Language:Python52130
  • JackyKch/LiverpoolEvolutionKlopp

    A study of Jürgen Klopp's influence on Liverpool since his appointment as manager of the Reds, using data, machine learning, and data visualization.

    Language:Jupyter Notebook5301
  • lyteloli/content-poster

    A Telegram bot that fetches images from yande.re and posts them to your channel

    Language:Python5100
  • geoinfo-smdu/cgesp_scrap

    Extracts the addresses of flooding points from the CGESP (Centro de Emergências Climáticas) website

    Language:Python4103
  • Laurence-Wu/zLibraryScrapper

    This is the crawler program for the SeekHub project

    Language:Python40
  • memosasoft/image-hunter

    Smart image scraper, spider and image depot builder.

    Language:Python4000
  • wbwlkr/lebonscrap

    LeBonScrap is a spider that collects data from Leboncoin.fr, crawling all the pagination links to scrape every ad listed in a search result from the real-estate category.

    Language:Python4101
  • ZahraneRabhi/Twitter-API

    🔵 Twitter-API is a lightweight Python tool for interacting with the Twitter API. It provides a simple way to fetch and store tweets in a database.

    Language:Python4100
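
Several descriptions in this list mention randomized request intervals and user-agent rotation (for example MLArtist/WebScraper). As a rough illustration of what those two techniques can look like, here is a minimal standard-library sketch. It is not taken from any of the listed repositories: the user-agent strings and the names `USER_AGENTS`, `random_delay`, and `build_requests` are hypothetical, and proxy rotation is left out.

```python
import itertools
import random
import urllib.request

# Hypothetical pool of user-agent strings (placeholders, not from any listed repo).
USER_AGENTS = [
    "Mozilla/5.0 (X11; Linux x86_64) ExampleBrowser/1.0",
    "Mozilla/5.0 (Windows NT 10.0; Win64; x64) ExampleBrowser/2.0",
    "Mozilla/5.0 (Macintosh; Intel Mac OS X 13_0) ExampleBrowser/3.0",
]


def random_delay(min_s=1.0, max_s=5.0):
    """Return a wait time in seconds drawn uniformly from [min_s, max_s]."""
    return random.uniform(min_s, max_s)


def build_requests(urls, user_agents=USER_AGENTS):
    """Yield one urllib Request per URL, cycling through the user-agent pool.

    Building a Request object does not touch the network; the caller decides
    when (and whether) to send it.
    """
    agents = itertools.cycle(user_agents)
    for url in urls:
        yield urllib.request.Request(url, headers={"User-Agent": next(agents)})
```

A caller would typically pause with `time.sleep(random_delay())` before passing each request to `urllib.request.urlopen`, so that consecutive requests differ in both timing and user-agent header.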