scrapping-python
There are 275 repositories under scrapping-python topic.
shaikhsajid1111/social-media-profile-scrapers
Fetch user's data across social media
lkuffo/web-scraping
Más de 50 ejemplos de web scraping utilizando: Requests | Scrapy | Selenium | LXML | BeautifulSoup
MLArtist/WebScraper
Python-based web crawling script with randomized intervals, user-agent rotation, and proxy server IP rotation to outsmart website bots and prevent blocking.
odaysec/NewsCrap
NewsCrap adalah alat scraping berita Google berbasis Command Line Interface (CLI) yang dirancang untuk riset, investigasi, dan pengumpulan data OSINT. Dengan fitur canggih seperti rotation proxy, scheduling otomatis, dan multi-format export, alat ini memudahkan pengumpulan data berita secara efisien dan andal.
vasusharma7/flipkart-grid
Fashion Intelligence system by Team CodePhreaks26 for Flipkart Grid 2.0
SREEHARI1994/InstagramScraper
Download photos,reels and stories of any instagram account, public or Private(that you have access to) to your PC folders
harsh4870/Justdail-scrapper
A 100% working Justdial scrapper, Just enter the url and it'll extract business info from it
joaroque/gigabot-plus
Bot que responde automáticamente as perguntas do giga unitel
luismr/the-pudim-hunter
The Pudim Hunter 🍮 is a Proof of Concept (PoC) tool to scrape job listings from SimplyHired, analyze them against your resume, and assign a relevance score. Get insights into how well each job matches your skills. Automate your job search smarter! 🚀
edrisranjbar/AparatDownloader
A simple Aparat Video Downloader Script
alvaroarcelus/Sentiment-Analysis-Pipeline-for-Call-Center-Calls
End-to-end Sentiment Analysis Pipeline for Call Center Conversations
suman-kr/facebook-automation
:gem: Facebook login Automation using Selenium webdriver
egypy/egy_best
Unofficial api for egybest has all properties of the offical site
hmshb/scraping-agent-ai
AI-powered web scraping agent built with LangGraph, LangSmith, Firecrawl, and Anthropic AI. Automates intelligent crawling, structured data extraction, and LLM-powered content formatting. Efficiently handles anti-bot mechanisms, error recovery, and batch processing. 🚀
fediazgon/mayoclinic-scrapper
Scrapping diseases information from Mayo Clinic and saving it in Neo4j
zahidadeel/yad2scrapper
Real Estate Scrapper for scrapping data from YAD2 site (yad2.co.il).
abougouffa/arabic-fonts-scraper
A simple script to download all Arabic fonts from the arfonts.net website
adityajn105/cricket_data_extracter
A set of python scripts to extract cricket data from https://cricbuzz.com for analytics purpose.
3amory99/Amazon-Product-Scrapping-with-Selenium
Gather essential product data from Amazon with ease using this Python web scraper and Selenium. Extract product descriptions, prices, ratings, and more for insightful market research and analysis.ct data from Amazon with ease using this Python web scraper. Extract product descriptions, prices, ratings, and more for insightful market resear
Gdi87/Webscrapper
web Scrapper In Python
OsamaM0/MCQ_Webscrapping_Telegram_bot
This is Telegram bot that make webscrapping in website and get the MSQ and create Quizzes From MSQ
LashaGoch/Selenium-Python-Web-Scraping-Project
Web scraping of www.openaq.org for open source data to collect air pollution data. Tools used - Selenium Python.
rexshijaku/FacebookPageAboutScrapper
Scrappes About section of any Facebook Page
SebaPansecchi/Disney-Web-Scrapping
Lista de películas de Disney desde sus inicios hasta 2022
VolkanSah/The_Extractor.py
The Extractor is a Python script that extracts Google dorks from the official Google Hacking Database (GHDB) XML file and saves them in a CSV file. The script only extracts dorks that contain the "inurl:" operator because they are more specific and useful for targeted web scanning.
Anwarvic/GutenbergScrapper
This repo contains a scrapper for the Gutenberg's project website which contains 56,019 books free to read and download. In this repo also, you can find text file containing all the book data until April 2018 containing only the 'id', 'title' and 'authors' for every book in the dataset.
DragonflyRobotics/MAGIST-Algorithm
Multi-Agent Generally Intelligent Simultaneous Training Algorithm for Project Zeta
JackyKch/LiverpoolEvolutionKlopp
Study Jürgen Klopp influence over Liverpool since his appointment as manager of the Reds through Data, Machine Learning and Data Visualization.
lyteloli/content-poster
A Telegram bot that can get images at yande.re and post them to your channel
geoinfo-smdu/cgesp_scrap
Extração de endereços dos pontos de alagamentos do site da CGESP (Centro de Emergências Climáticas)
Laurence-Wu/zLibraryScrapper
This is the crowler program for the SeekHub project
memosasoft/image-hunter
Smart image scraper, spider and image depot builder.
wbwlkr/lebonscrap
LeBonScrap is a spider which collect data from Leboncoin.fr, crawl all the pagination links to scrap every ads of the list from one search result of the real-estate category.
ZahraneRabhi/Twitter-API
🔵Twitter-API is a lightweight Python tool for interacting with the Twitter API. It provides a simple way to fetch and store tweets in a database.