scraping-python
There are 602 repositories under scraping-python topic.
IG_unfollow
IG_unfollow is a Python script that helps you identify users on Instagram whom you are following, but who do not follow you back.
crawlbase-python
Fast python library for the Crawlbase API
Ebay-Scraper
Get the average price of a product on eBay or a list of products on sale.
InstaCapture
An anonymous Instagram stories, posts, reels, IGTV videos, and profile pictures effortlessly. Supports cookie-based authentication for stories and allows downloading media without authentication for other types.
jwscraper
A python library for scraping videos from JW Player
linkedin-scraping-tools
A set of simple tools leveraging Selenium for scraping LinkedIn through LinkedIn Sales Navigator or LinkedIn Recruiter.
Flight-Tracker
Flight Tracker: Real-time flight updates and interactive map for seamless tracking and staying in the loop.
instagram-scraping-fish
A tutorial for scraping Instagram profile information and posts using Scraping Fish API: https://scrapingfish.com
ensembledata-python
Python library to scrape social media data via the EnsembleData API.
webscraping-benchmark
Web scraping API benchmark
facebook_post_scraping_and_text_analytics
This is demo repo to demostrate how to scrape post data from Facebook by Python with library facebook_scraper. And then use Azure Text Analytics to perform sentiment analysis for post text content.
robbot
My personal Telegram bot made in Python. It has several features and it's based on Pyrogram.
telegram_members_scrapper
Python Script to scrape members from a selected Telegram group.
outscraper-php
The library provides convenient access to the Outscraper API from applications written in the PHP language. Allows using Outscraper's services from your code.
Crewzombitx64
This project was inspired by the unclecode/crawl4ai repository. It provided valuable insights and ideas that helped shape the development of Crewzombitx64.
scrapers
Scrapin' some data, man
AcquiFinder
Get acquisitions by scraping titles of crunchbase.
scrapemed
ScrapeMed: Data scraping for PubMed Central.
Linkedin-Job-Postings-Visualization-and-Analysis-Python
This Python script scrapes up to 100 most recent Linkedin job postings of any job title and creates sentiment visualization in a form of a word cloud.
web-scraper
Web Scraper is a compact Python tool for fetching web pages and extracting links, phone numbers, emails and addresses using requests and BeautifulSoup.
github-search-tool
Github Repository Search Tool
youtube-livechat-scraper
grab youtube live chat data from existing VODs
MinecraftServerForkDownloads
Easy access to (almost) all minecraft server types/loaders with direct links.
intellifist-ai
Intellifist-AI, yapay zeka ve web scraping tekniklerini kullanarak verileri işleyen, analiz eden ve çeşitli görevleri otomatikleştiren bir Python projesidir. Proje, dinamik web içeriğini çekmek için BeautifulSoup4 ve API entegrasyonu gibi araçları kullanarak veri toplama yeteneklerini sergiliyor.
kafka_scrapy_connect
A custom library that integrates Scrapy with Kafka.
TrackerPriceAmazon-Bot
This Telegram Bot checks the price of a given Amazon Product.
Google-Meet-Attendance-using-Python
Take the GoogleMeet Attendance without much effort!
Persian_Question_Answering_Voice2Voice_AI
This repository hosts BonyadAI, a Persian question answering AI Model. We developed an initial web crawler and scraper to gather the dataset. The second phase involved building a machine learning model based on word embeddings and NLP techniques. This AI model operates end-to-end, receiving user voice input and providing responses in Persian voice.
Agoraphon
A Flask application for analyzing activity on an online discussion forum, using scraping, indexing, analytics, relational graph and NLP.
India-Trade-Data
A web scraper written in Python to gather trade data for India across commodities and countries
legalAI
LegalAI is a passion project which explores and simplifies the complexities of obtaining legal information using LLMs.
raidfscrape
Code to Scraping some portion of Data from forum(RaidForums[seized by FBI]) with Python SCRAPY spiders bypassing recaptcha and storing to PostgreSQL database.(used scrapper-API as captcha, proxy bypass) SQLAlchemy as ORM for PostgreSQL - Python.
PageRank
A power iteration algorithm to sort webpages. Builds off of the PageRank function developed by Sergey Brin and Larry Page.
ipl_mock_auction
Mock IPL auction - Automated decision making
freebooter
freebooter downloads photos & videos from the internet and uploads it onto your social media accounts.