scraping-tool

There are 43 repositories under the scraping-tool topic.

  • lorien/awesome-web-scraping

    List of libraries, tools and APIs for web scraping and data processing.

    Language:Makefile6.8k23210791
  • omkarcloud/botasaurus

    The All in One Framework to build Awesome Scrapers.

    Language:Python1.6k16150146
  • ispras/web-scraper-chrome-extension

    Web data extraction tool implemented as a Chrome extension

    Language:JavaScript22783469
  • pavlovtech/WebReaper

    Web scraper, crawler and parser in C#. Designed as a simple, declarative and scalable web scraping solution.

    Language:C#11361127
  • fernandod1/Instagram-downloader

    Instagram user's photos and videos downloader. Download all media files from any username. Working 2022!

    Language:Python694517
  • OpenByteDev/SourceScraper

    Simple library which helps you to retrieve the source of various video streaming sites.

    Language:TypeScript6771518
  • lspahija/torchestrator

    Spin up Tor containers and then proxy HTTP requests via these Tor instances

    Language:Kotlin42508
  • ScrapingAnt/zoominfo_scraper

    Zoominfo scraper using rotating proxies and headless Chrome from ScrapingAnt

    Language:Python32589
  • harismuneer/Android-Apps-Downloader

    📱 A utility for downloading Android apps from the Google Play Store and Xiaomi App Store (the Chinese App Store).

    Language:Python313119
  • Marcel0024/CocoCrawler

    A declarative and easy-to-use web crawler and scraper in C#

    Language:C#27103
  • omkarcloud/botasaurus-starter

    🚀 OFFICIAL STARTER TEMPLATE FOR BOTASAURUS SCRAPING FRAMEWORK 🤖

    Language:TypeScript23147
  • fernandod1/ProductHunt-scraper

    Scraper script for the well-known website Producthunt.com. Scrapes all offers and saves them to an Excel spreadsheet file.

    Language:Python21118
  • Joabutt/ScrapeDogg

    GPT4 Assisted Web Scraping Library

    Language:JavaScript18100
  • pim97/scrappey-wrapper-python

    An API wrapper for Scrappey.com written in Python (Cloudflare and DataDome bypass & solver)

    Language:Python17110
  • ScrapingAnt/alibaba_scraper

    Alibaba scraper using rotating proxies and headless Chrome from ScrapingAnt

    Language:Python16433
  • mapmeld/aoc_reply_dataset

    Building a dataset of Twitter replies for unsupervised learning / bot-blocking

    Language:Python13303
  • DemonMartin/scrappey-wrapper

    An API wrapper for Scrappey.com written in Node.js (Cloudflare bypass & solver)

    Language:JavaScript12124
  • pim97/scrappey.js

    Scrappey.js: A versatile JavaScript wrapper for Scrappey API for solving Cloudflare, datadome, enabling seamless web scraping of anti-bot protected websites. Simplify data extraction with robust functionality and reliable results. Unlock valuable insights effortlessly. Get started with Scrappey

    Language:JavaScript8104
  • skvrahul/chegg_dl

    Python script to automate the download of textbooks from Chegg

    Language:Python8125
  • jaeyk/digital_data_collection_workshop

    Digital Data Collection Workshop

    Language:HTML732
  • omkarcloud/web-scraping-template

    🚀 THIS WEB SCRAPING TEMPLATE PROVIDES YOU WITH A GREAT STARTING POINT WHEN CREATING WEB SCRAPING BOTS. 🤖

    Language:Python7103
  • rija/ghost-ssg

    A Docker-based pipeline to publish the content of a local Ghost 4 server as static pages.

    Language:Shell72200
  • MustakAbsarKhan/DSE_COMPANY_SCRAPER_Python

    The DSE Company Scraper is a Python program that extracts data from the Dhaka Stock Exchange website and saves it to an Excel file for analysis.

    Language:Python5202
  • ayushsoni1010/portfoliogram

    ⚡️ Elevate your portfolio analysis with our cutting-edge web scraping tool. Uncover valuable insights about individuals, their skills, and social profiles effortlessly.

    Language:JavaScript3300
  • Ekans111/Aliexpress-scraper-without-api-free

    Aliexpress products scraping.

    Language:Python3102
  • luminati-io/Awesome-Web-Scraping

    A list of libraries, tools, and APIs for web scraping and data processing. Find everything you need for extracting, managing, and processing data from the web, from HTTP libraries to browser automation tools and proxy services.

    30
  • mannasoumya/instapy_pubprofiles

    Download Instagram Photos and Videos given Link to Posts

    Language:Python3200
  • CyberSudo/Udemy-Enroller

    Python script written with Selenium to automate the process of enrolling in Udemy courses.

    Language:Python2102
  • kawsarlog/AmerisourceBergen

    🛠️ Python 🐍 script that automates the extraction of product pricing details from the AmerisourceBergen 🌐 website (https://abcorder.amerisourcebergen.com). After you input your username, password, and National Drug Code (NDC) codes, the 📜 script navigates the website and retrieves the 💰 Average Wholesale Price (AWP) and Acquisition Cost (AC) 📊 data.

    Language:Python2100
  • patgdut/GoogleMapsScraper

    By scraping leads from Google Maps, you can build a database of potential customers who have shown interest in products or services related to your business. This data can be used for targeted marketing campaigns, email outreach, or sales prospecting.

  • rifki/web-scraping-job-postings

    Web Scraping with Node.js - Puppeteer https://blog.rifkilabs.net/web-scraping-dengan-node-js.html

    Language:JavaScript2200
  • Kareem-Emad/youtube_metadata_scraper

    An expansion of the YouTube-8M dataset that collects more data about the videos, such as likes, views, and channel info, by scraping YouTube

    Language:Python1200
  • MaxValue/IsJavascriptWorking

    test if your damn browser has JS enabled

    Language:HTML1101
  • MoutasemZ/Reddit-Scraper

    Reddit-Scraper is a tool that I have developed to scrape the content of specific subreddits, and I have used it in the research of my Ph.D dissertation in Health Informatics at the University of Waterloo, Ontario, Canada.

    Language:Java1000
  • scrape-do/python-sample

    Best Rotating Proxy & Scraping API Alternative. Python Example.

    Language:Python1100
  • MostafaHima/Speed-Test-Twitter-Bot

    A project that uses Selenium to test internet speed and automatically posts the results on Twitter.

    Language:Python