scraping-web

There are 82 repositories under scraping-web topic.

  • MetaDetective

    franckferman/MetaDetective

    🕵️ Unleash Metadata Intelligence with MetaDetective. Your Assistant Beyond Metagoofil.

    Language:Python4085043
  • deliton/idt

    Image Dataset Tool (idt) is a cli tool designed to make the otherwise repetitive and slow task of creating image datasets into a fast and intuitive process.

    Language:Python2309927
  • pavlovtech/WebReaper

    Web scraper, crawler and parser in C#. Designed as simple, declarative and scalable web scraping solution.

    Language:C#13051132
  • hhuayuan/spiderbuf

    Spiderbuf 是一个专注于 Python 爬虫练习的网站。提供丰富的爬虫教程、爬虫案例解析和爬虫练习题。Python爬虫开发强化练习,在矛与盾的攻防中不断提高技术水平,通过大量的爬虫实战掌握常见的爬虫与反爬套路。 引导式爬虫案例 + 免费爬虫视频教程,以闯关的形式挑战各个爬虫任务,培养爬虫开发的直觉及经验,验证自身爬虫开发与反爬虫实力的时候到了。

    Language:Python1161111
  • ScrapingAnt/amazon_scraper

    Amazon products scraper with using of rotating proxies and headless Chrome from ScrapingAnt

    Language:JavaScript875719
  • codegratia/react-node-web-scraper

    Final Year project, scraping data of e-commerce stores and display in ReactJS app.

    Language:JavaScript501024
  • officialpm/scrape-amazon

    🤩 Python Package for Scraping Amazon Product Reviews ✨

    Language:Python391413
  • Decodo/eCommerce-Scraping-API

    eCommerce Scraping API code examples for Python, PHP and Node.js

    Language:PHP26004
  • Decodo/Web-Scraping-API

    Web Scraping API code examples for Python, PHP and Node.js

    Language:JavaScript25009
  • kurt213/scraper-auto-trader

    A web scraper for extracting car ads data from Auto Trader

    Language:Python24128
  • Decodo/SERP-Scraping-API

    SERP Scraping API code examples for Python, PHP and Node.js

    Language:PHP17004
  • ScrapingAnt/alibaba_scraper

    Alibaba scraper with using of rotating proxies and headless Chrome from ScrapingAnt

    Language:Python17342
  • Joabutt/ScrapeDogg

    GPT4 Assisted Web Scraping Library

    Language:JavaScript16100
  • 0xAmmar/AcquiFinder

    Get acquisitions by scraping titles of crunchbase.

    Language:Python15
  • alexferrari88/scrapeblocks

    Scraping automation framework based on Playwright

    Language:TypeScript14200
  • epythonlab/github-search-tool

    Github Repository Search Tool

    Language:Python14122
  • Web-Scraping-Starter-Kit

    gayanukabulegoda/Web-Scraping-Starter-Kit

    Repository designed to help freshers easily grasp the basics of web scripting, offering simple guides and examples to build a strong foundation.

    Language:Python710
  • nikiroo/fanfix-jexer

    A TUI interface with the library Jexer for fanfix

    Language:Java7101
  • gabyah92/HackerRankLeaderboardGUI

    Scraping Leaderboards from HackerRank using Python

    Language:Python6102
  • agoutsmedt/methodsnet_scraping

    This repository contains the materials for the training session about “Mastering Web Scraping for Data Collection” held during MethodsNET workshop in October 2024 at UCLouvain.

    Language:HTML51
  • KAispread/Instagram-image-downloader

    💟 Instagram Image Downloader

    Language:Java5101
  • TheNoiselessNoise/csfd_scraper

    Simple scraper for CSFD.cz, a Czech movie database.

    Language:Python5301
  • egin10/dapodik_go

    Command Line App untuk scraping data sekolah dari web dapodik (Data Refrensi) : https://referensi.data.kemdikbud.go.id

    Language:Go4100
  • Scraping-Deputes-France

    franckferman/Scraping-Deputes-France

    Script pour scraper les député·e·s français (Nom, Région, Email, Groupe, Circonscription) depuis le site de l'Assemblée nationale.

    Language:Python4100
  • muzzlol/review-radar

    A web application that classifies reviews as real or fake by utilizng an NLP model (SVC classifier). It leverages crawl4ai for scraping reviews off of any product/service page, feeds it to the NLP model whose output gets displayed to the users.

    Language:TypeScript41
  • nikiroo/fanfix

    A small program to download and convert fanfictions and comics from supported websites into offline files (epub, cbz...)

    Language:Java4320
  • Strykez/fastscrape

    A simple web scraper built with python and beautifulfoup.

    Language:Python4201
  • TheOwaisShaikh/Langchainwebsitescraper

    Extract product details from WooCommerce sites using the langchain web extraction library and OpenAI's GPT models.

    Language:Python4101
  • trkgrn/cheapest-product-app

    Kocaeli Uni. Yaz. Lab. 1.1

    Language:Java4101
  • m0-k1/YouTube-Scrapping

    Scraped YouTube site for video links and other meta-data

    Language:Jupyter Notebook3102
  • rizkhal/aniflix-api

    🔥 AniFlix API

    Language:JavaScript3101
  • roxylius/ChatGPT_unofficial_API_Node

    Unofficial ChatGPT Puppeteer API: A lightweight Node.js/Express backend that automates the ChatGPT web interface—with persistent sessions, conversation threading, and “Reason” & “Search” modes—via Puppeteer.

    Language:JavaScript3
  • yashsonwane/Scraping_GitHub_Topics

    Scraping Github topics page. In scraping, first scrape topics Titles, Descriptions and URLs. By using the above info scrape all Topics which contain Username, Repo_name, Stars and repo_url.

    Language:HTML3100
  • CobaltGoldCS/Spindler

    A book reading app optimized for Android. Use css paths to obtain content in a readable format

    Language:C#22100
  • Marco90v/ScrapingDolarBs

    Scraping Web del precio del dolar en bolívares, script para usar en polybar, extrae y muestra resultado

    Language:Python2101
  • SumitM01/Integrated-skill-tracker-using-websraping

    A skill development tracking application which uses web scraping to collect data from various coding websites and displays stats for each user in one place making it easy to learn coding.

    Language:Python2100