datascraping

There are 189 repositories under the datascraping topic.

  • UltimaHoarder/UltimaScraper

    Scrape all the media from an OnlyFans account - Updated regularly

    Language: Python 4.1k1801.5k616
  • Tanu-N-Prabhu/Python

    This repository helps you understand Python from scratch.

    Language: Jupyter Notebook 1.5k519751
  • Avnsx/fansly-downloader

    Easy-to-use fansly.com content downloading tool. Written in Python, but also ships as a standalone executable app for Windows. Enjoy your Fansly content offline anytime, anywhere, in the highest possible content resolution! Fully customizable to download in bulk or individually: photos, videos & audio from timeline, messages, collection & specific posts 👍

    Language: Python 1.3k368568
  • sim0n00ps/OF-DL

    C# console app to download all of the media from OnlyFans accounts, with support for downloading DRM-protected videos

    Language: C# 1.1k42675106
  • datawhores/OF-Scraper

    A completely revamped and redesigned fork, reimagined from scratch based on the original onlyfans-scraper

    Language: Python 7621545067
  • benibela/xidel

    Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.

    Language: Pascal 7102811343
  • scrapfly/scrapfly-scrapers

    Scalable Python web scraping scripts for 40+ popular domains

    Language: Python 4591016115
  • DwarfThief/Raspagem-de-dados-para-iniciantes

    Data scraping for beginners using Scrapy and other basic libraries

    Language: Python 1339521
  • sim0n00ps/OF-DRM

    C# console app to download DRM-protected videos from OnlyFans accounts

    Language: C# 10543212
  • Gertje823/Vinted-Scraper

    A tool that scrapes/downloads images and data from Vinted & Depop using the API and stores the data in a SQLite database.

    Language: Python 9895124
  • jordon31/OnlySnap

    Scrape content from OnlyFans. An OnlyFans content downloader.

    Language: Python 7834610
  • castlelemongrab/parlance

    A minimum-dependency ECMAScript client library and CLI tool for Parler – a "free speech" social network that accepts real money to buy "influence" points to boost organic non-advertising content

    Language: JavaScript 709478
  • kennymkchan/funko-pop-data

    Open-source database of all Funko Pop data.

    Language: JavaScript 556517
  • arbuzovv/rusquant

    Official version of the rusquant package for R

    Language: R 4621622
  • jwillmer/web-scraper-chrome-extension

    Web data extraction tool implemented as a Chrome extension

    Language: JavaScript 288185
  • Reljod/Python-Data-Scraping-IMDb-Movie-site-using-BeautifulSoup-Series-1-

    Data Scraping using Python BeautifulSoup

    Language: Jupyter Notebook 252021
  • yuis-ice/jseval

    Evaluate JavaScript on a URL through a headless Chrome browser.

    Language: JavaScript 25201
  • dimitryzub/hotels-scraper-js

    Scrape Airbnb, Booking, Hotels.com from a single JavaScript module. ❗No longer maintained.

    Language: JavaScript 17232
  • agnosto/fansly-recorder

    Record Fansly live streams and upload them to a remote using rclone

    Language: Python 16362
  • Agenty/scrapingai

    Build AI-powered web scraping agents with Agenty to auto-extract data from websites, capture screenshots, generate PDFs from URLs, and crawl the web

    Language: TypeScript 15102
  • kanishkan91/Python-DataUpdate-DataProcessor-kbn

    This Python module scrapes and processes data from several sources. It can output the data either as a dataframe in country-year format or as Excel files. It was created primarily to process data for the International Futures (IFs) project, but it can be used for general data processing. Supported sources: 1) World Bank World Development Indicators (WDI); 2) UNESCO education indicators (UIS); 3) FAO Food Balance Sheets (FAO); 4) IMF Global Finance Statistics (IMF GFS); 5) health data from the Institute for Health Metrics and Evaluation (IHME); 6) water data from FAO AQUASTAT; 7) energy data from EIA. The module currently runs as-is on Windows; on Macs, the user may have to change the lines of code that specify paths.

    Language: Python 15306
  • sahilbhange/Facebook-Data-Extraction

    #DataPipeLine #ETL - A Facebook data extraction utility that extracts publicly available data from Facebook. It uses the Facebook Graph API and Python to extract the data and load it into CSV files for further analysis.

    Language: Python 140110
  • easonlai/playstore_reviews_scraping_and_text_analytics

    This is a demo repo demonstrating how to scrape app review data from the Google Play Store with Python and the Google-Play-Scraper library, and then use Azure Text Analytics to perform sentiment analysis on the review content (i.e., comments).

    Language: Jupyter Notebook 11103
  • ice-wzl/DataReaper

    DataReaper is a powerful Python tool designed to harvest data from publicly accessible HTTP servers. It combines the capabilities of Shodan search with web scraping techniques to efficiently gather information from targeted websites.

    Language: Python 11151
  • kennymkchan/nba-topshot-scraper

    Node script that will use Selenium to scrape card information from NBA Topshot including card names, rarity, and lowest cost at the moment. Data is scraped once per day.

    Language: JavaScript 11132
  • jack-madison/Evo-Car-Share-App-Scrape

    The code in this repository retrieves the location of all available EVO cars as well as the current energy level of each vehicle from the EVO Android application. EVO is a car sharing service based in Vancouver, BC.

    Language: Python 9111
  • Data-Horde/ytcc-archive

    Archiving community contributions on YouTube: unpublished captions, title and description translations, and caption credits

    Language: Python 8421
  • DeDeDeDer/Personal_Projects

    This holds all my personal data-related projects (automation, modelling, analysis)

    Language: Python 7103
  • VirginiaTech/pyvt

    A Python API for the VT timetable of classes

    Language: HTML 7435
  • lavgen/WikileaksAPI-project

    Language: JavaScript 6101
  • LynnFernandes23/Movie-Recommedation-System

    I developed a sophisticated movie recommendation system using Python, leveraging key libraries such as Pandas, NumPy, Scikit-Learn, and Natural Language Toolkit (NLTK). The system utilizes data scraping techniques to gather movie information and employs advanced data visualization techniques for insightful analysis.

    Language: Jupyter Notebook 5101
  • cchrisnguyen/FlightRadar24

    A shell script for scraping FlightRadar24's flight tracking data.

    Language: Shell 4110
  • dimitryzub/py-google-scholar-organic-cite-to-csv-sqlite

    Scrape historic Google Scholar Organic and Cite results to CSV and SQLite using Python and SerpApi.

    Language: Python 4104
  • LynnFernandes23/Loksabha-Election-2024-Analysis-Through-Power-BI

    This repository hosts interactive dashboards and detailed data visualizations that provide insights into the 2024 Indian parliamentary elections. Utilizing Power BI, we've analyzed voter demographics, electoral results, constituency-wise trends, and more, offering a comprehensive view of the election dynamics.

  • scrape-do/scrapedo-scrapers

    Web scraping scripts with Scrape.do 😎

    Language: Python 4200
  • TheOwaisShaikh/Langchainwebsitescraper

    Extract product details from WooCommerce sites using the LangChain web extraction library and OpenAI's GPT models.

    Language: Python 4101