datascraping

There are 187 repositories under the datascraping topic.

UltimaHoarder/UltimaScraper
Scrape all the media from an OnlyFans account - Updated regularly
Language: Python 4.2k 179 1.5k616
Tanu-N-Prabhu/Python
This repository helps you learn Python and Machine Learning from scratch.
Language: Jupyter Notebook 1.8k 50 7852
Avnsx/fansly-downloader
Easy-to-use fansly.com content downloading tool. Written in Python, but also ships as a standalone executable app for Windows. Enjoy your Fansly content offline anytime, anywhere in the highest possible content resolution! Fully customizable to download in bulk or individually: photos, videos & audio from timeline, messages, collection & specific posts 👍
Language: Python 1.4k 37 8570
datawhores/OF-Scraper
A completely revamped and redesigned fork, reimagined from scratch based on the original onlyfans-scraper
Language: Python 900 16 51378
benibela/xidel
Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.
Language: Pascal 817 27 11946
scrapfly/scrapfly-scrapers
Scalable Python web scraping scripts for 40+ popular domains
Language: Python 746 15 22161
sim0n00ps/OF-DRM
C# console app to download DRM-protected videos from OnlyFans accounts
Language: C# 209 8 5527
DwarfThief/Raspagem-de-dados-para-iniciantes
Data scraping for beginners using Scrapy and other basic libraries
Language: Python 136 9 522
Gertje823/Vinted-Scraper
A tool that scrapes/downloads images and data from Vinted & Depop using the API and stores the data in a SQLite database.
Language: Python 113 9 5526
jordon31/OnlySnap
Scrape content from OnlyFans #onlyfans -- #of-scr -- #onlyfans scrape -- #onlyfans-dl -- OnlyFans content downloader -- #of scrap -- #onlysnap
Language: Python 96 3 4811
castlelemongrab/parlance
A minimum-dependency ECMAScript client library and CLI tool for Parler – a "free speech" social network that accepts real money to buy "influence" points to boost organic non-advertising content
Language: JavaScript 70 9 478
kennymkchan/funko-pop-data
Open-source database of all Funko Pop data.
Language: JavaScript 60 6 517
arbuzovv/rusquant
Official version of rusquant package for R
Language: R 45 2 1623
jwillmer/web-scraper-chrome-extension
Web data extraction tool implemented as a Chrome extension
Language: JavaScript 28 8 185
Reljod/Python-Data-Scraping-IMDb-Movie-site-using-BeautifulSoup-Series-1-
Data Scraping using Python BeautifulSoup
Language: Jupyter Notebook 25 2 019
yuis-ice/jseval
Evaluate JavaScript on a URL through a headless Chrome browser.
Language: JavaScript 25 2 01
Agenty/scrapingai
Build web scraping agents using AI to auto-extract data from websites, capture screenshots, generate PDFs from URLs, and crawl the web with Agenty
Language: TypeScript 21 1 03
agnosto/fansly-recorder
Record Fansly streams live and upload them to a remote using rclone
Language: Python 21 3 62
dimitryzub/hotels-scraper-js
Scrape Airbnb, Booking, Hotels.com from a single JavaScript module. ❗No longer maintained.
Language: JavaScript 18 2 32
kanishkan91/Python-DataUpdate-DataProcessor-kbn
This Python module can be used to scrape and process data from different sources. It can output data either as a dataframe in country-year format or as Excel files. The module was created primarily for processing data for the International Futures (IFs) Project; however, it can be used to process data in general. It can process data from the following sources: 1) World Bank World Development Indicators (WDI) 2) UNESCO Education indicators (UIS) 3) FAO Food Balance Sheets (FAO) 4) IMF Government Finance Statistics (IMF GFS) 5) Health data from the Institute for Health Metrics and Evaluation (IHME) 6) Water data from FAO AQUASTAT 7) Energy data from EIA. Currently this module can be run as is on Windows. For usage on Macs, the user may have to change the code lines that specify paths.
Language: Python 15 3 07
sahilbhange/Facebook-Data-Extraction
#DataPipeLine #ETL - A Facebook data extraction utility that extracts publicly available data from Facebook. It uses the Facebook Graph API and Python to extract the data and load it into CSV files for further analysis.
Language: Python 13 0 110
scrape-do/scrapedo-scrapers
Web scraping examples with Scrape.do 😎
Language: Python 12 2 00
easonlai/playstore_reviews_scraping_and_text_analytics
This is a demo repo to demonstrate how to scrape app review data from the Google Play Store using Python with the Google-Play-Scraper library, and then use Azure Text Analytics to perform sentiment analysis on the review content (aka comments).
Language: Jupyter Notebook 11 1 03
ice-wzl/DataReaper
DataReaper is a powerful Python tool designed to harvest data from publicly accessible HTTP servers. It combines the capabilities of Shodan search with web scraping techniques to efficiently gather information from targeted websites.
Language: Python 11 1 51
kennymkchan/nba-topshot-scraper
Node script that will use Selenium to scrape card information from NBA Topshot including card names, rarity, and lowest cost at the moment. Data is scraped once per day.
Language: JavaScript 11 1 32
Data-Horde/ytcc-archive
Archiving community contributions on YouTube: unpublished captions, title and description translations, and caption credits
Language: Python 9 3 21
VirginiaTech/pyvt
A Python API for the VT timetable of classes
Language: HTML 8 3 37
DeDeDeDer/Personal_Projects
This holds all my personal data-related projects (Automation, Modelling, Analysis)
Language: Python 7 1 03
lavgen/WikileaksAPI-project
Language: JavaScript 7 1 01
LynnFernandes23/Movie-Recommedation-System
I developed a sophisticated movie recommendation system using Python, leveraging key libraries such as Pandas, NumPy, Scikit-Learn, and Natural Language Toolkit (NLTK). The system utilizes data scraping techniques to gather movie information and employs advanced data visualization techniques for insightful analysis.
Language: Jupyter Notebook 5 1 01
cchrisnguyen/FlightRadar24
A shell script for scraping FlightRadar24's flight tracking data.
Language: Shell 4 1 10
dimitryzub/py-google-scholar-organic-cite-to-csv-sqlite
Scrape historic Google Scholar Organic and Cite results to CSV and SQLite using Python and SerpApi.
Language: Python 4 1 04
greeshmasunil10/LottoMaxAnalyserBE
A tool for analyzing the results of the Canadian Lotto Max lottery
Language: Python 4 1 00
kennymkchan/greater-toronto-area-housing-data
Data scraped from various sites for housing data around the greater Toronto area (GTA). Scrapes happen daily and data is in both JSON and CSV formats. Free to use for analysis.
4 2 12
LynnFernandes23/Loksabha-Election-2024-Analysis-Through-Power-BI
This repository hosts interactive dashboards and detailed data visualizations that provide insights into the 2024 Indian parliamentary elections. Utilizing Power BI, we've analyzed voter demographics, electoral results, constituency-wise trends, and more, offering a comprehensive view of the election dynamics.
4 1 00
TheOwaisShaikh/Langchainwebsitescraper
Extract product details from WooCommerce sites using the LangChain web extraction library and OpenAI's GPT models.
Language: Python 4 1 01
