scraped-data

There are 91 repositories under scraped-data topic.

CUNY-CL/wikipron
Massively multilingual pronunciation mining
Language:Python324 18 15871
joelbarmettlerUZH/Scrapeasy
Scraping in python made easy - receive the content you like in just one line of code
Language:Python100 7 252
warifp/Shopee-Scrape
Shopee Scrape is a tool that functions to collect data - the data needed, such as finding data from photos, prices, names, store locations and others.
Language:PHP87 5 428
ayaanzhaque/SDCNL
Deep Learning for Suicide and Depression Identification with Unsupervised Label Correction (ICANN 2021)
Language:Python64 5 616
naqushab/SearchEngineScrapy
Scrape data from Google.com, Bing.com, Baidu.com, Ask.com, Yahoo.com, Yandex.com
Language:Python56 10 316
Swader/diffbot-php-client
[Deprecated - Maintenance mode - use APIs directly please!] The official Diffbot client library
Language:PHP53 9 5320
tangible-idea/BitUtils
Systematic coin price notifier, Telegram public channel history parser, Trading tool with python
Language:Python52 8 021
racinmat/mal-analysis
github repo for MyAnimeList analysis. Also links to the MAL dataset.
Language:Jupyter Notebook33 2 18
benjaminvdb/DBRD
110k Dutch Book Reviews Dataset for Sentiment Analysis
Language:Python30 4 33
recommend-games/board-game-scraper
Board game data scraper
Language:Python25 2 15
palahsu/YouTubeScraper
Scraping YouTube Video Description and Video Likes and Comments and Times and Replies! It's Automatically Extracting Data from Video.
Language:Python23 1 15
fernandod1/ProductHunt-scraper
Producthunt.com famous website scraper script. Scrap all offers and save in spreadsheet excel file.
Language:Python21 1 18
frossm/quoter
Command line utility to display stock quotes and index data
Language:Java20 2 64
Merterm/Etymon
Find the origin of words in every language using a Deep Neural Network trained to create an etymological map.
Language:JavaScript20 2 21
SuperKogito/CoinMarketCapScraper
a small python scraper to scrape historical data from the CoinMarketCap website and convert it to csv files . This is an initial step for a data mining process to develop a predictive model of cryptocurrencies prices.
Language:CSS19 3 05
faheel/file-extensions
JSON collection of scraped file extensions, along with their description and type, from FileInfo.com
Language:Python18 5 16
KenzoBH/Web-Scraping-and-EDA-iFood
Web Scraping and EDA from iFood website data.
Language:HTML15 1 02
HarshCasper/Blind-App-Reviews
Scraped reviews of over 25 companies from the Blind App ⚡️
14 3 16
malina/metascraper
Metascraper is a Crystal library for web scraping.
Language:Crystal11 4 01
DavidBellamy/visa_dates
Web scraper for US visa bulletins
Language:Python8 1 01
erogluegemen/ResearchRover
The research paper scrape bot is designed to help researchers and students find academic papers by scraping websites. The bot uses web scraping techniques to extract relevant information from these websites and presents it to users in an organized format.
Language:Python7 1 00
dorzel/username-generator
Generate a username
Language:Python6 2 61
fabio1623/mid-bootcamp-project
A data analysis project on the most popular podcasts on Spotify in Germany in December 2022, including scraped data, cleaned and enriched data, a Jupyter notebook, and images for a Tableau presentation.
Language:Jupyter Notebook6 2 00
hwasiti/smart-image-scraper
Deep learning-based image dataset cleaning of Flickr. Scraped metadata saved in MongoDB. Web app designed & deployed: https://bit.ly/smart_image_scraper
Language:Python6 2 01
shine-jayakumar/Web-Scraping-With-Python
Script to extract customer reviews from a webpage while bypassing bot challenge
Language:Python5 1 00
Ephellon/game-store-catalog
Catalog of PlayStation, Xbox, Nintendo, and Steam games
4 2 20
kztera/university-ranking
Scrape, analyze and visualize data from timeshighereducation.com about World University Ranking with Python.
Language:Jupyter Notebook4 1 00
Nyantuy/WEEBREAD
Read, and watch animanga
Language:Python4 0 02
samirkt/raw_food_recognition
Food recognition system for raw cooking ingredients (i.e. fruits, vegetables, etc.)
Language:Python4 2 00
sdl60660/cleveland_eviction_mapping
Mapping eviction filings in Cleveland by neighborhood using scraped data from the Cleveland Municipal Court website
Language:Python4 2 01
deepavadakan/Pet-Shelter-Adoption-Website
Website that helps people find their perfect lovable dog or cat & actually browse current adoption listings to source where to get a desired breed. Adopt a dog or cat - or BOTH!
Language:Jupyter Notebook3 1 00
ekapope/Baania-webscraping
Bangkok condo maket - webscraping using beautiful soup
Language:Jupyter Notebook3 0 01
kvba0000/upload-systems-archive
RIP Upload.Systems
3 1 00
junguler/TPDNE_example_images
some hand-picked images from thispersondoesnotexist.com
2 1 0
Miranda-Bai/anz_twitter
scraping #anz bank data from twitter by using twscrape package.
Language:Jupyter Notebook2 2 00
NomanSiddiqui0000/Rozee.pk-jobs-Scrapper
This scraper, built in Node.js using Puppeteer and Cheerio, is designed to extract job listings from the Rozee.pk website. It can scrape multiple pages and gather detailed information, including job titles, company names, skills, and more. The output is saved in structured CSV files, with sample datasets for cities like Lahore, Karachi, etc.
Language:JavaScript2

scraped-data

CUNY-CL/wikipron

joelbarmettlerUZH/Scrapeasy

warifp/Shopee-Scrape

ayaanzhaque/SDCNL

naqushab/SearchEngineScrapy

Swader/diffbot-php-client

tangible-idea/BitUtils

racinmat/mal-analysis

benjaminvdb/DBRD

recommend-games/board-game-scraper

palahsu/YouTubeScraper

fernandod1/ProductHunt-scraper

frossm/quoter

Merterm/Etymon

SuperKogito/CoinMarketCapScraper

faheel/file-extensions

KenzoBH/Web-Scraping-and-EDA-iFood

HarshCasper/Blind-App-Reviews

malina/metascraper

DavidBellamy/visa_dates

erogluegemen/ResearchRover

dorzel/username-generator

fabio1623/mid-bootcamp-project

hwasiti/smart-image-scraper

shine-jayakumar/Web-Scraping-With-Python

Ephellon/game-store-catalog

kztera/university-ranking

Nyantuy/WEEBREAD

samirkt/raw_food_recognition

sdl60660/cleveland_eviction_mapping

deepavadakan/Pet-Shelter-Adoption-Website

ekapope/Baania-webscraping

kvba0000/upload-systems-archive

junguler/TPDNE_example_images

Miranda-Bai/anz_twitter

NomanSiddiqui0000/Rozee.pk-jobs-Scrapper