scraped-data
There are 91 repositories under the scraped-data topic.
CUNY-CL/wikipron
Massively multilingual pronunciation mining
joelbarmettlerUZH/Scrapeasy
Scraping in Python made easy - receive the content you like in just one line of code
warifp/Shopee-Scrape
Shopee Scrape is a tool for collecting data such as photos, prices, names, store locations, and more.
ayaanzhaque/SDCNL
Deep Learning for Suicide and Depression Identification with Unsupervised Label Correction (ICANN 2021)
naqushab/SearchEngineScrapy
Scrape data from Google.com, Bing.com, Baidu.com, Ask.com, Yahoo.com, Yandex.com
Swader/diffbot-php-client
[Deprecated - Maintenance mode - use APIs directly please!] The official Diffbot client library
tangible-idea/BitUtils
Systematic coin price notifier, Telegram public channel history parser, and trading tool in Python
racinmat/mal-analysis
GitHub repo for MyAnimeList analysis. Also links to the MAL dataset.
benjaminvdb/DBRD
110k Dutch Book Reviews Dataset for Sentiment Analysis
recommend-games/board-game-scraper
Board game data scraper
palahsu/YouTubeScraper
Scrapes YouTube video descriptions, likes, comments, timestamps, and replies, automatically extracting the data from a video.
fernandod1/ProductHunt-scraper
Scraper script for the Producthunt.com website. Scrapes all offers and saves them to an Excel spreadsheet.
frossm/quoter
Command line utility to display stock quotes and index data
Merterm/Etymon
Find the origin of words in every language using a Deep Neural Network trained to create an etymological map.
SuperKogito/CoinMarketCapScraper
A small Python scraper that collects historical data from the CoinMarketCap website and converts it to CSV files. This is an initial step in a data mining process to develop a predictive model of cryptocurrency prices.
faheel/file-extensions
JSON collection of file extensions scraped from FileInfo.com, along with their descriptions and types
KenzoBH/Web-Scraping-and-EDA-iFood
Web Scraping and EDA from iFood website data.
HarshCasper/Blind-App-Reviews
Scraped reviews of over 25 companies from the Blind App ⚡️
malina/metascraper
Metascraper is a Crystal library for web scraping.
DavidBellamy/visa_dates
Web scraper for US visa bulletins
erogluegemen/ResearchRover
The research paper scrape bot helps researchers and students find academic papers by scraping websites. It extracts relevant information from those sites and presents it to users in an organized format.
dorzel/username-generator
Generate a username
fabio1623/mid-bootcamp-project
A data analysis project on the most popular podcasts on Spotify in Germany in December 2022, including scraped data, cleaned and enriched data, a Jupyter notebook, and images for a Tableau presentation.
hwasiti/smart-image-scraper
Deep learning-based image dataset cleaning of Flickr. Scraped metadata saved in MongoDB. Web app designed & deployed: https://bit.ly/smart_image_scraper
shine-jayakumar/Web-Scraping-With-Python
Script to extract customer reviews from a webpage while bypassing a bot challenge
Ephellon/game-store-catalog
Catalog of PlayStation, Xbox, Nintendo, and Steam games
kztera/university-ranking
Scrape, analyze and visualize data from timeshighereducation.com about World University Ranking with Python.
Nyantuy/WEEBREAD
Read and watch animanga
samirkt/raw_food_recognition
Food recognition system for raw cooking ingredients (i.e. fruits, vegetables, etc.)
sdl60660/cleveland_eviction_mapping
Mapping eviction filings in Cleveland by neighborhood using scraped data from the Cleveland Municipal Court website
deepavadakan/Pet-Shelter-Adoption-Website
Website that helps people find their perfect dog or cat and browse current adoption listings to see where a desired breed is available. Adopt a dog or cat - or BOTH!
ekapope/Baania-webscraping
Bangkok condo market - web scraping using Beautiful Soup
kvba0000/upload-systems-archive
RIP Upload.Systems
junguler/TPDNE_example_images
Some hand-picked images from thispersondoesnotexist.com
Miranda-Bai/anz_twitter
Scraping #anz bank data from Twitter using the twscrape package.
NomanSiddiqui0000/Rozee.pk-jobs-Scrapper
This scraper, built in Node.js using Puppeteer and Cheerio, extracts job listings from the Rozee.pk website. It can scrape multiple pages and gather detailed information, including job titles, company names, skills, and more. The output is saved in structured CSV files, with sample datasets for cities such as Lahore and Karachi.