webscraping-data

There are 178 repositories under webscraping-data topic.

  • intergalacticalvariable/reader

    📚 This is an adapted version of Jina AI's Reader for local deployment using Docker. Convert any URL to an LLM-friendly input with a simple prefix http://127.0.0.1:3000/https://website-to-scrape.com/

    Language:TypeScript24951438
  • TheWebScrapingClub/TheScrapingClubFree

    The Web Scraping Club Free Repository

    Language:HTML15114116
  • boringPpl/Linkedin-profiles-scraping

    Automatically scrape the web data of people profiles on Linkedin based on a specific search query

    Language:Jupyter Notebook663232
  • Seb943/scrapeVIN

    A python package for scraping vinted - all foreign versions aswell!

    Language:Python402211
  • antonio-nicolau/chaleno

    A Dart package to web scraping data from websites easily and faster using less code lines.

    Language:C++392812
  • chuksoo/IBM-Data-Science-Capstone-SpaceX

    In this project, we predicted if the Falcon 9 first stage will land successfully by following the data science methodology. We also summarized the results for the business stakeholders.

    Language:Jupyter Notebook392097
  • vishwapardeshi/NL_Parser_using_Spacy

    NLP parser using NER and TDD

    Language:Jupyter Notebook241110
  • Save-web-as-zip

    PRITHIVSAKTHIUR/Save-web-as-zip

    Save any web url as zip ( image + assets + html + css + js )

    Language:Python14102
  • IjayAbby/Web-Scraper-Ruby-Capstone-Project

    Web scraping, web harvesting, or web data extraction is data scraping used for extracting data from websites.

    Language:Ruby12211
  • dimitryzub/webscraping-py

    Web Scraping scripts for all Google, other search engines, and other websites (currently outdated, something may not be working).

    Language:Python11402
  • kingjosephm/vehicle_make_model_dataset

    Scrapes Google to create a ~700k sample of US passenger vehicle images with 574 distinct make-models

    Language:Jupyter Notebook9115
  • erogluegemen/TDK-Dataset

    Kaggle: https://www.kaggle.com/datasets/erogluegemen/tdk-turkish-words

    Language:Jupyter Notebook7101
  • CoderNitu/Data_Scraping_and_Analyzing_Economic_Data

    Data Scraping Economic Data

    Language:Jupyter Notebook510
  • Rgpv_result_checker_application

    devnamdev2003/Rgpv_result_checker_application

    The goal of this project is to develop a web-based system that allows college students to check their results online using the Django framework and the Python Requests library. The system will enable students to view their grades and academic performance for a given semester, including their GPA and any remarks from their teachers.

    Language:Python5101
  • R1SH4BH81/imdbBot

    IMDB TELEGRAM BOT : Get movie details like title, year, genres, runtime, rating & cast. Greet users with personalized messages & handle related suggestions. Enjoy movie browsing! 🍿🎥

    Language:Python5106
  • FahimFBA/simple-web-scrapper

    Extract data from websites using the web-scrapper. Made with nodejs, ExpressJS, axios & cheerio.

    Language:JavaScript4
  • R3DHULK/web-scrapper-in-perl

    Web Scrapper In Perl

    Language:Perl4101
  • sakan811/SakuYado

    Discover the ideal accommodation with a Review/Price analyzer.

    Language:TypeScript4
  • swati-gwc/DramaList

    Drama Web Scraping Project

    Language:Python4310
  • johnpdevlin/Oireachtas-App

    Web application to show politician, party, and constituency details. Data scraped from webpages, pdfs, and APIs. Functions analyses and restructures raw data to write qualitative records of politicians’ level of engagement and attendance as well as provide aggregated info. [[ Currently being reworked and expanded ]]

    Language:TypeScript3100
  • ng10op/TradeSphere

    TradeSphere is a web-based application designed for stock analysis, utilizing web scraping to collect, analyze, and visualize stock market data.

    Language:JavaScript3100
  • RiccardoRevalor/AInvest

    *DEV* AInvest is a Python tool that empowers NLP, LLMs and Gen-AI to create personalized report about the stock the user wants to analyze. Data used to evaluate each stock are scraped from various high-quality sources. Disclaimer: This software is provided for educational purposes only. The author is not responsible for any misuse of this software

    Language:HTML30
  • ADVAIT135/Forage-British-Airways-Data-Science-Job-Sim

    This Repository consist of all the Jupyter Notebooks, Images and .CSV files of the tasks that were assigned during the British Airways Data Job Sim hosted on Forage

    Language:Jupyter Notebook2100
  • andreuvv/myl_scraper

    tor.myl.cl web scraper for TCG Mitos y Leyendas (MyL)

    Language:Python20
  • databyharriet/Web-Scraping-Project

    This project contains Python-based web scraping projects designed to automate data collection from online sources. Using BeautifulSoup and requests, these projects efficiently extract and process relevant information.

    Language:Jupyter Notebook2
  • datacollectionspecialist/web-scraping-tool

    Top 5 web scraping tools:#1.scrapeless. #2.Content Grabber.#3.Diffbot.

  • dvaishna/Indeed_Jobs_Scrapping

    This project aims to scrape IT job listings from Indeed, a popular job search platform, using web scraping techniques.

    Language:Python2100
  • kgmuchiri/AthleticScraper

    A python web scraper for the World Athletics website

    Language:Python2
  • ksn-developer/webcrawler

    This repository contains Python code for web crawling. It is built using the BeautifulSoup library and allows you to extract text from web pages and store it in text files. The crawler can also extract hyperlinks from web pages and crawl them recursively.This code will be a great starting point for your own web scraping projects

    Language:Python2101
  • maheshdbabar9340/Web_Scraping

    Data Scraping from websites like Jio Mart, Newspapers like Amar Light and Daily Marathi Bhaskar and Data scraping of All NGO's from India categorized with different states and cities in India.

    Language:Jupyter Notebook2101
  • mihirs16/Segmentation-Clustering-of-Neighbourhoods-Python

    IBM Data Science Professional Certificate Capstone Project

    Language:Jupyter Notebook2101
  • msartortt/Project-Week-6

    Hypothesis testing for a movie database

    Language:Python2000
  • mukul-mschauhan/WebScraping

    Unlock the Power of Web Scraping with Beautiful Soup, Selenium, and More - All in One Repository!

    Language:Jupyter Notebook2101
  • tayerthiaggo/arcrest2shp

    Access an ArcGIS REST Services Directory and download all links as shapefiles

    Language:Python2100
  • Tyler4338/APR-Facebook-Web-Scraper

    This is the Web Scraping software made for the research paper of "Methods of modern data extraction: Investigation into the Processes of Web Scraping and its Application to the Social media Platform of Facebook to Create Comprehensive User Profiles". Details as to the functions of each of the applications, python libraries, and taken ethical measures is listed and explained in the python program itself as comments.

    Language:Python2201