/computer-scrapping

Tool that extracts data from the websites of companies that sell computers in Trujillo, Peru, exports it to an XLSX file structured according to a relational data model, and displays it on a Power BI dashboard.

Primary language: TypeScript. License: MIT.

Computer Scrapping

Objectives

  • Apply web scraping techniques to the selected websites to obtain the data (minimum 3 websites and 2 web scraping techniques)
  • Build a Power BI dashboard that shows a dynamic analysis of the data

Steps

1. Search for companies that sell computers in Trujillo

2. Design the data model

Figure: data model diagram

Download Data Model PDF File here
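
To illustrate what "relational" means for the exported data, here is a minimal sketch of turning flat scraped records into two linked tables. The actual model is defined in the diagram and PDF, which are not reproduced here, so every name below (`ScrapedItem`, `StoreRow`, `ProductRow`, `parsePrice`, `normalize`) and the two-table layout are assumptions made for illustration only.

```typescript
// Hypothetical shapes; the project's real tables and columns live in its data model diagram.
interface ScrapedItem {
  store: string;     // company name as shown on the website
  name: string;      // product title
  priceText: string; // raw price string, e.g. "S/ 2,499.00"
}

interface StoreRow {
  id: number;
  name: string;
}

interface ProductRow {
  id: number;
  storeId: number; // foreign key to StoreRow.id
  name: string;
  price: number;
}

// Extract the numeric part of a price string.
// Assumes "," is a thousands separator and "." is the decimal point.
function parsePrice(text: string): number {
  const match = text.match(/\d[\d,]*(?:\.\d+)?/);
  return match ? Number.parseFloat(match[0].replace(/,/g, "")) : Number.NaN;
}

// Split flat scraped records into a stores table and a products table,
// so that each store name is stored once and referenced by id.
function normalize(items: ScrapedItem[]): { stores: StoreRow[]; products: ProductRow[] } {
  const storeIds = new Map<string, number>();
  const stores: StoreRow[] = [];
  const products: ProductRow[] = [];

  for (const item of items) {
    let storeId = storeIds.get(item.store);
    if (storeId === undefined) {
      storeId = stores.length + 1;
      storeIds.set(item.store, storeId);
      stores.push({ id: storeId, name: item.store });
    }
    products.push({
      id: products.length + 1,
      storeId,
      name: item.name.trim(),
      price: parsePrice(item.priceText),
    });
  }

  return { stores, products };
}
```

With a layout like this, each array could be written to its own worksheet in the XLSX file, and Power BI could relate the sheets through the `storeId` column.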

3. Search for tools for data extraction

  • Node.js - JavaScript runtime
  • Puppeteer - Browser automation tool for loading and interacting with web pages
  • Cheerio - HTML parsing and querying library with a jQuery-like API
  • ExcelJS - Library for exporting the extracted data to an XLSX file