My solution for the Web Intelligence (2DV515) project at Linnaeus University.
In this project you shall use a web scraping library to download articles that can be used in your search engine from Assignment 3.
When scraping a site such as Wikipedia, you usually start on one page and follow all outgoing links.
You can download pages from Wikipedia or from any other site.
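As a rough illustration of that crawl-and-store loop, here is a minimal sketch in TypeScript using Node 18's built-in `fetch` and the cheerio library. The start page, directory names, and page limit are illustrative assumptions, not part of the assignment, and a real crawler should throttle its requests.

```typescript
// crawl.ts -- illustrative sketch only; start URL and paths are assumptions.
import * as cheerio from "cheerio";
import { mkdir, writeFile } from "fs/promises";

const START = "https://en.wikipedia.org/wiki/Web_scraping"; // example start page
const MAX_PAGES = 200;

async function crawl(): Promise<void> {
  await mkdir("raw", { recursive: true });
  const queue: string[] = [START];
  const visited = new Set<string>();

  while (queue.length > 0 && visited.size < MAX_PAGES) {
    const url = queue.shift()!;
    if (visited.has(url)) continue;
    visited.add(url);

    const html = await (await fetch(url)).text();
    // Store the raw HTML so it can be re-parsed later without re-downloading.
    const name = url.split("/wiki/")[1].replace(/\//g, "_");
    await writeFile(`raw/${name}.html`, html);

    // Follow outgoing article links; skip special pages such as "File:" or "Help:".
    const $ = cheerio.load(html);
    $("a[href^='/wiki/']").each((_, el) => {
      const href = $(el).attr("href");
      if (href && !href.includes(":")) {
        queue.push(`https://en.wikipedia.org${href}`);
      }
    });
  }
}

crawl().catch(console.error);
```

The concrete requirements of the assignment are: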
- Scrape and store raw HTML for at least 200 pages
- Parse the raw HTML files to generate a dataset similar to the Wikipedia dataset from Assignment 3
- For each article, the dataset shall contain one file with all words in the article and another file with all its outgoing links (see the parsing sketch after this list)
- Use the dataset with your search engine from Assignment 3
- Use both content-based ranking and PageRank to rank search results (see the PageRank sketch below)
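One way to produce the per-article word and link files is sketched below, again assuming cheerio and a `raw/` folder of downloaded HTML. The directory names and the exact word-file layout are assumptions; they should match whatever dataset format your Assignment 3 search engine already reads.

```typescript
// parse.ts -- sketch: turn raw HTML into Words/ and Links/ files per article.
import * as cheerio from "cheerio";
import { mkdir, readdir, readFile, writeFile } from "fs/promises";

async function buildDataset(): Promise<void> {
  await mkdir("dataset/Words", { recursive: true });
  await mkdir("dataset/Links", { recursive: true });

  for (const file of await readdir("raw")) {
    const html = await readFile(`raw/${file}`, "utf8");
    const $ = cheerio.load(html);
    const name = file.replace(/\.html$/, "");

    // Words: plain text of the article body, lowercased, non-alphanumerics stripped.
    const words = $("p")
      .text()
      .toLowerCase()
      .split(/[^a-z0-9]+/)
      .filter((w) => w.length > 0);
    await writeFile(`dataset/Words/${name}`, words.join(" "));

    // Links: all outgoing /wiki/ links, one per line.
    const links: string[] = [];
    $("a[href^='/wiki/']").each((_, el) => {
      const href = $(el).attr("href");
      if (href && !href.includes(":")) links.push(href);
    });
    await writeFile(`dataset/Links/${name}`, links.join("\n"));
  }
}

buildDataset().catch(console.error);
```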
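For the PageRank part, a minimal iterative sketch using the standard formulation (damping factor 0.85, fixed number of iterations) over the generated link lists could look like this; it is not this project's actual implementation.

```typescript
// pagerank.ts -- sketch: iterative PageRank over a page -> outgoing links map.
function pageRank(links: Map<string, string[]>, iterations = 20): Map<string, number> {
  const pr = new Map<string, number>();
  for (const page of links.keys()) pr.set(page, 1.0);

  for (let i = 0; i < iterations; i++) {
    const next = new Map<string, number>();
    for (const page of links.keys()) {
      // PR(p) = 0.15 + 0.85 * sum over q linking to p of PR(q) / outdegree(q)
      let sum = 0;
      for (const [other, outgoing] of links) {
        if (other !== page && outgoing.includes(page)) {
          sum += (pr.get(other) ?? 0) / outgoing.length;
        }
      }
      next.set(page, 0.15 + 0.85 * sum);
    }
    for (const [page, score] of next) pr.set(page, score);
  }
  return pr;
}
```

When ranking search results, the PageRank score is combined with the content-based score, as required by the last bullet above.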
Installs all dependencies for the scraper, client, and server, and scrapes 200 Wikipedia articles. Keep in mind that this takes about a minute.
Starts both the client and the server.