/hk_restaurants

Hong Kong restaurants web scraper

Primary LanguagePythonApache License 2.0Apache-2.0

HK_Lic_Restaurants codecov

Hong Kong Restaurants Web Scraper

This project is aimed to scrape restaurants information from websites. Currently, fehd (Licensed Restaurants registered with Food and Environmental Hygiene Department in Hong Kong)is the only spider available in this project.

Installation

$ git clone git@github.com:sammyfung/hk_restaurants.git
$ cd hk_restaurants
$ python3 -m venv venv
$ source venv/bin/activate  
$ pip install -r requirements.txt

Quickstart

Run the web scraper and export scraped data to CSV file.

$ cd hk_restaurants/hk_restaurants    
$ scrapy crawl fehd -o fehd.csv