ta-crx

Chrome extension to scrape review data from TripAdvisor.


Step 1:

Install tripadvisor-extension.crx in Chrome. The extension takes the following options:

Keyword: location to search for on TripAdvisor
Max Places: maximum number of places to scrape
Max Reviews: maximum number of reviews to scrape
Save Images: whether to save images on the back end
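
As an illustration of how these four options fit together, here is a minimal sketch that validates them before a scrape starts. The function name `normalizeOptions` and the field names (`keyword`, `maxPlaces`, `maxReviews`, `saveImages`) are assumptions made for this example; the extension's actual source may name and check them differently.

```javascript
// Hypothetical helper: names and validation rules are illustrative,
// not taken from the extension's source.
function normalizeOptions(raw) {
  // Keyword: the location to search for; must not be empty.
  const keyword = String(raw.keyword ?? "").trim();
  if (keyword === "") {
    throw new Error("Keyword is required");
  }

  // Max Places / Max Reviews: accept numbers or numeric strings,
  // but only positive integers.
  const toCount = (value, name) => {
    const n = Number(value);
    if (!Number.isInteger(n) || n < 1) {
      throw new Error(`${name} must be a positive integer`);
    }
    return n;
  };

  return {
    keyword,
    maxPlaces: toCount(raw.maxPlaces, "Max Places"),
    maxReviews: toCount(raw.maxReviews, "Max Reviews"),
    // Save Images: coerced to a plain boolean.
    saveImages: Boolean(raw.saveImages),
  };
}

const options = normalizeOptions({
  keyword: " Paris ",
  maxPlaces: "5",
  maxReviews: 20,
  saveImages: true,
});
console.log(options);
```

Rejecting bad values up front would keep a malformed job from ever reaching the server.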


Step 2:

Start the server:

cd server
python server.py --port=<port> --output-dir=<output directory>

Step 3:

If the server crashes, or the service worker fails to communicate with the server for a job, the content script will initiate a download of the raw JSON file. To process this file, run:

cd server
python processor.py --input-file=<path to raw json> --output-dir=<output directory>
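
The fallback described in this step can be sketched roughly as follows. This is not the extension's actual code: `deliverJob`, `sendToServer`, and `downloadRawJson` are hypothetical names, and the two callbacks are passed in so the sketch runs outside a browser.

```javascript
// Hypothetical sketch: sendToServer and downloadRawJson are placeholders
// supplied by the caller, not functions from the extension's source.
async function deliverJob(job, sendToServer, downloadRawJson) {
  try {
    // Normal path: hand the scraped job to the server.
    await sendToServer(job);
    return "sent";
  } catch (err) {
    // Server down or service worker unreachable: keep the data by
    // offering the raw JSON as a file download instead.
    downloadRawJson(JSON.stringify(job));
    return "downloaded";
  }
}

// Simulate a failed delivery with stand-in callbacks.
const failingSend = async () => {
  throw new Error("server unreachable");
};
const savedFiles = [];
deliverJob({ keyword: "Paris", reviews: [] }, failingSend, (text) => savedFiles.push(text))
  .then((outcome) => console.log(outcome, savedFiles.length));
```

In a real content script, the download callback would likely build a Blob from the JSON text and trigger a browser download. The resulting file is what processor.py takes as --input-file.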