It's a flask api that parses a url's scripts for matching services in the config file.
Add or modify sites in app/app/sites.yaml
:
sites:
karte:
name: KARTE
homepage: https://karte.io
example_site: https://right-on.co.jp
scripts:
- static.karte.io/libs/tracker.js
- static.karte.io/
Don't alter the directory structure or rename files, the docker image wants things in a very specific location.
chromedriver 2.43
severless-chrome 1.0.0-55
selenium 3.14 (Python package)
See adieuadieu/serverless-chrome#133
- Build the image:
docker build -t slackspy .
- Run the service:
docker run -p 80:80 slackspy
- Make a POST request:
curl -X POST -F 'url=https://mossy.jp/' http://localhost:80/scrape
zappa deploy