This service is a REST API for extracting HTML content from a URL. It uses the PhantomJS headless browser to crawl a given URL, then applies the Readability algorithm to strip user interface text and advertisements.
This service is configured using environment variables. The default values are put in the .env file.
- MEMCACHED_SERVER: Host and port of memcached server in the format host:port.
To run the service in a development mode, install the NPM modules and use the npm scripts in package.json as shown below.
$ nvm use
$ npm install
$ npm run build
Now to run the compiled server (production):
$ node dist/server.js
You can run it using babel-node for development:
$ npm run dev
Run the Standard syntax and style checker on the source code:
$ npm run lint