A URL can be inserted through the API. This flow is shown by the green line in the picture above.
- The user submits a URL, and the API inserts it into MongoDB
- The API also adds the URL to the Queue
- The Queue sends the URL to the Crawler
- The Crawler crawls the details of the URL
- The URL details are saved to MongoDB
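
The API flow above can be sketched roughly as follows. This is only an illustration, not the project's actual code: `urlStore`, `urlQueue`, `insertUrl`, `crawl`, and `drainQueue` are hypothetical names, and MongoDB, the Queue, and the Crawler are replaced by in-memory stand-ins.

```typescript
// Hypothetical in-memory stand-ins for MongoDB, the Queue and the Crawler.
type UrlRecord = { url: string; details?: string };

const urlStore = new Map<string, UrlRecord>(); // stands in for the MongoDB collection
const urlQueue: string[] = [];                 // stands in for the Queue

// The API saves the URL first, then adds it to the queue.
function insertUrl(url: string): void {
  urlStore.set(url, { url });
  urlQueue.push(url);
}

// Placeholder crawler: a real one would fetch and parse the page.
function crawl(url: string): string {
  return `details of ${url}`;
}

// The queue hands each URL to the crawler; the result is written back to the store.
function drainQueue(): void {
  let url: string | undefined;
  while ((url = urlQueue.shift()) !== undefined) {
    urlStore.set(url, { url, details: crawl(url) });
  }
}

insertUrl("https://example.com");
drainQueue();
```

After `drainQueue()` returns, the queue is empty and the stored record for the URL carries the crawled details.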
- The Scheduler gets the URLs from MongoDB
- The Scheduler then adds the URLs to the Queue
- The Queue sends each URL to the Crawler
- The Crawler crawls the details of the URL
- The URL details are saved to MongoDB
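
The Scheduler flow above differs from the API flow only in where the URLs come from: they are read back from storage instead of being submitted by a user. A minimal sketch, assuming in-memory stand-ins for MongoDB and the Queue (`savedUrls`, `pending`, `runScheduler`, and `processPending` are hypothetical names, not taken from this project):

```typescript
type StoredUrl = { url: string; details?: string };

// Hypothetical stand-in for the URLs already stored in MongoDB.
const savedUrls: StoredUrl[] = [
  { url: "https://example.com/a" },
  { url: "https://example.com/b", details: "stale" },
];

// Hypothetical stand-in for the Queue.
const pending: string[] = [];

// One scheduler run: read every stored URL and add it to the queue.
function runScheduler(): number {
  for (const record of savedUrls) {
    pending.push(record.url);
  }
  return savedUrls.length;
}

// Queue consumer: crawl each pending URL and write the details back.
function processPending(crawlUrl: (url: string) => string): void {
  while (pending.length > 0) {
    const url = pending.shift() as string;
    const record = savedUrls.find((r) => r.url === url);
    if (record) {
      record.details = crawlUrl(url);
    }
  }
}

const scheduled = runScheduler();
processPending((url) => `crawled ${url}`);
```

How often the Scheduler runs is not stated here; in practice it would presumably be triggered by a timer or a cron-style schedule.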
- Install yarn: https://yarnpkg.com/en/docs/install
- Install Docker: https://www.docker.com/products/docker-desktop
- Run yarn install in a terminal, from this project's root folder
./run.sh
./kill.sh