/crawler

Crawl links recursively in a web page

Primary LanguagePython

crawler

Recursively crawl links from a given webpage in a breadth first (BFS) approach.

An UPPER_LIMIT can be set on the number of links to be crawled.

USAGE:

Download the project zip from this page and extract. Run Link crawler using:

python3 LinkCrawler.py URL UPPER_LIMIT

EXAMPLE:

python3 LinkCrawler.py https://google.com 100

NOTE:

Currently implemented in python3 only.

If no UPPER_LIMIT is specified the program will run indefinitely untill stopped manually.

The output will be saved in a file output.txt. It's path will be displayed upon script execution.