/crawler

Primary Language: Java

o How to build and run the solution?

Install JDK 11 and Gradle in the build environment. The firewall must permit access to the websites being crawled.

./gradlew clean build shadowJar

java -jar build/libs/crawler-1.0-SNAPSHOT-all.jar

The run generates an output file named "output.txt".

o Explanation of what could be done with more time

Validation of input: URL and output file

Static test files for PageCrawler, so that tests do not rely on external pages

Tree-structured output for better formatting

Concurrent crawl of the child pages
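
As an illustration of the last item, here is a minimal sketch of how child pages might be crawled concurrently. It is not the project's implementation: the class name ConcurrentCrawlSketch and the linkExtractor function are hypothetical stand-ins for whatever PageCrawler does to fetch a page and collect its links, and the in-memory site map in main replaces real network access.

```java
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.function.Function;

public class ConcurrentCrawlSketch {

    // Hypothetical stand-in for page fetching: maps a URL to the links found on that page.
    private final Function<String, List<String>> linkExtractor;
    private final ExecutorService pool;
    // Thread-safe set, so each page is crawled at most once even when linked from several parents.
    private final Set<String> visited = ConcurrentHashMap.newKeySet();

    public ConcurrentCrawlSketch(Function<String, List<String>> linkExtractor, int threads) {
        this.linkExtractor = linkExtractor;
        this.pool = Executors.newFixedThreadPool(threads);
    }

    // Crawls from rootUrl and returns every URL reached. Only the calling thread blocks here.
    public Set<String> crawl(String rootUrl) {
        try {
            visit(rootUrl).join();
            return Set.copyOf(visited);
        } finally {
            pool.shutdown();
        }
    }

    // Fetches one page on the pool, then schedules all of its children without blocking a worker.
    private CompletableFuture<Void> visit(String url) {
        if (!visited.add(url)) {
            return CompletableFuture.completedFuture(null); // already seen
        }
        return CompletableFuture
                .supplyAsync(() -> linkExtractor.apply(url), pool)
                .thenCompose(links -> CompletableFuture.allOf(
                        links.stream().map(this::visit).toArray(CompletableFuture[]::new)));
    }

    public static void main(String[] args) {
        // In-memory "site" used instead of real HTTP requests.
        Map<String, List<String>> site = Map.of(
                "/", List.of("/a", "/b"),
                "/a", List.of("/b", "/"),
                "/b", List.of());
        Set<String> pages = new ConcurrentCrawlSketch(
                url -> site.getOrDefault(url, List.of()), 4).crawl("/");
        System.out.println(new TreeSet<>(pages)); // prints [/, /a, /b]
    }
}
```

Because worker threads never wait on their children (the futures are composed rather than joined), a small fixed pool cannot deadlock on deep link chains. A real version would also need a depth or domain limit and error handling for failed fetches.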