GovTechSG/purple-a11y

Respect Robots.txt Files

mgifford opened this issue · 1 comment

Scanners should respect the robots.txt files that sites use to manage crawler traffic.

It would be great if, by default, the scanner respected the wishes of the site owner.
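
For context, a minimal sketch of the kind of check a crawler could perform before visiting a page, using the third-party robots-parser npm package. The canCrawl helper and the user-agent string are illustrative assumptions, not purple-a11y's actual implementation:

  // Minimal sketch, not purple-a11y's actual code: fetch a site's
  // robots.txt and consult it before visiting a page. Assumes Node 18+
  // (global fetch) and the npm package robots-parser.
  const robotsParser = require('robots-parser');

  async function canCrawl(pageUrl, userAgent = 'purple-a11y') {
    const robotsUrl = new URL('/robots.txt', pageUrl).href;
    const res = await fetch(robotsUrl);
    // No robots.txt (or an error response): treat the page as allowed.
    if (!res.ok) return true;
    const robots = robotsParser(robotsUrl, await res.text());
    // isAllowed returns undefined for URLs outside the robots.txt's origin;
    // only an explicit disallow blocks the crawl.
    return robots.isAllowed(pageUrl, userAgent) !== false;
  }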

We have developed a feature to follow robots.txt rules via the -r flag when running the Node CLI:

  -r, --followRobots                 Option for crawler to adhere to robots.txt
                                     rules if it exists
                                 [string] [choices: "yes", "no"] [default: "no"]
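
For reference, a sketch of how the flag might be passed in a scan invocation, assuming cli.js is the CLI entry point and -u supplies the start URL; any other required options are omitted here:

  node cli.js -u https://example.com -r yes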