/robotex

Ruby library to obey robots.txt

Primary LanguageRubyMIT LicenseMIT

Robotex

Obey Robots.txt

With one line of code, Robotex (pronounced like “robotics”) will download and parse the robots.txt file and let you know if your program is allowed to visit a given link.

Usage:

robotex = Robotex.new "My User Agent"
robotex.allowed?("http://www.example.com/foo")
robotex.delay!("http://www.example.com/foo") # wait until any specified Crawl-Delay has passed

Acknowledgements

Robotex is a modified version of Kyle Maxwell’s excellent Robots library. Some folks were unable to use that gem due to packaging issues, so I used his code to create Robotex.