/sitemaped

Parser for XML sitemaps which respects robots.txt and handles gziped and nested sitemaps as well.

Primary LanguageRubyMIT LicenseMIT

Sitemaped

Sitemaped is a powerful parser for XML sitemaps which respects all sitemaps listed in robots.txt and handles gziped and nested sitemaps as well.

Features

  • Respects sitemaps listed in robots.txt
  • Handles gziped sitemaps
  • Supports nested sitemaps (sitemap of sitemaps)

Installation

With Bundler

Just add to your Gemfile

gem 'sitemaped'

Without Bundler

If you're not using Bundler just execute on your commandline

$ gem install sitemaped

Usage

Get a list of all URLs covered by the sitemap(s)

require 'sitemaped'

website = Sitemaped.new('http://www.example.com')
sitemap = website.sitemap  # => ["http://www.example.com/", "http://www.example.com/contact", ...]

Check whether an URL is covered by the sitemap

require 'sitemaped'

sitemap = Sitemaped.new('http://www.example.com')
sitemap.include?('http://www.example.com/contact') # => true or false

Todo

  • Add tests