.$$$     $.                                   .$$$     $.
$$$$     $$. .$$$  $$$ .$$$$$$.  .$$$$$$$$$$. $$$$     $$. .$$$$$$$. .$$$$$$.
$ $$     $$$ $ $$  $$$ $ $$$$$$. $$$$$ $$$$$$ $ $$     $$$ $ $$   $$ $ $$$$$$.
$ `$     $$$ $ `$  $$$ $ `$  $$$ $$' $ `$ `$$ $ `$     $$$ $ `$      $ `$  $$$'
$. $     $$$ $. $$$$$$ $. $$$$$$ `$  $. $  :' $. $     $$$ $. $$$$   $. $$$$$.
$::$  .  $$$ $::$  $$$ $::$  $$$     $::$     $::$  .  $$$ $::$      $::$  $$$$
$;;$ $$$ $$$ $;;$  $$$ $;;$  $$$     $;;$     $;;$ $$$ $$$ $;;$      $;;$  $$$$
$$$$$$ $$$$$ $$$$  $$$ $$$$  $$$     $$$$     $$$$$$ $$$$$ $$$$$$$$$ $$$$$$$$$'

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Readme for WhatWeb - Next generation web scanner.
Developed by Andrew Horton aka urbanadventurer and Brendan Coles
Version: 0.4.8-dev (unreleased)
License: GPLv2

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

This product is subject to the terms detailed in the license agreement. For more information about WhatWeb visit:

	Homepage:	http://www.morningstarsecurity.com/research/whatweb
	Wiki:		https://github.com/urbanadventurer/WhatWeb/wiki/

If you have any questions, comments or concerns regarding WhatWeb, please consult the documentation prior to contacting one of the developers. Your feedback is always welcome. 


	Contents
	========================================================================
	1.  About WhatWeb
	2.  Example Usage
	3.  Usage
	4.  Logging & Output
	5.  Plugins
	6.  Aggression
	7.  Performance & Stability
	8.  Optional Dependencies
	9.  Release History
	10. Credits
	11. Updates & Additional Information
	========================================================================



1. About WhatWeb
================================================================================

WhatWeb identifies websites. Its goal is to answer the question, "What is that website?" WhatWeb recognises web technologies including content management systems (CMS), blogging platforms, statistics/analytics packages, JavaScript libraries, web servers, and embedded devices. WhatWeb has over 1500 plugins, each of which recognises something different. WhatWeb also identifies version numbers, email addresses, account IDs, web framework modules, SQL errors, and more.

WhatWeb can be stealthy and fast, or thorough but slow. WhatWeb supports an aggression level to control the trade-off between speed and reliability. When you visit a website in your browser, the transaction includes many hints about which web technologies power that website. Sometimes a single webpage visit contains enough information to identify a website, but when it does not, WhatWeb can interrogate the website further. The default level of aggression, called 'stealthy', is the fastest and requires only one HTTP request per website. This is suitable for scanning public websites. More aggressive modes were developed for use in penetration tests.

Most WhatWeb plugins are thorough and recognise a range of cues from subtle to obvious. For example, most WordPress websites can be identified by the meta HTML tag, e.g. '<meta name="generator" content="WordPress 2.6.5">'. A minority of WordPress websites remove this identifying tag, but this does not thwart WhatWeb. The WordPress WhatWeb plugin has over 15 tests, which include checking the favicon, default installation files, login pages, and checking for "/wp-content/" within relative links.


Features:
	* Over 1500 plugins
	* Control the trade-off between speed/stealth and reliability
	* Performance tuning. Control how many websites to scan concurrently.
	* Multiple log formats: Brief (greppable), Verbose (human readable), XML, JSON, MagicTree, RubyObject, MongoDB.
	* Proxy support including TOR
	* Custom HTTP headers
	* Basic HTTP authentication
	* Control over webpage redirection
	* Nmap-style IP ranges
	* Fuzzy matching
	* Result certainty awareness
	* Custom plugins defined on the command line



2. Example Usage
================================================================================

Using WhatWeb on a couple of websites (standard WhatWeb output is in colour):

$ ./whatweb slashdot.org reddit.com
http://reddit.com [302] HTTPServer[AkamaiGHost], RedirectLocation[http://www.reddit.com/], Via-Proxy[1.1 bc1], IP[173.223.232.64], Akamai-Global-Host, Country[UNITED STATES][US]
http://slashdot.org [200] Script, HTTPServer[Unix][Apache/1.3.42 (Unix) mod_perl/1.31], Google-Analytics[GA][32013], Via-Proxy[1.1 bc5], UncommonHeaders[x-fry,x-varnish,x-xrds-location,slash_log_data], Apache[1.3.42][mod_perl/1.31], HTML5, IP[216.34.181.45], OpenGraphProtocol[100000696822412], X-Powered-By[Slash 2.005001], Title[Slashdot: News for nerds, stuff that matters], Email[canadaboy@nOspam.gmail.com,jbort@nww.com], Country[UNITED STATES][US]
http://www.reddit.com/ [200] Frame, PasswordField[passwd,passwd2], Script, HTTPServer['; DROP TABLE servertypes; --], IP[203.97.86.202], JQuery, Cookies[reddit_first], Title[reddit: the voice of the internet -- news before it happens], Country[NEW ZEALAND][NZ]
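
The brief output shown above is intended to be greppable. As a rough illustration only (this is not part of WhatWeb), the Ruby sketch below splits one brief line into its target, HTTP status and plugin results. The method name parse_brief_line is hypothetical. The sketch assumes uncoloured output (for example with --colour=never) in the layout shown above, and it assumes that plugin values do not themselves contain square brackets.

```ruby
# Hypothetical helper: parse one line of WhatWeb brief output.
# Returns nil when the line does not look like "URL [status] plugins...".
def parse_brief_line(line)
  m = line.strip.match(/\A(\S+) \[(\d{3})\] ?(.*)\z/)
  return nil unless m

  plugins = {}
  depth = 0
  token = String.new

  # Walk the plugin list and split on commas that sit outside [...] groups,
  # because values such as page titles may contain commas themselves.
  "#{m[3]},".each_char do |ch|
    depth += 1 if ch == '['
    depth -= 1 if ch == ']' && depth > 0

    if ch == ',' && depth.zero?
      name, rest = token.strip.split('[', 2)
      if name && !name.empty?
        # A plugin may report several bracketed values, e.g. Country[..][..].
        plugins[name] = rest ? "[#{rest}".scan(/\[([^\]]*)\]/).flatten : []
      end
      token = String.new
    else
      token << ch
    end
  end

  { target: m[1], status: m[2].to_i, plugins: plugins }
end
```

For example, parse_brief_line applied to the slashdot.org line above would yield a hash whose :plugins entry maps "Country" to ["UNITED STATES", "US"].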




3. Usage
================================================================================

.$$$     $.                                   .$$$     $.         
$$$$     $$. .$$$  $$$ .$$$$$$.  .$$$$$$$$$$. $$$$     $$. .$$$$$$$. .$$$$$$. 
$ $$     $$$ $ $$  $$$ $ $$$$$$. $$$$$ $$$$$$ $ $$     $$$ $ $$   $$ $ $$$$$$.
$ `$     $$$ $ `$  $$$ $ `$  $$$ $$' $ `$ `$$ $ `$     $$$ $ `$      $ `$  $$$'
$. $     $$$ $. $$$$$$ $. $$$$$$ `$  $. $  :' $. $     $$$ $. $$$$   $. $$$$$.
$::$  .  $$$ $::$  $$$ $::$  $$$     $::$     $::$  .  $$$ $::$      $::$  $$$$
$;;$ $$$ $$$ $;;$  $$$ $;;$  $$$     $;;$     $;;$ $$$ $$$ $;;$      $;;$  $$$$
$$$$$$ $$$$$ $$$$  $$$ $$$$  $$$     $$$$     $$$$$$ $$$$$ $$$$$$$$$ $$$$$$$$$'

WhatWeb - Next generation web scanner version 0.4.8-dev.
Developed by Andrew Horton aka urbanadventurer and Brendan Coles
Homepage: http://www.morningstarsecurity.com/research/whatweb

Usage: whatweb [options] <URLs>

TARGET SELECTION:
  <URLs>		Enter URLs, filenames or nmap-format IP ranges.
			Use /dev/stdin to pipe HTML directly
  --input-file=FILE, -i	Identify URLs found in FILE. You can pipe
			hostnames or URLs directly with -i /dev/stdin

TARGET MODIFICATION:
  --url-prefix		Add a prefix to target URLs
  --url-suffix		Add a suffix to target URLs
  --url-pattern		Insert the targets into a URL. Requires --input-file,
			eg. www.example.com/%insert%/robots.txt 

AGGRESSION:
  The aggression level controls the trade-off between speed/stealth and
  reliability.
  --aggression, -a=LEVEL Set the aggression level. Default: 1
  Aggression levels are:
  1. Stealthy	Makes one HTTP request per target. Also follows redirects.
  2. Unused
  3. Aggressive	Can make a handful of HTTP requests per target. This triggers
  		aggressive plugins for targets only when those plugins are
  		identified with a level 1 request first.
  4. Heavy	Makes a lot of HTTP requests per target. Aggressive tests from
  		all plugins are used for all URLs.

HTTP OPTIONS:
  --user-agent, -U=AGENT Identify as AGENT instead of WhatWeb/0.4.8-dev.
  --header, -H		Add an HTTP header. eg "Foo:Bar". Specifying a default
			header will replace it. Specifying an empty value, eg.
			"User-Agent:" will remove the header.
  --follow-redirect=WHEN Control when to follow redirects. WHEN may be `never',
			`http-only', `meta-only', `same-site', `same-domain'
			or `always'. Default: always
  --max-redirects=NUM	Maximum number of contiguous redirects. Default: 10

AUTHENTICATION:
  --user, -u=<user:password> HTTP basic authentication
  --cookie, -c=COOKIES	Provide cookies, e.g. 'name=value; name2=value2  '

PROXY:
  --proxy		<hostname[:port]> Set proxy hostname and port
			Default: 8080
  --proxy-user		<username:password> Set proxy user and password

PLUGINS:
  --list-plugins, -l	List all plugins
  --plugins, -p=LIST	Select plugins. LIST is a comma delimited set of 
			selected plugins. Default is all.
			Each element can be a directory, file or plugin name and
			can optionally have a modifier, eg. + or -
			Examples: +/tmp/moo.rb,+/tmp/foo.rb
			title,md5,+./plugins-disabled/
			./plugins-disabled,-md5
			-p + is a shortcut for -p +plugins-disabled
  --info-plugins, -I=PLUGINS	Display detailed information for plugins.
			Optionally search with keywords in a comma delimited
			list.
  --grep, -g=STRING	Search for STRING in HTTP responses. Reports with a
			plugin named Grep
  --custom-plugin=DEFINITION	Define a custom plugin named Custom-Plugin,
			Examples: ":text=>'powered by abc'"
			":version=>/powered[ ]?by ab[0-9]/"
			":ghdb=>'intitle:abc \"powered by abc\"'"
			":md5=>'8666257030b94d3bdb46e05945f60b42'"
			"{:text=>'powered by abc'},{:regexp=>/abc [ ]?1/i}"
  --dorks=PLUGIN	List google dorks for the selected plugin

OUTPUT:
  --verbose, -v		Verbose output includes plugin descriptions. Use twice
			for debugging.
  --colour,--color=WHEN	control whether colour is used. WHEN may be `never',
			`always', or `auto'
  --quiet, -q		Do not display brief logging to STDOUT
  --no-errors		Suppress error messages

LOGGING:
  --log-brief=FILE	Log brief, one-line output
  --log-verbose=FILE	Log verbose output
  --log-xml=FILE	Log XML format
  --log-json=FILE	Log JSON format
  --log-json-verbose=FILE Log JSON Verbose format
  --log-magictree=FILE	Log MagicTree XML format
  --log-object=FILE	Log Ruby object inspection format
  --log-mongo-database	Name of the MongoDB database
  --log-mongo-collection Name of the MongoDB collection. Default: whatweb
  --log-mongo-host	MongoDB hostname or IP address. Default: 0.0.0.0
  --log-mongo-username	MongoDB username. Default: nil
  --log-mongo-password	MongoDB password. Default: nil
  --log-errors=FILE	Log errors

PERFORMANCE & STABILITY:
  --max-threads, -t	Number of simultaneous threads. Default: 25.
  --open-timeout	Time in seconds. Default: 15
  --read-timeout	Time in seconds. Default: 30
  --wait=SECONDS	Wait SECONDS between connections
			This is useful when using a single thread.

HELP & MISCELLANEOUS:
  --help, -h		This help
  --debug		Raise errors in plugins
  --version		Display version information. (WhatWeb 0.4.8-dev)

EXAMPLE USAGE:
* Scan example.com
  whatweb example.com
* Scan reddit.com slashdot.org with verbose plugin descriptions
  whatweb -v reddit.com slashdot.org
* An aggressive scan of mashable.com detects the exact version of Wordpress
  whatweb -a 3 mashable.com
* Scan the local network quickly with 255 threads and suppress errors
  whatweb --no-errors -t 255 192.168.0.0/24
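
The TARGET MODIFICATION options in the help text above rewrite each target before it is scanned. The Ruby sketch below is one way to picture that rewriting. It is not WhatWeb's implementation: the method name build_urls is hypothetical, and treating the pattern as taking precedence over the prefix and suffix is an assumption made for this illustration.

```ruby
# Hypothetical illustration of --url-prefix, --url-suffix and --url-pattern.
# When a pattern is given, each target replaces the %insert% placeholder;
# otherwise the prefix and suffix are wrapped around the target.
def build_urls(targets, prefix: '', suffix: '', pattern: nil)
  targets.map do |target|
    if pattern
      # Block form avoids treating backslashes in the target as escapes.
      pattern.sub('%insert%') { target }
    else
      "#{prefix}#{target}#{suffix}"
    end
  end
end
```

With the pattern from the help text, build_urls(['admin'], pattern: 'www.example.com/%insert%/robots.txt') gives www.example.com/admin/robots.txt.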



4. Logging & Output
================================================================================

The following types of logging are supported:
	--log-brief=FILE	Brief, one-line, greppable format
	--log-verbose=FILE	Verbose
	--log-xml=FILE		XML format. XSL stylesheet is provided
	--log-json=FILE		JSON format
	--log-json-verbose=FILE	JSON verbose format
	--log-magictree=FILE	MagicTree XML format
	--log-object=FILE	Ruby object inspection format
	--log-mongo-database	Name of the MongoDB database
	--log-mongo-collection	Name of the MongoDB collection. Default: whatweb
	--log-mongo-host	MongoDB hostname or IP address. Default: 0.0.0.0
	--log-mongo-username	MongoDB username. Default: nil
	--log-mongo-password	MongoDB password. Default: nil
	--log-errors=FILE	Log errors. This is usually printed to the screen in red.

You can output to multiple logs simultaneously by specifying multiple command line logging options. Advanced users who want SQL output should read the source code to see unsupported features.
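
As a small illustration of writing several logs in one run, the Ruby sketch below assembles a whatweb command line from a list of targets and a hash of log formats. Only the file-based logging options listed above are used. The names LOG_OPTIONS and whatweb_argv are hypothetical, and the ./whatweb path is an assumption about where the script lives.

```ruby
# Hypothetical helper: build an argument list that logs to several files.
LOG_OPTIONS = {
  brief: '--log-brief',
  verbose: '--log-verbose',
  xml: '--log-xml',
  json: '--log-json',
  errors: '--log-errors'
}.freeze

def whatweb_argv(targets, logs = {})
  log_args = logs.map do |format, file|
    option = LOG_OPTIONS.fetch(format) do
      raise ArgumentError, "unknown log format: #{format}"
    end
    "#{option}=#{file}"
  end
  ['./whatweb', *log_args, *targets]
end

# Example (not executed here):
#   system(*whatweb_argv(['example.com'], { brief: 'scan.log', xml: 'scan.xml' }))
```

Passing the arguments to system as separate array elements avoids shell quoting problems with unusual file names.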


5. Plugins
================================================================================

Matches are made with:
        * Text strings (case sensitive)
        * Regular expressions
        * Google Hack Database queries (limited set of keywords)
        * MD5 hashes
        * URL recognition
        * HTML tag patterns
        * Custom Ruby code for passive and aggressive operations
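
To make three of the match types above concrete, here is a minimal Ruby sketch. It is not WhatWeb's matching engine. The :text, :regexp and :md5 keys are taken from the --custom-plugin examples in the usage text, while the method name match? and the assumption that the MD5 hash is computed over the whole response body are illustrative only.

```ruby
require 'digest/md5'

# Hypothetical matcher: test a response body against one match definition.
def match?(body, match)
  # Text strings are compared case sensitively.
  return body.include?(match[:text]) if match.key?(:text)
  # Regular expressions carry their own flags, e.g. /abc/i.
  return !(body =~ match[:regexp]).nil? if match.key?(:regexp)
  # Assumption: the MD5 hash covers the entire body.
  return Digest::MD5.hexdigest(body) == match[:md5] if match.key?(:md5)

  false
end
```

For instance, match?("Powered by abc 1", { text: 'powered by abc' }) is false because of the case difference, while the same body matches { regexp: /abc [ ]?1/i }.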

To list the plugins supported:

$ ./whatweb -l

WhatWeb Plugin List

Plugin Name               Description
-------------------------------------------------------------------------------
1024-CMS                  1024 is one of a few CMS's leading the way with the i
360-Web-Manager           360-Web-Manager - homepage: http://www.360webmanager.
3COM-NBX                  3COM NBX phone system. The NBX NetSet utility is a we
... (truncated - there are a lot)



To view more detail about a plugin or search plugins for a keyword:

$ ./whatweb -I phpBB
WhatWeb Plugin Information
Searching for phpBB
--------------------------------------------------------------------------------
Plugin Name               Details
phpBB
        Author:              Andrew Horton
        Version:             0.3
        Examples:            19
        Matches:             7
        Dorks:               1
        Passive function:    Yes
        Aggressive function: Yes
        Version detection:   Yes
        Description:
        phpBB is a free forum - Homepage: http://phpbb.org/

--------------------------------------------------------------------------------
1 plugins found


All plugins are loaded by default.

Plugins can be selected by directory, file or plugin name as a comma delimited list with the -p or --plugins command line option.

Each list item may have a modifier: + adds the item to the full set, - removes it from the full set, and an item with no modifier overrides the defaults so that only the named items are selected.

Examples:

	--plugins +plugins-disabled,-foobar
	--plugins +/tmp/moo.rb
	--plugins foobar (only select foobar)
	-p title,md5,+./plugins-disabled/
	-p ./plugins-disabled,-md5
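
The modifier rules above can be modelled with sets. The Ruby sketch below is a simplified illustration, not WhatWeb's own selection code: the method name select_plugins is hypothetical, plugins are represented only by name, and directories are represented by a hypothetical groups hash that maps a directory name to the plugin names inside it.

```ruby
require 'set'

# Hypothetical model of the -p / --plugins list semantics.
#   list     - comma delimited selection, e.g. "+plugins-disabled,-md5"
#   defaults - names of the plugins loaded by default
#   groups   - optional map of directory name => plugin names (assumption)
def select_plugins(list, defaults, groups = {})
  items = list.split(',').map(&:strip).reject(&:empty?)
  expand = ->(name) { groups.fetch(name, [name]) }

  # Items without a modifier replace the default set.
  plain = items.reject { |item| item.start_with?('+', '-') }
  selected = plain.empty? ? Set.new(defaults) : Set.new(plain.flat_map(&expand))

  # Apply + and - modifiers in the order they were given.
  items.each do |item|
    case item[0]
    when '+' then selected.merge(expand.call(item[1..-1]))
    when '-' then selected.subtract(expand.call(item[1..-1]))
    end
  end
  selected
end
```

Under this model, "+plugins-disabled,-md5" yields every default plugin except md5 plus the contents of plugins-disabled, whereas "title,md5" yields only those two plugins.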


The --dorks <plugin name> command line option returns Google dorks for the selected plugin.

For example, --dorks wordpress returns "is proudly powered by WordPress"

The --grep, -g command line option searches the target page for the selected string and returns a match in a plugin called Grep if it is found.


6. Aggression
================================================================================

WhatWeb features several levels of aggression. By default the aggression level is set to 1 (stealthy) which sends a single HTTP GET request and also follows redirects.

--aggression, -a

	1. Stealthy	Makes one HTTP request per target. Also follows redirects.
	2. Unused
	3. Aggressive	Can make a handful of HTTP requests per target. This triggers
			aggressive plugins for targets only when those plugins are
			identified with a level 1 request first.
	4. Heavy	Makes a lot of HTTP requests per target. Aggressive tests from
			all plugins are used for all URLs.

Level 3 aggressive plugins will guess more URLs and perform actions that are potentially unsuitable without permission. WhatWeb currently does not support any intrusion/exploit level tests in plugins.
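
The difference between the levels comes down to which plugins are allowed to run their aggressive tests. The Ruby sketch below restates the level descriptions above as code. It is a simplified model rather than WhatWeb's implementation: the method name aggressive_plugins is hypothetical, and rejecting any level other than 1, 3 or 4 is an assumption of the sketch.

```ruby
# Hypothetical model: which plugins run aggressive tests at each level.
#   all_plugins     - names of every loaded plugin
#   passive_matches - plugins already identified by the level 1 request
def aggressive_plugins(level, all_plugins, passive_matches)
  case level
  when 1 then []                             # stealthy: passive only
  when 3 then all_plugins & passive_matches  # only plugins seen passively
  when 4 then all_plugins                    # heavy: every plugin
  else raise ArgumentError, "unsupported aggression level: #{level}"
  end
end
```

So if a level 1 request identifies only phpBB, level 3 would run aggressive tests for phpBB alone, while level 4 would run them for every loaded plugin.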

An example of the different results between level 1 and level 3:
-----------------------------------------------------------------
A level 1, stealthy scan identifies that smartor.is-root.com/forum/ uses phpBB version 2:

$ ./whatweb smartor.is-root.com/forum/
http://smartor.is-root.com/forum/ [200] PasswordField[password], HTTPServer[Apache/2.2.15], PoweredBy[phpBB], Apache[2.2.15], IP[88.198.177.36], phpBB[2], PHP[5.2.13], X-Powered-By[PHP/5.2.13], Cookies[phpbb2mysql_data,phpbb2mysql_sid], Title[Smartors Mods Forums - Reloaded], Country[GERMANY][DE]

A level 3, aggressive scan triggers additional tests in the phpBB plugin which identifies that the website uses phpBB version 2.0.20 or higher:

$ ./whatweb -p plugins/phpbb.rb -a 3 smartor.is-root.com/forum/
http://smartor.is-root.com/forum/ [200] phpBB[2,>2.0.20]

Note the use of the -p argument to select only the phpBB plugin. It is advisable, but not mandatory, to select a specific plugin when attempting to fingerprint software versions in aggressive mode. This approach is far more stealthy as it will limit the number of requests.

WhatWeb has no caching, so if you use aggressive plugins on redirecting URLs, you may fetch the same files multiple times.


7. Performance & Stability
================================================================================

WhatWeb features several options to increase performance and stability.

  --max-threads, -t     Number of simultaneous threads. Default: 25.
  --open-timeout        Time in seconds. Default: 15
  --read-timeout        Time in seconds. Default: 30
  --wait=SECONDS        Wait SECONDS between connections
                        This is useful when using a single thread.

The --wait and --max-threads options can be used to assist in IDS evasion.
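
To show how a thread limit and a per-connection wait interact, here is a generic Ruby worker-pool sketch. It is not taken from WhatWeb's source. The method name scan_concurrently is hypothetical, and the block passed to it stands in for whatever work is done per target.

```ruby
# Hypothetical worker pool: process targets with at most max_threads
# threads, pausing `wait` seconds after each target in every thread.
def scan_concurrently(targets, max_threads: 25, wait: 0)
  queue = Queue.new
  targets.each { |t| queue << t }
  results = Queue.new

  workers = Array.new([max_threads, targets.size].min) do
    Thread.new do
      loop do
        target = begin
          queue.pop(true) # non-blocking pop raises ThreadError when empty
        rescue ThreadError
          break
        end
        results << [target, yield(target)]
        sleep(wait) if wait.positive?
      end
    end
  end
  workers.each(&:join)

  # Collect the [target, result] pairs into a hash.
  Array.new(results.size) { results.pop }.to_h
end
```

With max_threads: 1 and a non-zero wait, targets are processed strictly one after another with a pause between them, which is the slow and quiet end of the trade-off.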

Changing the user-agent using the -U or --user-agent command line option will avoid the Snort IDS rule for WhatWeb.

If you are scanning ranges of IP addresses, it is much more efficient to use a port scanner like nmap to discover which have port 80 open before scanning with WhatWeb.
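
The pre-filtering idea can be sketched in Ruby with the standard ipaddr and socket libraries. This is only an illustration of the approach and is no substitute for a real port scanner such as nmap. The method names expand_range and port_open? are hypothetical, and the 2 second timeout is an arbitrary choice.

```ruby
require 'ipaddr'
require 'socket'

# Expand a CIDR range such as "192.168.0.0/24" into individual addresses.
def expand_range(cidr)
  IPAddr.new(cidr).to_range.map(&:to_s)
end

# Return true if a TCP connection to host:port succeeds within the timeout.
def port_open?(host, port = 80, timeout = 2)
  Socket.tcp(host, port, connect_timeout: timeout) { true }
rescue StandardError
  false
end

# Example (not executed here): keep only hosts that answer on port 80,
# then pass the survivors to WhatWeb.
#   live = expand_range('192.168.0.0/24').select { |ip| port_open?(ip) }
```

This sequential check is slow for large ranges, which is why a dedicated scanner remains the better tool for the discovery step.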

Character set detection with the Charset plugin dramatically decreases performance because it requires more CPU. Character set detection is required by JSON and MongoDB logging.



8. Optional Dependencies
================================================================================

To enable JSON logging install the json gem.
	gem install json

To enable MongoDB logging install the mongo gem.
	gem install mongo

To enable character set detection and MongoDB logging install the rchardet gem.
	gem install rchardet
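
To check which of these optional gems are already installed, a short Ruby script along the following lines may help. It is a convenience sketch rather than part of WhatWeb. The names gem_available? and OPTIONAL_GEMS are hypothetical, and it assumes each gem is loaded with a require of the same name as the gem.

```ruby
# Hypothetical check: can the named library be loaded?
def gem_available?(name)
  require name
  true
rescue LoadError
  false
end

# Gem names from the list above, with the feature each one enables.
OPTIONAL_GEMS = {
  'json' => 'JSON logging',
  'mongo' => 'MongoDB logging',
  'rchardet' => 'character set detection and MongoDB logging'
}.freeze

OPTIONAL_GEMS.each do |name, feature|
  status = gem_available?(name) ? 'installed' : 'missing'
  puts "#{name}: #{status} (#{feature})"
end
```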



9. Release History
================================================================================

Version 0.3   Released at Kiwicon III (kiwicon.org), November 2nd, 2009
Version 0.4   Released March 14th, 2010
Version 0.4.1 Released April 28th, 2010
Version 0.4.2 Released April 30th, 2010
Version 0.4.3 Released May 24th, 2010
Version 0.4.4 Released June 29th, 2010
Version 0.4.5 Released August 17th, 2010
Version 0.4.6 Released March 25th, 2011
Version 0.4.7 Released April 5th, 2011
Version 0.4.8-dev Unreleased

10. Credits
================================================================================

Written by urbanadventurer aka Andrew Horton and Brendan Coles
Homepage: http://www.morningstarsecurity.com/research/whatweb
License: GPLv2


DEVELOPERS

Andrew Horton
Brendan Coles


CONTRIBUTORS

Thank you to the following people who have contributed to WhatWeb

Emilio Casbas
Louis Nyffenegger
Patrik Wallström
Caleb Anderson
Tonmoy Saikia
Aung Khant
Erik Inge Bolsø
nk@dsigned.gr
Steve 'Ashcrow' Milner
Michal Ambroz for writing the Makefile and Man pages
Gremwell for improving the MagicTree logging
Sagar Prakash Junnarkar
GertBerger 
Quintin Poirier
Eric Sesterhenn
dengjw (jawa)
Pedro Worcel (droop)
Matthieu Keller (maggick)
Peter (pvdl)
Napz (RootCon)

11. Updates & Additional Information
================================================================================

The WhatWeb development build features regular updates.

	* WhatWeb-dev:		https://github.com/urbanadventurer/WhatWeb/
	* WhatWeb-dev-unstable:	https://github.com/bcoles/WhatWeb/

Browse the wiki for more documentation and advanced usage techniques.

	* Wiki: https://github.com/urbanadventurer/WhatWeb/wiki/