/Masscan-Proxies-Tester

Test proxies returned by a Masscan scan

Primary LanguagePython

Parse the output file of a Masscan scan and try to connect to a website with IPs:Port as the proxy. The script fills a queue with the proxies and use multiple threads to consume it. The format expected is produced by Masscan with the -oL parameter. However, it should be easy to adjust the script to support other formats. You could adjust the function parse_results to read the scan results from another data repository for example.

To keep track of the proxies tested, the script try to detect the ODBC driver and connects to the local MySQL database with it. Don't forget to change your password in the connection string!

Installation

RHEL / CentOS:

# yum install unixODBC
# yum install unixODBC-devel

Ubuntu :

# apt-get install python3-dev
# apt-get install unixodbc-dev

Then:

# pip3.7 install -r requirements.txt

Usage

# python3.7 process.py --help
Usage: process.py [options]

Options:
  -h, --help            show this help message and exit
  -m MASSCAN_RESULTS, --masscan=MASSCAN_RESULTS
                        Specify the file containing Masscan's results.
                        Default: /root/masscan/data/out.txt
  -w WEBSITE, --website=WEBSITE
                        (Optional) Specify the website used to test the
                        proxies. Default: http://perdu.com
  -p THREADS, --thread=THREADS
                        (Optional) Specify the number of threads used to test
                        the proxies. Default: 10
  -t TIMEOUT, --timeout=TIMEOUT
                        (Optional) Specify the timeout period when testing a
                        proxy. Default: 6
  -q QUEUE_SIZE, --queue=QUEUE_SIZE
                        (Optional) Specify the size of the queue. Default:
                        10000
  -i IGNORE, --ignore=IGNORE
                        (Optional) Ignore integrity validation of returned
                        content
  -v VERBOSITY, --verbose=VERBOSITY
                        (Optional) Set the level of logging to DEBUG. Default:
                        INFO
  -r REVERSE, --reverse=REVERSE
                        (Optional) Start reading the results file from the
                        end. Useful when you want to restart the script from a
                        large file that has been already partially processed.
                        Default: In order

Masscan usage example

# masscan 0.0.0.0/0 -p 3128,8080,5555,8000 --excludefile exclude.conf -oL out.txt

Masscan-Proxies-Tester usage example

# python3.7 process.py -p 50 -q 500 -m scan_final.txt -t 7
2019-01-11 00:06:18,250 - Hash value of the content of http://www.perdu.com : aeb7415366b7e5bd8f33664efa7ff31b
2019-01-11 00:06:19,250 - Reading scan_final.txt
2019-01-11 00:06:19,635 - Starting 50 threads for processing
Total processed:  19% [#       ] 385964/2000000 (269 Successful 314024 Invalid) - Queue size: 499/500 | ETA:  2 days, 6:26:15 | Elapsed Time: 13:01:03
....

Integrity validation

Specifying -i or --ignore override this validation.

By default, the content of the page returned by a proxy (when the connection returned status code 200) will be verified for integrity by comparing the MD5 hash of the content. If the hash doesn't match, we look for:

  • The presence of the title of the website used to test the integrity
  • The words 'login' or 'authorization' to detect a potential login page

If this fail to recognized the page, we simply save the title of the page.