/abpcrawler

Website crawler for Adblock Plus

Primary LanguagePython

abpcrawler

Firefox extension that loads a range of websites and records which elements are filtered by Adblock Plus.

Requirements

Running

Execute the following:

./run.py -b /usr/bin/firefox urls.txt outputdir

This will run the specified Firefox binary to crawl the URLs from urls.txt (one URL per line). The resulting data and screenshots will be written to the outputdir directory. Firefox will close automatically once all URLs have been processed.

Optionally, you can provide the path to the Adblock Plus repository - Adblock Plus will no longer be downloaded then.

License

This Source Code is subject to the terms of the Mozilla Public License version 2.0 (the "License"). You can obtain a copy of the License at http://mozilla.org/MPL/2.0/.