HTTP, SOCKS4, SOCKS5 proxies scraper and checker with rich functionality.

Primary language: Python. License: MIT.

proxy-scraper-checker

HTTP, SOCKS4, SOCKS5 proxies scraper and checker.

  • Asynchronous.
  • Uses a regular expression to find proxies in ip:port format on a web page, so proxies can be extracted even from JSON without any changes to the code.
  • Supports determining the geolocation of the proxy exit node.
  • Can determine if a proxy is anonymous.
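The regex-based extraction mentioned above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the pattern `PROXY_RE` and the helper `find_proxies` are hypothetical names, and the real regex may differ. It shows why the source format does not matter: the pattern scans raw text, so JSON, HTML, and plain lists are all handled the same way.

```python
import re
from typing import List

# Hypothetical pattern (an assumption, not taken from the project):
# four dot-separated groups of 1-3 digits, a colon, then a 1-5 digit port.
PROXY_RE = re.compile(
    r"(?<![\d.])"                # not preceded by a digit or dot
    r"((?:\d{1,3}\.){3}\d{1,3})"  # IPv4-looking host
    r":(\d{1,5})"                 # port
    r"(?!\d)"                     # not followed by another digit
)


def find_proxies(text: str) -> List[str]:
    """Return every plausible ip:port pair found anywhere in text."""
    found = []
    for host, port in PROXY_RE.findall(text):
        # Reject matches that look like an address but are out of range.
        octets_ok = all(int(octet) <= 255 for octet in host.split("."))
        port_ok = 0 < int(port) <= 65535
        if octets_ok and port_ok:
            found.append(f"{host}:{port}")
    return found


# The same function works on JSON because it never parses the structure.
sample = '{"data": [{"proxy": "1.2.3.4:8080"}, {"proxy": "10.0.0.1:3128"}], "note": "999.1.1.1:80"}'
print(find_proxies(sample))  # ['1.2.3.4:8080', '10.0.0.1:3128']
```

Note that this approach only finds proxies written as a single ip:port string; a source that stores the host and port in separate fields would need its own handling.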

For a version that uses Python's built-in logging instead of rich, see the simple-output branch.

Usage

  • Install Python (Windows 7 requires Python 3.8.X). During installation, be sure to check the "Add Python to PATH" box.
  • Download and unpack the archive with the program.
  • Install dependencies from requirements.txt (python -m pip install -U -r requirements.txt on the command line).
    • To improve performance, you can also install extra dependencies. See the aiohttp documentation.
  • Edit config.py according to your preference.
  • Run main.py.

Folders description

When the script finishes running, the following folders will be created (this behavior can be changed in the config):

  • proxies - proxies with any anonymity level.
  • proxies_anonymous - anonymous proxies.
  • proxies_geolocation - same as proxies, but including the exit node's geolocation.
  • proxies_geolocation_anonymous - same as proxies_anonymous, but including the exit node's geolocation.

Geolocation format is ip:port::Country::Region::City.
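If you want to consume these files from your own script, a line in the format above can be split as in the sketch below. The names `GeoProxy` and `parse_geo_line` are hypothetical and not part of the project, and the sample address and location are made up for illustration.

```python
from typing import NamedTuple


class GeoProxy(NamedTuple):
    host: str
    port: int
    country: str
    region: str
    city: str


def parse_geo_line(line: str) -> GeoProxy:
    """Parse one ip:port::Country::Region::City line."""
    # Split on the "::" separators; maxsplit=3 keeps the city field whole.
    address, country, region, city = line.strip().split("::", 3)
    # The address itself uses a single ":" between host and port.
    host, _, port = address.rpartition(":")
    return GeoProxy(host, int(port), country, region, city)


entry = parse_geo_line("1.2.3.4:8080::United States::California::Los Angeles")
print(entry.host, entry.port, entry.city)  # 1.2.3.4 8080 Los Angeles
```

A line with fewer than four `::`-separated fields raises `ValueError`, which is a simple way to notice files from the non-geolocation folders being passed in by mistake.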

Buy me a coffee

Ask for details in Telegram or VK.

License

MIT