HTTP, SOCKS4, SOCKS5 proxies scraper and checker.
- Asynchronous.
- Uses regex to search for proxies (ip:port format) on a web page, which allows you to pull out proxies even from json without making any changes to the code.
- Supports determining the geolocation of the proxy exit node.
- Can determine if a proxy is anonymous.
For a version that uses Python's built-in logging
instead of rich, see the simple-output branch.
You can get proxies obtained using this script in monosans/proxy-list.
- Install Python (Windows 7 requires Python 3.8.X). During installation, be sure to check the box
Add Python to PATH
. - Download and unpack the archive with the program.
- Install dependencies from
requirements.txt
(cd
into the unpacked folder and runpython -m pip install -U -r requirements.txt
on the command line). - Edit
config.ini
according to your preference. - Run
main.py
(python main.py
on the command line).
When the script finishes running, the following folders will be created (this behavior can be changed in the config):
proxies
- proxies with any anonymity level.proxies_anonymous
- anonymous proxies.proxies_geolocation
- same asproxies
, but includes exit-node's geolocation.proxies_geolocation_anonymous
- same asproxies_anonymous
, but includes exit-node's geolocation.
Geolocation format is ip:port|Country|Region|City
.