usage: download_page.py [-h] [-o OUTPUT] [-v] url

Downloads the page specified from the URL.

positional arguments:
  url                   the page to download

optional arguments:
  -h, --help            show this help message and exit
  -o OUTPUT, --output OUTPUT
                        Saves the download page to a file.
  -v, --verbose         increase output verbosity
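
For readers who want to see how options like these map onto code, here is a minimal sketch of an argparse parser that accepts the same arguments. It is not taken from download_page.py itself: the function name `build_parser` is hypothetical, and treating `-v` as a simple on/off flag (`store_true`) is an assumption based on the usage line.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser whose options mirror the help text above (a sketch)."""
    parser = argparse.ArgumentParser(
        prog="download_page.py",
        description="Downloads the page specified from the URL.",
    )
    # Required positional argument: the page to fetch.
    parser.add_argument("url", help="the page to download")
    # Optional destination; None when the flag is not given.
    parser.add_argument("-o", "--output", help="Saves the download page to a file.")
    # Assumption: -v is a boolean switch rather than a counter.
    parser.add_argument("-v", "--verbose", action="store_true",
                        help="increase output verbosity")
    return parser


# Parse an explicit argument list instead of sys.argv, for illustration.
args = build_parser().parse_args(["google.com", "-o", "out"])
print(args.url, args.output, args.verbose)
```

Passing an explicit list to `parse_args` keeps the example independent of the command line it is run from.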
find_out_when_page_updates.py requires download_page.py to be in the same working folder.
usage: find_out_when_page_updates.py [-h] [-d DURATION] [-p PERIOD] [-f FILE] [-o OUTPUT] [-v] [urls [urls ...]]

Finds out when a page updates.

positional arguments:
  urls                  the page(s) to find out when it changes

optional arguments:
  -h, --help            show this help message and exit
  -d DURATION, --duration DURATION
                        How long to run the script for, in hours. If not specified, it will run forever.
  -p PERIOD, --period PERIOD
                        period of checking, in seconds. Default is 60 seconds.
  -f FILE, --file FILE  gets list to check from a file
  -o OUTPUT, --output OUTPUT
                        Saves the download page to a file. Default is ./out
  -v, --verbose         increase output verbosity
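
The interaction between the period and duration options can be illustrated with a small polling loop. This is only a sketch under stated assumptions, not the script's actual implementation: the names `watch`, `fetch`, `sleep`, and `clock` are hypothetical, and detecting a change by comparing SHA-256 digests of successive downloads is one possible approach that the script may or may not use.

```python
import hashlib
import time
from typing import Callable, Dict, Iterable, List, Optional, Tuple


def watch(
    urls: Iterable[str],
    fetch: Callable[[str], bytes],
    period: float = 60.0,
    duration_hours: Optional[float] = None,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> List[Tuple[float, str]]:
    """Poll each URL every `period` seconds and record when its content changes.

    With duration_hours=None the loop never ends, matching "run forever".
    """
    urls = list(urls)
    deadline = None if duration_hours is None else clock() + duration_hours * 3600
    last_digest: Dict[str, str] = {}
    changes: List[Tuple[float, str]] = []
    while deadline is None or clock() < deadline:
        for url in urls:
            # Assumption: a page "changed" when the hash of its bytes differs.
            digest = hashlib.sha256(fetch(url)).hexdigest()
            previous = last_digest.get(url)
            if previous is not None and digest != previous:
                changes.append((clock(), url))
            last_digest[url] = digest
        sleep(period)
    return changes
```

Injecting `fetch`, `sleep`, and `clock` as parameters is a design choice made here so the loop can be exercised without network access or real waiting; a real run would pass a function that downloads the page.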
$ python download_page.py google.com -o out
This downloads google.com and saves the page to 'out'.
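
The download step in the example above can be approximated with the standard library. The sketch below is an illustration rather than the contents of download_page.py: `normalize_url` and `download` are hypothetical names, and prepending `http://` to a bare host such as `google.com` is an assumed convention.

```python
import urllib.request
from pathlib import Path
from typing import Optional


def normalize_url(url: str) -> str:
    """Prepend http:// when the URL has no scheme (an assumed convention)."""
    return url if "://" in url else "http://" + url


def download(url: str, output: Optional[str] = None, timeout: float = 30.0) -> bytes:
    """Fetch the page body and, if `output` is given, write it to that path."""
    with urllib.request.urlopen(normalize_url(url), timeout=timeout) as response:
        body = response.read()
    if output is not None:
        Path(output).write_bytes(body)
    return body
```

Calling `download("google.com", "out")` would then fetch the page over the network and write the raw bytes to `out`.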
$ python find_out_when_page_updates.py https://www.shobserver.com/journal/getHomePage.htm http://paper.people.com.cn/rmrb/ -d 24 -p 5 -o out -v
This checks the two listed pages every 5 seconds for 24 hours, saves the downloaded files to a folder called 'out', and prints verbose output.