extract links (href data) from html files/web pages.
pip install xurl
run the xurl -h
or xurl --help
for options
-a = append an URL to start of the links
-c = contain text (REGEX)
-C = not contain text (REGEX)
-q = quiet mode (do not print Errors/Warnings/Infos)
-v = version
xurl https://example.com
and same for the files
xurl path/to/file
search using regex
xurl https://example.com -c "section\-[1-10].*.[pdf|xlsx]"