ParzivalHack/T-Crawl

A simple web crawler (spider) written in Python

Python | GPL-3.0

What is Web Crawling?

A web crawler, also called a spider, spiderbot, or simply a crawler, is an Internet bot that systematically browses websites, typically for the purpose of web indexing. The process is also known as web spidering.

Tool info

T-Crawl is a web crawler (spider) written in Python. This is how it works:

It downloads the HTML from a webpage.
It parses the HTML to extract links.
It prints the links collected.
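
The three steps above can be sketched roughly as follows. This is not T-Crawl's actual source, only a minimal illustration under stated assumptions: it uses Python's standard library (urllib and html.parser) in place of requests and bs4 so that it runs without extra packages, the names LinkExtractor, extract_links and print_links are hypothetical, and resolving relative links against the page URL is a choice made for this sketch rather than something the steps above state.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin
from urllib.request import urlopen


class LinkExtractor(HTMLParser):
    """Hypothetical helper: collects the href of every <a> tag it is fed."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Assumption of this sketch: relative links are resolved
                # against the URL of the page they were found on.
                self.links.append(urljoin(self.base_url, value))


def extract_links(html, base_url):
    """Step 2: parse the HTML and return the links it contains, in order."""
    parser = LinkExtractor(base_url)
    parser.feed(html)
    return parser.links


def print_links(url):
    """Steps 1 and 3: download the page, then print each link found."""
    with urlopen(url, timeout=10) as response:
        # Fall back to UTF-8 when the server does not declare a charset.
        charset = response.headers.get_content_charset() or "utf-8"
        html = response.read().decode(charset, errors="replace")
    for link in extract_links(html, url):
        print(link)
```

Calling print_links with a page URL of your choice would print one link per line. Keeping the parsing in extract_links, separate from the download, means the link extraction can be tried on a plain HTML string without any network access.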

Installation of T-Crawl

apt update && apt upgrade
pkg install python
pkg install git
pkg install toilet
pip install requests
pip install beautifulsoup4
git clone https://github.com/ParzivalHack/T-Crawl

Usage

cd T-Crawl
chmod +x T-Crawl.py
python T-Crawl.py

License

T-Crawl is licensed under the GNU General Public License v3.0 (GPL-3.0).

© 2022 Tommaso Bona