PHP Web Crawler

Primary language: PHP. License: GNU General Public License v3.0 (GPL-3.0).

Spiderino

Web crawler written in PHP

### What is Spiderino

Spiderino is a web crawler written in PHP. It takes as input one or more seed URLs and one or more keywords, scans the seeds and any other URLs it finds along the way, and saves the pages (or files) that contain at least one of the input keywords.
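
The keyword filter and link discovery described above could be sketched roughly as follows. This is only an illustration, not Spiderino's actual code: the function names `containsAnyKeyword` and `extractLinks` are hypothetical, and both the case-insensitive matching and the regex-based link extraction are assumptions.

```php
<?php
// Hypothetical helpers for illustration; not taken from the Spiderino source.

/**
 * Return true if $content contains at least one of $keywords.
 * Matching is case-insensitive here (an assumption).
 */
function containsAnyKeyword(string $content, array $keywords): bool
{
    foreach ($keywords as $keyword) {
        if ($keyword !== '' && stripos($content, $keyword) !== false) {
            return true;
        }
    }
    return false;
}

/**
 * Collect absolute http(s) URLs found in href attributes of $html.
 * Relative links are ignored in this sketch.
 */
function extractLinks(string $html): array
{
    preg_match_all('~href\s*=\s*["\'](https?://[^"\']+)["\']~i', $html, $matches);
    // Drop duplicates and reindex so the result is a plain list.
    return array_values(array_unique($matches[1]));
}
```

A crawler built on these helpers would fetch each queued URL, save the content when `containsAnyKeyword` returns true, and append the result of `extractLinks` to the queue of URLs still to visit.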

### Installation & configuration

Very easy: download the source and start it with the `php` command.

### How to start

Starting the crawler is very simple (SEED = URL string, TIME_SIM = integer, in minutes, KEYWORD = string):

    php ./spiderino SEED1 [SEED2 SEED3 .. ] -t TIME_SIM KEYWORD1 [KEYWORD2 KEYWORD3 .. ]
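
To make the argument layout concrete, here is a minimal sketch of how such a command line could be split into seeds, the time value, and keywords. It assumes that everything before `-t` is a seed, the value right after `-t` is TIME_SIM, and everything after that is a keyword. The function name `parseSpiderinoArgs` and the returned array keys are hypothetical; the real Spiderino parser may work differently.

```php
<?php
// Hypothetical argument parser for illustration; not taken from the Spiderino source.

/**
 * Split arguments of the form: SEED1 [SEED2 ..] -t TIME_SIM KEYWORD1 [KEYWORD2 ..]
 * $args is the argument list without the script name.
 */
function parseSpiderinoArgs(array $args): array
{
    $flagPos = array_search('-t', $args, true);
    if ($flagPos === false || $flagPos === 0) {
        throw new InvalidArgumentException('Expected at least one SEED before -t.');
    }

    $seeds = array_slice($args, 0, $flagPos);
    $time = $args[$flagPos + 1] ?? null;
    $keywords = array_slice($args, $flagPos + 2);

    // TIME_SIM must be a non-negative integer and at least one keyword is required.
    if ($time === null || preg_match('/^\d+$/', $time) !== 1 || $keywords === []) {
        throw new InvalidArgumentException('Expected -t TIME_SIM followed by at least one KEYWORD.');
    }

    return ['seeds' => $seeds, 'minutes' => (int) $time, 'keywords' => $keywords];
}
```

In a script run from the command line, this could be called as `parseSpiderinoArgs(array_slice($argv, 1))`, since `$argv[0]` holds the script name.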

### Credits

The following people have participated in the project (in alphabetical order):