RuiJi.Net

RuiJi.Net is a distributed web crawler framework for .NET, written in C#. Major features include distributed crawling, distributed extraction, managed cookies, and extraction of AJAX-rendered pages. It supports IP polling across the server's public network addresses and proxy servers. The crawler supports custom cookies, custom headers, and MIME type detection, and it embeds PhantomJS so that page JavaScript can be run. The extractor uses RuiJi expressions, and a range of selectors can be used to clean the extracted data.

