/WebCrawl

a python based depth first web page crawler used internally at Radii labs

Primary LanguagePython

webcrawl

this codebase connects to the common crawl database , finds domains, weblinks and other useful info and loads them into the private mongo db atlas instance