stormcrawler
There are 6 repositories under stormcrawler topic.
apache/incubator-stormcrawler
A scalable, mature and versatile web crawler based on Apache Storm
DigitalPebble/stormcrawler-docker
Resources for running StormCrawler with Docker services
sebastian-nagel/warc-crawler
Process web archives (WARC format) with StormCrawler and index content into Elasticsearch or Solr
DigitalPebble/ansible-storm
Ansible playbook for deploying a Storm cluster
DigitalPebble/benchmark
StormCrawler topology to evaluate the performance of different backends and configurations