/spider

The fastest web crawler written in Rust. Maintained by @a11ywatch.

Primary LanguageRustMIT LicenseMIT

Spider

Build Status Crates.io Documentation Rust Discord chat

Website | Guides | API Docs | Chat

The fastest web crawler and indexer. Foundational building blocks for data curation workloads.

  • Concurrent
  • Streaming
  • Decentralization
  • Headless Chrome Rendering
  • HTTP Proxies
  • Cron Jobs
  • Subscriptions
  • Smart Mode
  • Blacklisting and Budgeting Depth
  • Dynamic AI Prompt Scripting Headless
  • Changelog

Getting Started

The simplest way to get started is to use the Spider Cloud hosted service. View the spider or spider_cli directory for local installations. You can also use spider with Node.js using spider-nodejs and Python using spider-py.

Benchmarks

See BENCHMARKS.

Examples

See EXAMPLES.

License

This project is licensed under the MIT license.

Contributing

See CONTRIBUTING.