WGBH-MLA/AAPB2

Add rate limiting for site requests

afred opened this issue · 0 comments

afred commented

Because

Denying traffic based on info in the HTTP request and headers (IP range, user-agent, etc.) is reactionary (not preventive) and does not suffice in cases where there is no convenient way to specify an IP range and/or user-agent to target inconsiderate crawlers.

Background

Apache module mod_evasive can provide an effective speed limit for everybody and temporarily denying traffic (responding with 403) for those who exceed the speed limit.

This relieves us of having to track specific bot names or IP ranges.

Basic configuration of mod_evasive could limit requests to americanarchive.org to N visits per second... we just need to decide what the right value of N is.

Done when

  • Requests to americanarchive.org are limited to a certain number of visits (tbd) per second.
  • Rate limiting is part of provisioning new AAPB instances.