Canop/rhit

Bot traffic

kevinburke opened this issue · 1 comments

I understand this is difficult / impossible to solve with perfect fidelity, but any option to try to filter out bot traffic, even just obvious stuff like "has a GoogleBot user agent," would be super useful. Right now I am using a grep pipeline to filter these out, anything built in to the tool would be better than that.

Canop commented

Maybe having rhit use sets of regex filters, either internal or given as parameters ?