rc0r/AvivoreXT

Apply multiple regexes to tweets

rc0r opened this issue · 0 comments

rc0r commented

Don't stop processing a tweet after the first regex matches. Instead, continue with all remaining regexes. This allows multiple pieces of information to be extracted from a single tweet, which is handy when a tweet contains several of the data types AvivoreXT is monitoring. For example, when checking for URLs and phone numbers, store every item found in the db, not only the first one (URL or phone number).
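A minimal sketch of the intended behaviour, not the actual AvivoreXT code: the names `PATTERNS` and `extract_all` and both regexes are made up for illustration (the real regexes come from the config file). It runs every regex over the tweet text and collects all matches instead of returning after the first hit.

```python
import re

# Hypothetical patterns for illustration only; AvivoreXT reads its real
# regexes from the config file.
PATTERNS = {
    "url": re.compile(r"https?://\S+"),
    "phone": re.compile(r"\b\d{3}-\d{3}-\d{4}\b"),
}


def extract_all(text, patterns=PATTERNS):
    """Return a list of (type, value) pairs for every match of every pattern."""
    found = []
    for name, regex in patterns.items():
        # finditer yields all non-overlapping matches, so several items of
        # the same type (e.g. two URLs) are collected as well.
        for match in regex.finditer(text):
            found.append((name, match.group(0)))
    return found


if __name__ == "__main__":
    tweet = "call 555-123-4567 or see http://a.example/x and http://b.example/y"
    for item_type, value in extract_all(tweet):
        print(item_type, value)
```

Returning a flat list of (type, value) pairs keeps the caller independent of how many regexes are configured.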

Side notes:

  • modify __db_dup_check accordingly
  • database layout modification needed? (extracting multiple items of the same type from a single tweet, e.g. multiple URLs)
  • regex definitions in the config file may need modification to avoid false positives, ...
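One possible way to handle the first two side notes, sketched under assumptions: the table `items`, its columns, and the helpers `open_db` and `store_items` are hypothetical and do not reflect the existing AvivoreXT schema or `__db_dup_check`. The idea is to store one row per extracted item and let a uniqueness constraint on (tweet, type, value) do the duplicate check.

```python
import sqlite3


def open_db(path=":memory:"):
    """Open the database and create the hypothetical items table if needed."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS items ("
        " tweet_id TEXT NOT NULL,"
        " item_type TEXT NOT NULL,"
        " value TEXT NOT NULL,"
        " UNIQUE (tweet_id, item_type, value))"
    )
    return conn


def store_items(conn, tweet_id, items):
    """Insert (type, value) pairs for a tweet; return how many rows were new."""
    before = conn.total_changes
    # INSERT OR IGNORE skips rows that violate the UNIQUE constraint, so an
    # item already stored for this tweet is not inserted a second time.
    conn.executemany(
        "INSERT OR IGNORE INTO items (tweet_id, item_type, value)"
        " VALUES (?, ?, ?)",
        [(tweet_id, item_type, value) for item_type, value in items],
    )
    conn.commit()
    return conn.total_changes - before
```

With one row per item, multiple URLs from the same tweet need no special casing; whether this fits the current layout is an open question.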