stamparm/maltrail

[Feature Request] non-domain indicators parser for /maltrail-malware-domains.txt

MikhailKasimov opened this issue · 7 comments

Inspired by: stamparm/aux#8

[i] Target: https://raw.githubusercontent.com/stamparm/aux/master/maltrail-malware-domains.txt
[✓] Status: Retrieval successful
[✓] Parsed 290973 exact domains and 0 ABP-style domains (ignored 13 non-domain entries)

To have a filter\parser for non-domain indicators when collecting domains from maltrail static trails to /maltrail-malware-domains.txt:

maltrail static trails --> [non-domain indicators] --> /maltrail-malware-domains.txt

In case of non-domain indicator != 0, to send emails to MS and MK (e.g. ...| mail -s "non-domain entries found" email@email.email).

This should speed up correction, if junk/orphan records are going to get incorporate into /maltrail-malware-domains.txt.

Domain validator (preliminary): ^((?!-)[A-Za-z0-9-]{1,63}(?<!-)\.)+[A-Za-z]{2,7}$

mail.tsinghua.institute <-- ^((?!-)[A-Za-z0-9-]{1,63}(?<!-)\.)+[A-Za-z]{2,9}$

https://data.iana.org/TLD/tlds-alpha-by-domain.txt <-- ^((?!-)[A-Za-z0-9-]{1,63}(?<!-)\.)+[A-Za-z-0-9]{2,24}(?<![0-9-])$

Relative reports on invalid domains:

stamparm/aux#8
stamparm/aux#10

@stamparm Any implementations for /maltrail-malware-domains.txt ?