This library searches raw text for domain names. First it finds domain-like strings using a
simple regular expression. Then it uses a list of top-level domain names (TLDs) to discard strings
that cannot be domain names, i.e. those whose last segment is not a top-level domain. The TLD list
is provided by the tldextract library. Technically, this means that the first time you call
find_domains it will download the list of top-level domains (this is tldextract's behaviour).
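The two steps described above can be illustrated with a minimal sketch. This is not the library's actual code: the pattern, the function name sketch_find_domains and the small hard-coded SAMPLE_TLDS set are assumptions made for illustration, whereas the real library takes its TLD list from tldextract.

```python
import re

# Hypothetical, hard-coded TLD sample for illustration only;
# the library itself relies on the list supplied by tldextract.
SAMPLE_TLDS = {"com", "info", "org", "net", "рф"}

# Assumed "simple regexp": two or more dot-separated labels made of
# word characters and hyphens (\w is Unicode-aware for str patterns).
DOMAIN_LIKE = re.compile(r"[\w-]+(?:\.[\w-]+)+")


def sketch_find_domains(text, tlds=SAMPLE_TLDS):
    """Return domain-like strings from text whose last label is a known TLD."""
    found = []
    for match in DOMAIN_LIKE.finditer(text):
        candidate = match.group(0)
        # Step 2: keep the candidate only if its last segment is a TLD.
        last_label = candidate.rsplit(".", 1)[-1].lower()
        if last_label in tlds:
            found.append(candidate)
    return found


sample = "foo bar google.com foo.bar.com domain.info превед-медвед.рф file.txt"
print(sketch_find_domains(sample))
```

In this sketch file.txt matches the pattern but is dropped in the second step, because "txt" is not in the sample TLD set.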
pip install -U find_domains
from find_domains import find_domains
data = """
foo bar google.com foo.bar.com domain.info
превед-медвед.рф
"""
for domain in find_domains(data):
    print(domain)