"hanoi.vn" not parsed correctly

Question

Closed this issue a year ago · 1 comments

Hello,

"registered_domain" not being parsed correctly for "hanoi.vn" specifically.
versions:
tldextract : 3.4.4
python 3.11.3

Actual behavior

>>> tldextract.extract("hanoi.vn")
ExtractResult(subdomain='', domain='', suffix='hanoi.vn')

>>> tldextract.extract("hanoi.vn")
ExtractResult(subdomain='', domain='hanoi', suffix='vn')

Interestingly "hanoia.vn" works as expected.

>>> tldextract.extract("hanoia.vn")
ExtractResult(subdomain='', domain='hanoia', suffix='vn')

Answer 1 · 2023-06-30T09:09:48.000Z

issue solved.
hanoi.vn was recently added to https://publicsuffix.org/list/public_suffix_list.dat so "hanoi.vn" is correctly identified as a suffix