john-kurkowski/tldextract

Suffix issue

avramd02 opened this issue · 2 comments

When trying to strip certain urls its returning the wrong suffix.

tldextract.extract("http://blahblah.uk.com/blah/blah")
ExtractResult(subdomain='blahblah', domain='uk', suffix='com')

When in reality it should be returning domain = blahblah and suffix = uk.com
uk.com is in the public suffix list.

That suffix is pretty far down the list, on line 10970, so it's in the private domains section. See the FAQ.

I see, thanks.