john-kurkowski/tldextract

tldextract: incorrect resolution of www.nhs.uk

BigAlOne opened this issue · 2 comments

Hi

Could you please test the tldextract library with the following URL: www.nhs.uk

It returns ExtractResult(subdomain='', domain='www', suffix='nhs.uk'), and I don't believe this is correct.

Thanks
AA

I believe the tool does exactly what it says it will do: I found nhs.uk listed as a suffix in the Public Suffix List. If you believe that is an issue, you should raise a complaint with the maintainers of that list.


Yup, @JanoutV called it. The behavior is correct. See also the FAQ.
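
For anyone landing here later, this is a rough sketch of why the result comes out this way. It is not tldextract's actual implementation: the `SUFFIXES` set is a tiny hand-picked stand-in for the Public Suffix List, and `split_host` and `Parts` are hypothetical names used only for illustration. The idea is that the longest matching suffix wins, and the label immediately to its left becomes the domain.

```python
from typing import NamedTuple

# Assumption: a tiny hand-picked subset of suffix rules, for illustration only.
# The real Public Suffix List is far larger and also has wildcard/exception rules.
SUFFIXES = frozenset({"uk", "co.uk", "nhs.uk", "com"})


class Parts(NamedTuple):
    subdomain: str
    domain: str
    suffix: str


def split_host(host: str, suffixes: frozenset = SUFFIXES) -> Parts:
    """Split a hostname using the longest matching suffix (hypothetical helper)."""
    labels = host.lower().rstrip(".").split(".")
    # Candidates run from the whole host down to the last label,
    # so the first hit is the longest matching suffix.
    for i in range(len(labels)):
        candidate = ".".join(labels[i:])
        if candidate in suffixes:
            rest = labels[:i]
            domain = rest[-1] if rest else ""
            subdomain = ".".join(rest[:-1])
            return Parts(subdomain, domain, candidate)
    # No known suffix: treat the last label as the domain.
    return Parts(".".join(labels[:-1]), labels[-1], "")


print(split_host("www.nhs.uk"))
print(split_host("www.bbc.co.uk"))
```

Because nhs.uk is itself a listed suffix, "www" is the only label left over and ends up as the domain, with nothing remaining for the subdomain. With co.uk as the suffix, www.bbc.co.uk has two labels left over, so it splits into subdomain "www" and domain "bbc".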