filiph/linkcheck

Linkcheck reports cf-challenge protected pages as missing

wom-bat opened this issue · 1 comment

We run a website that links to publications. Any publication hosted on a site protected by Cloudflare challenges is reported as 403 instead of something useful.

For instance, a link to https://royalsocietypublishing.org/doi/10.1098/rsta.2015.0401 is reported as 403 instead of passing.

I don't know what to do about this; I want to check that the links are valid, but the 403 response prevents this.

Unfortunately, I don't think there's anything an automated tool can do about this. If a website returns the wrong HTTP status code, the checker has no way to tell a blocked page from a missing one. I've seen this happen with YouTube (I think), GitHub, and other major websites.
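
A checker can't get past the challenge, but it might at least report such links as "blocked" rather than broken. Below is a rough Python sketch of that idea, not linkcheck's actual code. The function name `classify_link` and the three labels are made up for illustration, and it assumes the challenge response carries a `cf-mitigated: challenge` header, which may not hold for every protected site.

```python
from typing import Mapping


def classify_link(status: int, headers: Mapping[str, str]) -> str:
    """Return 'ok', 'blocked', or 'broken' for a single HTTP response."""
    # HTTP header names are case-insensitive, so normalise before lookup.
    lowered = {name.lower(): value.strip().lower() for name, value in headers.items()}

    # Treat success and redirect codes as a working link.
    if 200 <= status < 400:
        return "ok"

    # Assumption: a Cloudflare challenge is marked by `cf-mitigated: challenge`.
    # The link's real state is unknown here, so report it separately.
    if status == 403 and lowered.get("cf-mitigated") == "challenge":
        return "blocked"

    # Everything else (404, a plain 403, 5xx, ...) is reported as broken.
    return "broken"
```

With something like this, the link above would show up as "blocked" for manual review instead of failing the run, provided the header is actually present in the response.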