raviqqe/muffet

Systematic timeout for microsoft.com

aslafy-z opened this issue · 1 comments

I'm unable to scan any microsoft.com pages, however, I can curl them.

$ muffet https://www.microsoft.com --verbose --one-page-only --max-response-body-size=1000000000000000 --timeout 60
failed to fetch root page: timeout

You need to masquerade the user-agent header in some way (e.g. --header 'user-agent: Curl'.)

> muffet --header 'user-agent: Curl' --one-page-only -v https://microsoft.com
https://www.microsoft.com/
        200     http://www.microsoft.com/en/us/default.aspx?redir=true