Anorov/cloudflare-scrape

requests.exceptions.HTTPError: 403 Client Error: Forbidden for url: https://www.selfridges.com/

rnrnstar2 opened this issue · 0 comments

Before creating an issue, first upgrade cfscrape with pip install -U cfscrape and see if you're still experiencing the problem. Please also confirm your Node version (node --version or nodejs --version) is version 10 or higher.

Make sure the website you're having issues with is actually using anti-bot protection by Cloudflare and not a competitor like Imperva Incapsula or Sucuri. And if you're using an anonymizing proxy, a VPN, or Tor, Cloudflare often flags those IPs and may block you or present you with a captcha as a result.

Please confirm the following statements and check the boxes before creating an issue:

  • I've upgraded cfscrape with pip install -U cfscrape
  • I'm using Node version 10 or higher
  • The site protection I'm having issues with is from Cloudflare
  • I'm not using Tor, a VPN, or an anonymizing proxy

Python version number

Run python --version and paste the output below:

python --version                                                                 【master | merge】
Python 3.7.7
(python_modules) 

cfscrape version number

Run pip show cfscrape and paste the output below:

●pip show cfscrape                                                                【master | merge】
Name: cfscrape
Version: 2.1.1
Summary: A simple Python module to bypass Cloudflare's anti-bot page. See https://github.com/Anorov/cloudflare-scrape for more information.
Home-page: https://github.com/Anorov/cloudflare-scrape
Author: Anorov
Author-email: anorov.vorona@gmail.com
License: UNKNOWN
Location: /Users/rnrnstar/opt/anaconda3/envs/python_modules/lib/python3.7/site-packages
Requires: requests
Required-by: 
(python_modules) 

Code snippet involved with the issue

>>> cfscrape.get_tokens("https://www.selfridges.com")

ERROR:root:'https://www.selfridges.com' returned an error. Could not collect tokens.
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/Users/rnrnstar/opt/anaconda3/envs/python_modules/lib/python3.7/site-packages/cfscrape/__init__.py", line 384, in get_tokens
    resp.raise_for_status()
  File "/Users/rnrnstar/opt/anaconda3/envs/python_modules/lib/python3.7/site-packages/requests/models.py", line 941, in raise_for_status
    raise HTTPError(http_error_msg, response=self)
requests.exceptions.HTTPError: 403 Client Error: Forbidden for url: https://www.selfridges.com/
>>> 

Complete exception and traceback

(If the problem doesn't involve an exception being raised, leave this blank)


URL of the Cloudflare-protected page

[LINK GOES HERE]

URL of Pastebin/Gist with HTML source of protected page

[LINK GOES HERE]