Reuters scrapping problem.
Closed this issue · 1 comments
ostapkharysh commented
The scrapping algo retrieves all the links at the current page. The problem happens after each of these links are being processed. At one of the latests links the script does‘t move to the next one but process the same one for 10 times with 404 error.
The problem seems to be not in the Exceptions, because several links were also unable to reach and program continues with the next.
ostapkharysh commented
Decided not to move further