ikreymer/webarchiveplayer

Getting 'permanently moved' results on 200/OK WARC entries

christianleger opened this issue · 3 comments

I'm getting 'permanently moved to here' results when loading up a WARC I made. The word 'here' is a link, and when I click it, it just reloads the same 'permanently moved' page. This would make sense if the URL in question had a result of 30x in my WARC, however I'm getting this for some 200/OK request/responses in my WARC. Any idea why this would be?

Thank you for your time.

One more observation: I was able to view the record in replay, instead of getting the redirect, by changing the timestamp. It seems like if the response record has the same timestamp as the request record, then I get the redirect!

More observation: changing record times seems to help sometimes, but not reliably so.

Overall, although the WARC standard says all records (in a given capture) need to have the same timestamp, giving all records the same timestamp often results in 'permanently moved' pages, while providing different timestamps occasionally makes pages readable.

@Ruggy, can you upload a WARC that exhibits this behavior on replay?