webrecorder/replayweb.page

[Replay Bug]: Encoding of URL parameter &section to §ion prevents page from being displayed

Ben754444 opened this issue · 3 comments

ReplayWeb.page Version

v2.0.0

What did you expect to happen? What happened instead?

When viewing a page with "&sect" in the URL, the site will convert the "&sect" into the character §, which then errors as it cannot find the page.

image

In this image, the correct URL is shown in the page list and URL bar. The incorrect URL is shown on the error page.

Additionally, using &amp does not seem to work either despite the correct URL being shown:

image

Step-by-step reproduction instructions

  1. Archive a page with "&sect" in the URL
  2. Open in ReplayWeb and navigate to the page
  3. Observe a 404 page despite the fact that it is in the archive

Additional details

Note: this happens with basically any unicode character, for example: &#128514

Can you provide a specific example where this happens? You can email us directly if you don't want to make it public.
I tried archiving and replaying https://example.com/?a=b&section=0 using archiveweb.page and it worked fine

I was able to repro the error in the escape error in the 404 page, but not the actual loading.

Hey, I don't have access to the original archive right now sorry. From what I remember, it starts when you navigate from a page using a button on the actual site. So I assume it might be to do with href/setting the URL using JS. Once you've encountered the broken version once, it remains despite using the side panel.

Should be able to get the file on Wednesday and then I'll try send you a copy without any sensitive info in.