Selectolax couldn't load large html string (87MB) but lxml could

Question

Closed this issue 8 months ago · 3 comments

In my scraper, I am dealing with large HTML strings, and I have run into an issue: selectolax is not able to load my HTML string, which is about 87 MB in size. I tried lxml, and it loaded the same string in about 2 seconds.
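For anyone who wants to reproduce this, here is a rough sketch that builds a synthetic HTML string of a chosen size and times whichever of the two parsers is installed. The helper names (`make_html`, `available_parsers`, `time_parsers`) and the filler markup are made up for illustration; the real document in my scraper looks different, so timings and failures may not match.

```python
import time
from typing import Callable, Dict


def make_html(target_bytes: int) -> str:
    """Build a synthetic HTML document of roughly target_bytes characters."""
    row = "<p class='row'>some filler text for the parser</p>\n"
    repeats = max(1, target_bytes // len(row))
    return "<html><body>\n" + row * repeats + "</body></html>"


def available_parsers() -> Dict[str, Callable[[str], object]]:
    """Return the parsers that can be imported in this environment."""
    parsers: Dict[str, Callable[[str], object]] = {}
    try:
        from selectolax.parser import HTMLParser
        parsers["selectolax"] = HTMLParser
    except ImportError:
        pass  # selectolax is not installed; skip it
    try:
        import lxml.html
        parsers["lxml"] = lxml.html.fromstring
    except ImportError:
        pass  # lxml is not installed; skip it
    return parsers


def time_parsers(html: str) -> Dict[str, str]:
    """Parse html with each available parser and record time or failure."""
    results: Dict[str, str] = {}
    for name, parse in available_parsers().items():
        start = time.perf_counter()
        try:
            parse(html)
            results[name] = f"ok in {time.perf_counter() - start:.2f}s"
        except Exception as exc:  # report whatever the parser raises
            results[name] = f"failed: {exc!r}"
    return results
```

Calling `time_parsers(make_html(87 * 1024 * 1024))` and printing the result should show, per parser, either the elapsed time or the exception raised. Note that this holds the full string in memory, plus whatever each parser allocates.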

Answer 1 · 2024-01-26T13:39:08.000Z

That limit is artificial; I've increased it. Some people tried to load 5000 MB of binary data by accident and complained about it in the past.

Answer 2 · 2024-01-27T05:09:49.000Z

The most recent version can now accept up to 2.4 GB of HTML. It will be on PyPI in a few hours.

Answer 3 · 2024-01-30T13:18:53.000Z

I am still getting this error, even with the update.
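In case it helps narrow this down, a small sketch for confirming which selectolax version the interpreter actually sees and how large the input is in bytes. The helper names are hypothetical, and this only uses the standard library's `importlib.metadata`; it assumes the package is installed under the distribution name `selectolax`.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


def describe_input(html: str) -> str:
    """Report the UTF-8 encoded size of the HTML string in MB."""
    size_mb = len(html.encode("utf-8")) / (1024 * 1024)
    return f"{size_mb:.1f} MB"
```

Printing `installed_version("selectolax")` next to `describe_input(html)` in the same environment that runs the scraper would show whether the upgrade actually took effect there.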