rushter/selectolax

Segmentation fault with Lexbor engine

Closed this issue · 2 comments

Hi,

I've encountered a page that causes segmentation fault when using Lexbor engine (Modest does not crash).
The page is https://www.radiokerry.ie/sport/kerry-racing-news-65-333925.

I'm using Python 3.9/3.10, selectorlax v0.3.14.

Steps to reproduce:

import requests
r = requests.get('https://www.radiokerry.ie/sport/kerry-racing-news-65-333925')
import selectolax
tree = selectolax.lexbor.LexborHTMLParser(r.text)
# tree = selectolax.parser.HTMLParser(r.text) # Modest does not crash
# tree.strip_tags(['script', 'style', 'svg'], recursive=False) # try some operations
tree.html

[1]    88411 segmentation fault  python

Kerry Racing News _ RadioKerry.ie.html.txt

Thank you for your work!

@captify-dpawlowski

Fixed in lexbor library.
Thanks for the report!

@rushter
Please, try fixed code in selectolax.

Sorry for late update, this should be fixed.