skrapeit/skrape.it

[QUESTION] Charset not applied; fetched text shows question marks instead

edenhalfon opened this issue · 2 comments

I want to fetch an HTML page and parse the links and their titles. The titles are in Hebrew (an RTL language).
Instead of the real title I get "????". What am I missing here? (I tried changing the charset, but UTF-8 is usually good enough.)

Code Sample
    response {
        htmlDocument {
            // parsed Doc is available here
            a {
                withAttributeKey = "data-item"
                findAll {
                    println(it.attribute("href"))
                }
            }
        }
    }

Hey, thanks for finding this.
Could you provide an HTML snippet or even the URL you want to parse?

Hey, sure. The link is:
https://www.htzone.co.il/benefit/562/דלי-ריי/?sale_id=62
The text in the middle of the URL is Hebrew.
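
A minimal sketch of one way the "????" symptom can arise, assuming (this is not confirmed anywhere in the thread) that the text passes through a charset that cannot represent Hebrew, such as ISO-8859-1. It uses only the Kotlin/JVM standard library, not skrape.it, and `roundTrip` is a hypothetical helper name introduced for illustration:

```kotlin
import java.nio.charset.Charset
import java.nio.charset.StandardCharsets

// Hypothetical helper: encode text with a charset, then decode it with the same charset.
// On the JVM, characters the charset cannot represent are replaced with '?' while encoding.
fun roundTrip(text: String, charset: Charset): String =
    String(text.toByteArray(charset), charset)

fun main() {
    // Three Hebrew letters (dalet, lamed, yod), written as escapes so the
    // example does not depend on the source file's encoding.
    val hebrew = "\u05D3\u05DC\u05D9"

    // UTF-8 can represent Hebrew, so the text survives unchanged.
    println(roundTrip(hebrew, StandardCharsets.UTF_8) == hebrew)

    // ISO-8859-1 cannot, so each letter is replaced with a question mark.
    println(roundTrip(hebrew, StandardCharsets.ISO_8859_1) == "???")
}
```

If the same text is correct inside the program but prints as "????", the console's output encoding may be the lossy step instead of the fetch itself.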