content is extracted twice while using a regexp in the HTMLParser on HtmlSource field
Opened this issue · 0 comments
emmanuel-keller commented
Add a regexp to HTML Parser on the htmtSource field :
(?s)(<article(?:.?)?>(.?)</article>)
The content is extracted twice