tautologistics/node-htmlparser

blog.kickstarter.com is not parsed correctly. CDATA tags?

kirbysayshi opened this issue · 1 comment

http://blog.kickstarter.com/post/5770516169/new-projects-are-rolling-dice

I can't tell what's causing it, but the HTML gets parsed into approximately 16 root elements instead of the expected 2 (doctype, html). It could be the CDATA sections, or the combination of crazy Tumblr-injected content near the bottom of the page.
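
For anyone trying to reproduce this, a rough way to see where the extra root elements come from is to list the top-level nodes of the parsed DOM. This is only a sketch: `summarizeRoots` is a hypothetical helper, and it assumes the DOM is an array of nodes with `type`, `name`, and `data` fields, as produced by the library's `DefaultHandler`. The commented-out part at the end shows how it might be wired to the parser, with `rawHtml` standing in for the fetched page source.

```javascript
// Hypothetical helper: describe each top-level node of a parsed DOM array.
// Assumes nodes carry `type`, and either `name` (tags, directives) or `data`.
function summarizeRoots(dom) {
  return dom.map(function (node, i) {
    // Fall back to the first 30 characters of `data` for text/comment nodes.
    var label = node.name || String(node.data || "").slice(0, 30);
    return i + ": " + node.type + " " + label;
  });
}

// Hand-built stand-in for the expected result: a doctype directive and <html>.
var expected = [
  { type: "directive", name: "!DOCTYPE", data: "!DOCTYPE html" },
  { type: "tag", name: "html", children: [] }
];
console.log(summarizeRoots(expected).join("\n"));

// With the library installed, the real page could be inspected the same way
// (rawHtml is assumed to hold the page source):
//   var htmlparser = require("htmlparser");
//   var handler = new htmlparser.DefaultHandler(function (err, dom) {
//     if (err) throw err;
//     console.log(summarizeRoots(dom).join("\n"));
//   });
//   new htmlparser.Parser(handler).parseComplete(rawHtml);
```

If the listing for the real page shows many more entries than the doctype and `html`, the first unexpected entry should point to roughly where the tree gets split.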

I believe this was due to modifications I had made, similar to those in #20. When I implemented my changes from #24, the problem went away.