commoncrawl/ia-web-commons

[WAT] Add rel attribute to A@/href links

Closed this issue · 1 comments

The rel attribute isn't extracted for A (and AREA) hyperlinks. The link types specified are useful, e.g., nofollow. Also check whether other attributes are worth to be extracted.

Included in August crawl (CC-MAIN-2017-34).