salimk/Rcrawler

Rcrawler is only saving internal HTML pages

Mlabrams1 opened this issue · 0 comments

When utilizing the network analysis functionality, only the internal HTML pages identified in the Index file are stored as copies. This should store a copy of all HTML pages crawled, including those in NetwIndex, correct?

Rcrawler(Website = "https://github.com/salimk/Rcrawler/issues/new", MaxDepth = 2, no_cores = 4, no_conn = 4 , NetworkData = TRUE, NetwExtLinks =TRUE, statslinks = TRUE)