salimk/Rcrawler

Depricated Function

delta2echo opened this issue · 1 comments

I was trying to run Rcrawler with this command:

Rcrawler(Website = CrawlURL, no_cores = 2, no_conn = 2,MaxDepth = 1, DIR = storagePath,RequestsDelay = 2, Obeyrobots = TRUE,URLlenlimit = 225, Timeout = 8, Useragent = userAgent,ExtractAsText = TRUE)

and I get this output:
In process : 1..
Progress: 100.00 % : 1 parssed from 1 | Collected pages: 1 | Level: 1

  • Check INDEX dataframe variable to see crawling details
  • Collected web pages are stored in Project folder
  • Project folder name : g3journal.org-222130
  • Project folder path : F:\DataScience\school\class\Project1\Storage/g3journal.org-222130
    Warning message:
    'xml_find_one' is deprecated.
    Use 'xml_find_first' instead.
    See help("Deprecated")
danrg commented

It could be a bug with Rcrawler. Please include a small example that reproduces the problem. The above command includes some variables with unknown values, which makes it difficult for someone else to try it out.