r-rudra/tidycells

Extend Support for other type of files

Opened this issue · 2 comments

Once read_cells supports reading data as it is (see #7), extend it's the support for other types of files and objects.

  • If possible or required, depend on wand. For magic number based file type detection.
  • Support for all types mentioned in Data Import Cheat Sheet - by RStudio
  • xls : Already supported through readxl and xlsx
  • xlsx : Already supported through tidyxl
  • doc : Already supported through docxtractr but need a better alternative.
  • docx : Already supported through docxtractr
  • pdf : Already supported through tabulizer but need either better alternative or more clear guide to the user.
  • html, xml : Already supported through XML possibly need more clear guide and examples
  • All kind of delimited files : Few supported through read.csv and readr. Need to support other possiblities.
  • All haven types : Check all possible types under haven
    • SPSS
    • Stata
    • SAS

wand dependency is removed. Can be made optional to users.

See this --> the pakcage {rio}

Please add this to related project list.