r-rudra/tidycells

Create Example Data on which read_cells works (as expected)

bedantaguru opened this issue · 6 comments

The algorithm currently works only on certain types of data. It does not yet render the expected output on several well-known datasets. Add support for those.

Supported as of now (CRAN v0.2.1).

Add the example given in nacnudus/unpivotr#31, also mentioned here

However, it will never be possible to support this type of data directly. Instead, explore ways to work around it.

Many economic indicators in ABS have been migrated to SDMX-ready structures.
Example :
https://www.abs.gov.au/AUSSTATS/abs@.nsf/DetailsPage/5206.0Sep%202019?OpenDocument

However, there are still many tables of types similar to those given in {tidyABS}.
Example :
https://www.abs.gov.au/AUSSTATS/abs@.nsf/DetailsPage/6224.0.55.001June%202019?OpenDocument

Just checked

unpivotr\inst\extdata\worked-examples.xlsx@pivot-centre-aligned:G8 -->
This cell cannot be mapped correctly, because the line separator plays a major role here.
Some method may need to be deployed to tweak the automatic understanding.

The following is similar to the ABS datasets.

unpivotr\inst\extdata\worked-examples.xlsx@pivot-hierarchy

These [somehow] work.
However, a heuristic for uniform block patterns may be needed.
unpivotr\inst\extdata\worked-examples.xlsx@

  • pivot-repeated-headers
  • pivot-header-within-data
  • notes
  • transposed
  • pivot-annotations

These are not working properly

  • small-multiples (mainly for corner join cases)
  • implied-multiples (not sure why a few of them are getting emptied; need to check!)