Skip deleted observations in SAS7BDAT files
peiyaoli2 opened this issue · 10 comments
Hi, it seems like ReadStat also has a similar issue as pandas-dev/pandas#15963, where deleted observations are still being counted in rows & being read. Here is a sample output from ReadStat on the same file as the other issue:
readstat /Users/peiyao.li/Downloads/datetime.sas7bdat
Format: SAS data file (SAS7BDAT)
Columns: 5
Rows: 5
Table name: DATETIME
Format version: 8
Text encoding: WINDOWS-1252
Byte order: little-endian
Timestamp: 08 Apr 2017 09:51
And the screenshot from the other thread is shown that there are 3 rows and 2 deleted observations.
@evanmiller can you take a look at this? Thanks!
Hi, I would need a sample file to address this issue.
It looks like this code may help
Hi! Here is the datetime.sas7bdat used to repro this:
datetime.sas7bdat.zip
It's the same file as the pandas-dev/pandas#15963 issue
@peiyaoli2 Thanks for the link to the file. Based on the code that I linked, it looks like skipping deleted rows will be a bit of work. I'll leave this issue open, but I'm going to change it from a "bug" to an "enhancement".
@evanmiller Hi! I would like to know if there is any ETA on when this feature can be supported. Thanks!
@evanmiller Hi. My client is facing this issue and needs an immediate fix. So I'd also like to know if there is any ETA for this to be implemented. Thanks and regards.
@evanmiller Hi, sorry for bothering you as I send you again, but could you provide ETA for this to be implemented?
No ETA
@evanmiller Could you please implement this enhancement, since my customer using readstat repeatedly asks me to do that?