ZJONSSON/parquetjs

How to specify schema while reading file from S3 bucket


How do I specify a schema while reading a file from an S3 bucket? I have an S3 object in which one of the columns is a DATE. When I read it without a schema, the DATE comes back as an integer value:

{
  VERSION_KEY: 2,
  VERSION_DESCRIPTION: 'XXX',
  EFFECTIVE_DATE: '43200000000000'
}

I am opening the reader with this method:
const reader = await parquet.ParquetReader.openS3(client, params);

@dkommineni ... Did you find a workaround for this? The number isn't a timestamp, so unless I am missing something, there is no way to turn it into a date.

Just in case someone else comes upon this: my issue stemmed from the field being an int96. That type is no longer officially part of the Parquet spec, so it is not supported by this library, which makes sense.
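
For anyone who wants to see why the number alone can't be turned into a date, here is a rough sketch of how an int96 timestamp is commonly laid out. It assumes the Impala/Hive/Spark convention: 12 bytes, of which the first 8 are little-endian nanoseconds since midnight and the last 4 are a little-endian Julian day number. The value in the original post, 43200000000000, is exactly 12 hours in nanoseconds, which would be consistent with only the time-of-day part coming through, though that is a guess. `int96ToDate` is a hypothetical helper, not part of parquetjs, and it needs the raw 12 bytes, which the reader output shown in this thread does not give you.

```javascript
// Hypothetical helper, NOT part of parquetjs: decode one raw INT96 timestamp.
// Assumed layout (Impala/Hive/Spark convention): 8 bytes little-endian
// nanoseconds since midnight, then 4 bytes little-endian Julian day number.
const JULIAN_DAY_OF_UNIX_EPOCH = 2440588; // Julian day number of 1970-01-01
const MS_PER_DAY = 86400000;

function int96ToDate(buf) {
  if (buf.length !== 12) {
    throw new RangeError('expected a 12-byte INT96 value');
  }
  const nanosOfDay = buf.readBigUInt64LE(0); // time-of-day part
  const julianDay = buf.readUInt32LE(8);     // date part
  const ms = (julianDay - JULIAN_DAY_OF_UNIX_EPOCH) * MS_PER_DAY
    + Number(nanosOfDay / 1000000n);         // nanoseconds -> milliseconds
  return new Date(ms);
}

// Usage: a made-up sample for noon on 2020-01-01 (Julian day 2458850).
const sample = Buffer.alloc(12);
sample.writeBigUInt64LE(43200000000000n, 0);
sample.writeUInt32LE(2458850, 8);
console.log(int96ToDate(sample).toISOString());
```

Without the Julian day half, the nanosecond count only says "12:00:00", so rewriting the file with a supported timestamp type is probably the more practical fix.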