/ParquetReader

Parquet reader code in Java

Primary LanguageJava

Build Status

Parquet Reader

Boilerplate java code for reading Apache Parquet file without installing and configuring libraries like Apache Spark/pandas.

Helps understand

  • Basic organization of data and schema inside a parquet file
  • Use of core Apache libraries only

Work in progress

  • Can be packaged to a jar (similar to parquet-tools, but leaner) for quick inspection of parquet files
  • Exception and edge case handling