apache-parquet
There are 35 repositories under the apache-parquet topic.
aws/aws-sdk-pandas
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
polarsignals/frostdb
❄️ Coolest database around 🧊 Embeddable column database written in Go.
opengeospatial/geoparquet
Specification for storing geospatial vector data (point, line, polygon) in Parquet
mukunku/ParquetViewer
Simple Windows desktop application for viewing & querying Apache Parquet files
visgl/loaders.gl
Loaders for big data visualization.
aloneguid/parquet-dotnet
Fully managed Apache Parquet implementation
kylebarron/parquet-wasm
Rust-based WebAssembly bindings to read and write Apache Parquet data
developmentseed/lonboard
A Python library for fast, interactive geospatial vector data visualization in Jupyter.
cldellow/sqlite-parquet-vtable
A SQLite vtable extension to read Parquet files
G-Research/ParquetSharp
ParquetSharp is a .NET library for reading and writing Apache Parquet files.
google/space
Unified storage framework for the entire machine learning lifecycle
commoncrawl/cc-index-table
Index Common Crawl archives in tabular format
cldellow/csv2parquet
Convert a CSV to a Parquet file.
codename-hub/php-parquet
PHP implementation for reading and writing Apache Parquet files/streams
contactsunny/Parquet_File_Writer_POC
This is a simple Java POC to create Parquet files. This is a Spring Boot project.
Jocoon/php-parquet
PHP implementation for reading and writing Apache Parquet files/streams. NOTICE: Please migrate to https://github.com/codename-hub/php-parquet.
kat-co/cl-apache-arrow
This is a library for working with Apache Arrow and Parquet data.
renesugar/FileConvert
Converts between file formats such as CSV and Parquet
rvilla87/ETL-PySpark
ETL (Extract, Transform and Load) with the Spark Python API (PySpark) and Hadoop Distributed File System (HDFS)
cldellow/parquet-metadata
Dump metadata about a Parquet file.
dantrim/parquet-writer
A C++ library for easily writing Parquet files containing columns of (mostly) any type you wish.
spaghettifunk/norman
Realtime distributed OLAP datastore written in Go, designed to answer OLAP queries with low latency. In active development.
firelink-data/evolution
🦖 Efficiently evolve your old fixed-length data files into more modern file formats, fully parallelized!
marwan116/aws-parquet
A toolkit that provides an object-oriented interface for working with Parquet datasets on AWS
adriens/endoflife-date-snapshots
Daily consolidated and enriched snapshots of endoflife.date
sagarparikh2013/Amazon-Product-Reviews-Exploratory-Analysis
Exploratory Analysis of Amazon Product Reviews Dataset comprising various categories spanning over 14 years
banadiga/access-control-reporting-system
Access control reporting system
DevGlitch/AWS_Lambda_Postgres2Parquet
Streamline Amazon RDS PostgreSQL to Parquet conversion via AWS Lambda and GitHub Actions for effortless S3 storage.
NarayanSchuetz/edf2parquet
Simple utility package to convert EDF/EDF+ files into Apache Parquet format.
povstenko/parquet_convertor
🔄 Convert CSV to Parquet and explore Parquet data structure
Srking501/csc8101_coursework
A summative coursework for CSC8101 Engineering for AI
amoeba/arrow-cpp-examples
Various Arrow C++ examples
ASKRAJPUT5/multipleDestinations
Sending data to multiple locations
renesugar/capnpc-parquet
A Cap'n Proto compiler plugin to create a Parquet schema from a Cap'n Proto schema