apache-parquet
There are 39 repositories under apache-parquet topic.
aws/aws-sdk-pandas
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
polarsignals/frostdb
❄️ Coolest database around 🧊 Embeddable column database written in Go.
opengeospatial/geoparquet
Specification for storing geospatial vector data (point, line, polygon) in Parquet
mukunku/ParquetViewer
Simple Windows desktop application for viewing & querying Apache Parquet files
visgl/loaders.gl
Loaders for big data visualization. Website:
developmentseed/lonboard
A Python library for fast, interactive geospatial vector data visualization in Jupyter.
aloneguid/parquet-dotnet
Fully managed Apache Parquet implementation
kylebarron/parquet-wasm
Rust-based WebAssembly bindings to read and write Apache Parquet data
parquet-go/parquet-go
High-performance Go package to read and write Parquet files
cldellow/sqlite-parquet-vtable
A SQLite vtable extension to read Parquet files
G-Research/ParquetSharp
ParquetSharp is a .NET library for reading and writing Apache Parquet files.
google/space
Unified storage framework for the entire machine learning lifecycle
commoncrawl/cc-index-table
Index Common Crawl archives in tabular format
duo-rs/duo
A lightweight Logging and Tracing observability solution for Rust, built with Apache Arrow, Apache Parquet and Apache DataFusion.
codename-hub/php-parquet
PHP implementation for reading and writing Apache Parquet files/streams
cldellow/csv2parquet
Convert a CSV to a parquet file.
grouzen/zio-apache-parquet
Scala ZIO-powered Apache Parquet library
contactsunny/Parquet_File_Writer_POC
This is a simple Java POC to create Parquet files This is a Spring Boot project.
kat-co/cl-apache-arrow
This is a library for working with Apache Arrow and Parquet data.
Jocoon/php-parquet
PHP implementation for reading and writing Apache Parquet files/streams. NOTICE: Please migrate to https://github.com/codename-hub/php-parquet.
rvilla87/ETL-PySpark
ETL (Extract, Transform and Load) with the Spark Python API (PySpark) and Hadoop Distributed File System (HDFS)
renesugar/FileConvert
Converts between file formats such as CSV and Parquet
cldellow/parquet-metadata
Dump metadata about a Parquet file.
dantrim/parquet-writer
A C++ library for easily writing Parquet files containing columns of (mostly) any type you wish.
firelink-data/evolution
Efficiently evolve your old fixed-length data files into modern file formats.
spaghettifunk/norman
Realtime distributed OLAP datastore, designed to answer OLAP queries with low latency written in Go. In Active development
marwan116/aws-parquet
a toolkit that provides an object-oriented interface for working with parquet datasets on AWS
adriens/endoflife-date-snapshots
Daily consolidated and enriched snapshots of endoflife.date
DevGlitch/AWS_Lambda_Postgres2Parquet
Streamline Amazon RDS PostgreSQL to Parquet conversion via AWS Lambda and GitHub Actions for effortless S3 storage.
sagarparikh2013/Amazon-Product-Reviews-Exploratory-Analysis
Exploratory Analysis of Amazon Product Reviews Dataset comprising of various categories spanning over 14 years
banadiga/access-control-reporting-system
Access control reporting system
hperer02/Credit-risk-model
Discover a comprehensive approach to constructing credit risk models. We employ various machine learning algorithms like LightGBM and CatBoost, alongside ensemble techniques for robust predictions. Our pipeline emphasizes data integrity, feature relevance, and model stability, crucial elements in credit risk assessment.
NarayanSchuetz/edf2parquet
Simple utility package to convert EDF/EDF+ files into Apache Parquet format.
Srking501/csc8101_coursework
A summative coursework for CSC8101 Engineering for AI
amoeba/arrow-cpp-examples
Various Arrow C++ examples
tdiprima/arrow-and-parquet
Exploring Apache Arrow and Apache Parquet