Pinned Repositories
spark
Apache Spark - A unified analytics engine for large-scale data processing
arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, Java, JavaScript, Python, and Ruby.
bigdata_race
ccf-bdci2022-datalake-contest
duckdb
DuckDB is an in-process SQL OLAP Database Management System
hezuojiao.github.io
He Zuojiao's personal site
Joiner
Joiner is a research project for sql join order tuning using reinforcement learning algorithm.
spark
Apache Spark - A unified analytics engine for large-scale data processing
starrocks
StarRocks is a next-gen sub-second MPP database for full analysis scenarios, including multi-dimensional analytics, real-time analytics and ad-hoc query.
oceanbase
OceanBase is an enterprise distributed relational database with high availability, high performance, horizontal scalability, and compatibility with SQL standards.
hezuojiao's Repositories
hezuojiao/Joiner
Joiner is a research project for sql join order tuning using reinforcement learning algorithm.
hezuojiao/bigdata_race
hezuojiao/ccf-bdci2022-datalake-contest
hezuojiao/arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, Java, JavaScript, Python, and Ruby.
hezuojiao/duckdb
DuckDB is an in-process SQL OLAP Database Management System
hezuojiao/hezuojiao.github.io
He Zuojiao's personal site
hezuojiao/spark
Apache Spark - A unified analytics engine for large-scale data processing
hezuojiao/starrocks
StarRocks is a next-gen sub-second MPP database for full analysis scenarios, including multi-dimensional analytics, real-time analytics and ad-hoc query.
hezuojiao/Topn-go