Pinned Repositories
4mc
4mc - splittable lz4 and zstd in hadoop/spark/flink
aircompressor
A port of Snappy, LZO, LZ4, and Zstandard to Java
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
alluxio
Alluxio, data orchestration for analytics and machine learning in the cloud
angel
A Flexible and Powerful Parameter Server for large-scale machine learning
antlr4
ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files.
arctic
Arctic is a streaming lake warehouse service open sourced by NetEase
ArcticDB
ArcticDB is a high performance, serverless DataFrame database built for the Python Data Science ecosystem.
aresdb
A GPU-powered real-time analytics storage and query engine.
arrow
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
kkxiaotikk's Repositories
kkxiaotikk/arctic
Arctic is a streaming lake warehouse service open sourced by NetEase
kkxiaotikk/asterixdb
Mirror of Apache AsterixDB
kkxiaotikk/blaze
Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.
kkxiaotikk/calcite
Apache Calcite
kkxiaotikk/datafusion-objectstore-hdfs
HDFS based on Java implementation as a remote ObjectStore for DataFusion
kkxiaotikk/datafusion-substrait
Experimental support for serializing DataFusion plans using substrait
kkxiaotikk/dbt-core
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
kkxiaotikk/delta-rs
A native Rust library for Delta Lake, with bindings into Python and Ruby.
kkxiaotikk/dgraph
Native GraphQL Database with graph backend
kkxiaotikk/euler
A distributed graph deep learning framework.
kkxiaotikk/feature-engineering-and-feature-selection
A Guide for Feature Engineering and Feature Selection, with implementations and examples in Python.
kkxiaotikk/flink-table-store
An Apache Flink subproject to provide storage for dynamic tables.
kkxiaotikk/fs-hdfs
kkxiaotikk/gluten
kkxiaotikk/gpdb
Greenplum Database - Massively Parallel PostgreSQL for Analytics. An open-source massively parallel data platform for analytics, machine learning and AI.
kkxiaotikk/gporca
A modular query optimizer for big data
kkxiaotikk/influxdb_iox
Pronounced (influxdb eye-ox), short for iron oxide. This is the new core of InfluxDB written in Rust on top of Apache Arrow.
kkxiaotikk/javalin
A simple and modern Java and Kotlin web framework
kkxiaotikk/LakeSoul
A Table Structure Storage to Unify Batch and Streaming Data Processing
kkxiaotikk/librdkafka
The Apache Kafka C/C++ library
kkxiaotikk/mysql-connector-cpp
MySQL Connector/C++ is a MySQL database connector for C++. It lets you develop C++ and C applications that connect to MySQL Server.
kkxiaotikk/neon
Neon: Serverless Postgres. We separated storage and compute to offer autoscaling, branching, and bottomless storage.
kkxiaotikk/opendal
OpenDAL: Access data freely, painlessly, and efficiently
kkxiaotikk/postgres-archive
kkxiaotikk/roapi
Create full-fledged APIs for slowly moving datasets without writing a single line of code.
kkxiaotikk/serde
Serialization framework for Rust
kkxiaotikk/sqlparser-rs
Extensible SQL Lexer and Parser for Rust
kkxiaotikk/stonedb
StoneDB is an open-source, MySQL HTAP and MySQL-native database for oltp, real-time analytics
kkxiaotikk/thrift
Apache Thrift
kkxiaotikk/tokio
A runtime for writing reliable asynchronous applications with Rust. Provides I/O, networking, scheduling, timers, ...