Pinned Repositories
4mc
4mc - splittable lz4 and zstd in hadoop/spark/flink
aircompressor
A port of Snappy, LZO, LZ4, and Zstandard to Java
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
alluxio
Alluxio, data orchestration for analytics and machine learning in the cloud
angel
A Flexible and Powerful Parameter Server for large-scale machine learning
antlr4
ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files.
arctic
Arctic is a streaming lake warehouse service open sourced by NetEase
ArcticDB
ArcticDB is a high performance, serverless DataFrame database built for the Python Data Science ecosystem.
aresdb
A GPU-powered real-time analytics storage and query engine.
arrow
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
kkxiaotikk's Repositories
kkxiaotikk/ArcticDB
ArcticDB is a high performance, serverless DataFrame database built for the Python Data Science ecosystem.
kkxiaotikk/awadb
AI Native database for embedding vectors
kkxiaotikk/buck2
Build system, successor to Buck
kkxiaotikk/buf
A new way of working with Protocol Buffers.
kkxiaotikk/ceresdb
CeresDB is a high-performance, distributed, cloud native time-series database.
kkxiaotikk/cudf
cuDF - GPU DataFrame Library
kkxiaotikk/DataFrame
C++ DataFrame for statistical, Financial, and ML analysis -- in modern C++ using native types and contiguous memory storage
kkxiaotikk/drill
Apache Drill is a distributed MPP query layer for self describing data
kkxiaotikk/dvc
🦉 Data Version Control | Git for Data & Models | ML Experiments Management
kkxiaotikk/flink-learning
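flink learning blog. http://www.54tianzhisheng.cn/ Covers Flink basics, concepts, principles, hands-on practice, performance tuning, and source code analysis. Includes learning examples for Flink Connector, Metrics, Library, DataStream API, Table API & SQL, and more, plus case studies of large-scale Flink production projects (PVUV, log storage, real-time deduplication of tens of billions of records, monitoring and alerting). Your support for my column "Big Data Real-Time Computing Engine Flink: Practice and Performance Optimization" (《大数据实时计算引擎 Flink 实战与性能优化》) is welcome.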
kkxiaotikk/highway
Performance-portable, length-agnostic SIMD with runtime dispatch
kkxiaotikk/hopsworks
Hopsworks - Data-Intensive AI platform with a Feature Store
kkxiaotikk/hyrise
Hyrise is a research in-memory database.
kkxiaotikk/incubator-uniffle
Uniffle is a high performance, general purpose Remote Shuffle Service.
kkxiaotikk/junodb
JunoDB is PayPal's home-grown secure, consistent, and highly available key-value store providing low, single-digit millisecond latency at any scale.
kkxiaotikk/lance
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, with more integrations coming.
kkxiaotikk/mlrun
Machine Learning automation and tracking
kkxiaotikk/modin
Modin: Scale your Pandas workflows by changing a single line of code
kkxiaotikk/nimble
New file format for storage of large columnar datasets.
kkxiaotikk/pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
kkxiaotikk/polars
Fast multi-threaded, hybrid-out-of-core DataFrame library in Rust | Python | Node.js
kkxiaotikk/rondb
This is RonDB, a distribution of NDB Cluster developed and used by Hopsworks AB. It also contains development branches of RonDB.
kkxiaotikk/sqlalchemy
The Database Toolkit for Python
kkxiaotikk/tablesaw
Java dataframe and visualization library
kkxiaotikk/tugraph-db
TuGraph is a high performance graph database.
kkxiaotikk/ustore
Replacing MongoDB, Neo4J, and Elastic with 1 transactional database. Features: zero-copy semantics, swappable backends, bindings for C, C++, Python, Java, GoLang
kkxiaotikk/wasmer
🚀 The leading WebAssembly Runtime supporting WASI and Emscripten
kkxiaotikk/xorbits
Scalable Python data science, in an API compatible & lightning fast way.
kkxiaotikk/xtensor
C++ tensors with broadcasting and lazy computing
kkxiaotikk/ytsaurus
YTsaurus is a scalable and fault-tolerant open-source big data platform.