Pinned Repositories
incubator-gluten
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
spark
Apache Spark - A unified analytics engine for large-scale data processing
velox
A composable and fully extensible C++ execution engine library for data management systems.
angel
A Flexible and Powerful Parameter Server for large-scale machine learning
antlr4
ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files.
arrow
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
cs-self-learning
A self-study guide to computer science
folly
An open-source C++ library developed and used at Facebook.
gazelle_plugin
Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.
presto
The official home of the Presto distributed SQL query engine for big data
kevincmchen's Repositories
kevincmchen/angel
A Flexible and Powerful Parameter Server for large-scale machine learning
kevincmchen/antlr4
ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files.
kevincmchen/arrow
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
kevincmchen/cs-self-learning
A self-study guide to computer science
kevincmchen/folly
An open-source C++ library developed and used at Facebook.
kevincmchen/gazelle_plugin
Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.
kevincmchen/hadoop-cos
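hadoop-cos provides integration support for big data computing frameworks such as Apache Hadoop, Spark, and Tez, letting them read and write data stored on Tencent Cloud COS just as they would access HDFS. It can also serve as Deep Storage for query and analytics engines such as Druid.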
kevincmchen/hive
Apache Hive
kevincmchen/hive-testbench
kevincmchen/incubator-celeborn
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
kevincmchen/linux-command
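A search tool for a comprehensive collection of Linux commands, covering command manuals, detailed explanations, learning material, and collected references. https://git.io/linux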
kevincmchen/presto
The official home of the Presto distributed SQL query engine for big data
kevincmchen/incubator-gluten
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
kevincmchen/nimble
New file format for storage of large columnar datasets.
kevincmchen/olap-performance
OLAP Database Performance Tuning Guide
kevincmchen/ray
An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
kevincmchen/remote-jobs
A list of semi to fully remote-friendly companies (jobs) in tech.
kevincmchen/spark
Apache Spark - A unified analytics engine for large-scale data processing
kevincmchen/velox
A C++ vectorized database acceleration library aimed at optimizing query engines and data processing systems.