sapser's Stars
StarRocks/starrocks
The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.
apache/flink-cdc
Flink CDC is a streaming data integration tool
apache/iceberg
Apache Iceberg
haileys/mrb-rs
Safe, low level mruby bindings for Rust
taosdata/TDengine
High-performance, scalable time-series database designed for Industrial IoT (IIoT) scenarios
trinodb/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
pingcap/tidb
TiDB is an open-source, cloud-native, distributed, MySQL-Compatible database for elastic scale and real-time analytics. Try AI-powered Chat2Query free at : https://www.pingcap.com/tidb-serverless/
apache/calcite
Apache Calcite
hortonworks-spark/spark-atlas-connector
A Spark Atlas connector to track data lineage in Apache Atlas
analysys/presto-hbase-connector
presto hbase connector 组件基于Presto Connector接口规范实现,用来给Presto增加查询HBase的功能。相比其他开源版本的HBase Connector,我们的性能要快10到100倍以上。
hanxt/boot-launch
spring boot 2.x 课程代码
apache/linkis
Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.
WeBankFinTech/Scriptis
Scriptis is for interactive data analysis with script development(SQL, Pyspark, HiveQL), task submission(Spark, Hive), UDF, function, resource management and intelligent diagnosis.
apache/dolphinscheduler
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
crawlab-team/crawlab
Distributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台,支持任何语言和框架
prestodb/presto
The official home of the Presto distributed SQL query engine for big data
mantoudev/atlas_cn
Atlas官方文档中文版
996icu/996.ICU
Repo for counting stars and contributing. Press F to pay respect to glorious developers.
Micropoor/Micro8
Gitbook
Tencent/APIJSON
🏆 实时 零代码、全功能、强安全 ORM 库 🚀 后端接口和文档零代码,前端(客户端) 定制返回 JSON 的数据和结构 🏆 Real-Time coding-free, powerful and secure ORM 🚀 providing APIs and Docs without coding by Backend, and the returned JSON of API can be customized by Frontend(Client) users
dromara/hutool
🍬A set of tools that keep Java sweet.
apache/zeppelin
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
apachecn/spark-doc-zh
Apache Spark 官方文档中文版
keepfool/vue-tutorials
Let you insight into the Vue.js
yourtion/DataminingGuideBook-Codes
《面向程序员的数据挖掘指南》源码
alibaba/tengine
A distribution of Nginx with some advanced features
cloudera/impyla
Python DB API 2.0 client for Impala and Hive (HiveServer2 protocol)
coleifer/peewee
a small, expressive orm -- supports postgresql, mysql, sqlite and cockroachdb
nginxinc/nginx-ldap-auth
Example of LDAP authentication using ngx_http_auth_request_module
lustlost/ubackup
此系统解决游族2w+个数据库实例,日均大概40w+个备份文件,40TB+数据量(包括mysql,redis,ssdb)的异地灾备