bigdataxiaohan's Stars
justjavac/free-programming-books-zh_CN
:books: Free Chinese-language computer programming books; submissions welcome
0voice/interview_internal_reference
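Latest 2023 compilation of technical interview questions from Alibaba, Tencent, Baidu, Meituan, Toutiao, and others, with answers and a summary of analysis from expert question setters.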
apache/flink
Apache Flink
datawhalechina/pumpkin-book
Detailed explanations of the formulas in "Machine Learning" (the "Watermelon Book")
taosdata/TDengine
High-performance, scalable time-series database designed for Industrial IoT (IIoT) scenarios
great-expectations/great_expectations
Always know what to expect from your data.
datahub-project/datahub
The Metadata Platform for your Data Stack
open-metadata/OpenMetadata
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
will-che/flink-recommandSystem-demo
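:helicopter::rocket: A real-time product recommendation system built on Flink. Flink computes product popularity and stores it in a Redis cache, analyzes log data, and writes profile tags and real-time records to HBase. When a user issues a recommendation request, the system re-ranks the popularity list according to the user profile, uses two recommendation modules (collaborative filtering and tag-based) to attach related products to each product in the newly generated list, and finally returns the new list for the user.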
chrislusf/gleam
Fast, efficient, and scalable distributed map/reduce system, DAG execution, in memory or on disk, written in pure Go, runs standalone or distributedly.
phodal/migration
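"System Refactoring and Migration Guide": a step-by-step guide to analyzing and evaluating existing systems, devising a refactoring strategy, exploring viable refactoring options, building a test safety net, refactoring the system architecture, service architecture, modules, code, and database, and guarding the architecture after refactoring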
hedengcheng/tech
programming, database, distributed system
apache/linkis
Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.
Qihoo360/Quicksql
A Flexible, Fast, Federated(3F) SQL Analysis Middleware for Multiple Data Sources
MarquezProject/marquez
Collect, aggregate, and visualize a data ecosystem's metadata
Tencent/Metis
Metis is a learnware platform in the field of AIOps.
microsoft/gctoolkit
Tool for parsing GC logs
fayson/cdhproject
Usage of the various Hadoop components, continuously updated
apache/incubator-celeborn
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
yuhaowow/GeneralConfig
All config files for code
uber/RemoteShuffleService
Remote shuffle service for Apache Spark to store shuffle data on remote servers.
linkedin/transport
A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Apache Hive, and Presto.
Tencent/Firestorm
Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shuffle data on remote servers
51nb/marble
A high performance in-memory hive sql engine based on Apache Calcite
facebookarchive/hive-io-experimental
Hive I/O Library
18113996630/springboot-spark
Builds a REST API with Spring Boot that uses SparkLauncher to submit Spark jobs remotely. Blog post: https://blog.csdn.net/hlp4207/article/details/100831384
xpleaf/minidubbo
A Full RPC Framework Based on Netty.
xiashuijun/cdhproject
Usage of the various Hadoop components, continuously updated
bigdataxiaohan/StreamCQL
Narcasserun/transport
A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Apache Hive, and Presto.