michael2006a's Stars
apache/spark
Apache Spark - A unified analytics engine for large-scale data processing
apache/flink
Apache Flink
apache/beam
Apache Beam is a unified programming model for Batch and Streaming data processing.
apache/kylin
Apache Kylin
pawl/awesome-etl
A curated list of awesome ETL frameworks, libraries, and software.
liuhuanyong/TopicCluster
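A simple multi-document topic clustering implementation based on traditional K-means and LDA that achieves reasonable results. It takes multiple documents as input and outputs the keywords and corresponding texts for each topic; it can be used for topic discovery and hot-topic analysis, such as diachronic topic modeling and review profiling.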
coffeehu/CBoard-v
CBoard, Vue edition (BI dashboard platform)