Pinned Repositories
aas
Code to accompany Advanced Analytics with Spark from O'Reilly Media
druid-docker
Docker container running Druid.io
hbase-rdd
Spark RDD to read and write from HBase
MLlib-Clustering
this project include KMeans and GMM clustering algorithm now
moa
MOA is an open source framework for Big Data stream mining. It includes a collection of machine learning algorithms (classification, regression, clustering, outlier detection, concept drift detection and recommender systems) and tools for evaluation.
MobileCloudComputing
monaco-languageclient
NPM module to connect Monaco editor with language servers
oryx
Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
pythonTest
writing-spark-from-scratch
从零编写spark
benbenqiang's Repositories
benbenqiang/writing-spark-from-scratch
从零编写spark
benbenqiang/aas
Code to accompany Advanced Analytics with Spark from O'Reilly Media
benbenqiang/druid-docker
Docker container running Druid.io
benbenqiang/hbase-rdd
Spark RDD to read and write from HBase
benbenqiang/MLlib-Clustering
this project include KMeans and GMM clustering algorithm now
benbenqiang/moa
MOA is an open source framework for Big Data stream mining. It includes a collection of machine learning algorithms (classification, regression, clustering, outlier detection, concept drift detection and recommender systems) and tools for evaluation.
benbenqiang/MobileCloudComputing
benbenqiang/monaco-languageclient
NPM module to connect Monaco editor with language servers
benbenqiang/oryx
Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
benbenqiang/pythonTest
benbenqiang/23333
草 我要被爆破叻
benbenqiang/spark
Apache Spark - A unified analytics engine for large-scale data processing
benbenqiang/streamDM
Stream Data Mining Library for Spark Streaming
benbenqiang/xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Flink and DataFlow