Pinned Repositories
arrow
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
spark
Apache Spark - A unified analytics engine for large-scale data processing
zeppelin
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
incubator-zeppelin
Mirror of Apache Zeppelin (Incubating)
lightning-scala
Scala client for the Lightning data visualization server (WIP)
spark-ml-streaming
Visualize streaming machine learning in Spark
spark-notebook-examples
Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin
vagrant-projects
Vagrant projects for various use-cases with Spark, Zeppelin, IPython / Jupyter, SparkR
graphframes
mooc-setup
Information for setting up for the BerkeleyX Spark Intro MOOC, and lab assignments for the course
felixcheung's Repositories
felixcheung/spark-notebook-examples
Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin
felixcheung/vagrant-projects
Vagrant projects for various use-cases with Spark, Zeppelin, IPython / Jupyter, SparkR
felixcheung/spark-ml-streaming
Visualize streaming machine learning in Spark
felixcheung/lightning-scala
Scala client for the Lightning data visualization server (WIP)
felixcheung/incubator-zeppelin
Mirror of Apache Zeppelin (Incubating)
felixcheung/spark-k8s
Apache Spark to run on Kubernetes
felixcheung/spark-build
Build Apache Spark
felixcheung/tensorframes
Tensorflow wrapper for DataFrames on Apache Spark
felixcheung/zeppelin
Zeppelin is data analytics environment
felixcheung/alluxio
Alluxio, formerly Tachyon, A Virtual Distributed Storage at Memory Speed
felixcheung/ambari
Mirror of Apache Ambari
felixcheung/bigtop
Mirror of Apache Bigtop
felixcheung/bigtop-build
Script and tools to build with Apache Bigtop
felixcheung/dataproc-initialization-actions
Run in all nodes of your cluster before the cluster starts - let's you customize your cluster
felixcheung/dotfiles
felixcheung/felixcheung.github.io
felixcheung/flink
Mirror of Apache Flink
felixcheung/graphframes
felixcheung/incubator-airflow
Apache Airflow (Incubating)
felixcheung/incubator-gearpump
Mirror of Apache Gearpump (Incubating)
felixcheung/incubator-predictionio
PredictionIO, a machine learning server for developers and ML engineers. Built on Apache Spark, HBase and Spray.
felixcheung/kubernetes-kafka
Kafka cluster as Kubernetes StatefulSet, and as simple static setup while PetSet is alpha
felixcheung/lightning
Data Visualization Server
felixcheung/presto-hdinsight
Presto on Azure HDInsight
felixcheung/spark
Mirror of Apache Spark
felixcheung/spark-website
Mirror of Apache Spark Website
felixcheung/SparkR-pkg
R frontend for Spark
felixcheung/sparkrrr
felixcheung/test_helper