Pinned Repositories
aas
Code to accompany Advanced Analytics with Spark from O'Reilly Media
ace
Mirror of Apache ACE (incubating)
alienvault-ossim
Alienvault ossim
alloy-ui
AlloyUI is a framework built on top of YUI3 (JavaScript) that uses Bootstrap 3 (HTML/CSS) to provide a simple API for building high scalable applications
ambari
Mirror of Apache Ambari
ambari-grafana
Integrate Grafana with Ambari Metrics System
flamingo2
Flamingo Big Data Platform
netdata
Real-time performance monitoring, done right!
ostinato
Ostinato - Packet/Traffic Generator and Analyzer
xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Flink and DataFlow
cccnam5158's Repositories
cccnam5158/xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Flink and DataFlow
cccnam5158/aas
Code to accompany Advanced Analytics with Spark from O'Reilly Media
cccnam5158/awesome
:sunglasses: Curated list of awesome lists
cccnam5158/awesome-datascience
:memo: An awesome Data Science repository to learn and apply for real world problems.
cccnam5158/beaker-notebook
Web-based, polyglot research platform.
cccnam5158/beamexample
An example Apache Beam project.
cccnam5158/beats
:tropical_fish: Beats - Lightweight shippers for Elasticsearch & Logstash
cccnam5158/cdap
Cask Data Application Platform (CDAP)
cccnam5158/data-prep
OS code of Data-prep project
cccnam5158/drunken-data-quality
Spark package for checking data quality
cccnam5158/eclairjs-node
Node.js API for Apache Spark with Remote Client
cccnam5158/elephas
Distributed Deep learning with Keras & Spark
cccnam5158/flamingo-analytics
Integrated Web Service for Log Analytics
cccnam5158/h2o-3
Open Source Fast Scalable Machine Learning API For Smarter Applications (Deep Learning, Gradient Boosting, Random Forest, Generalized Linear Modeling (Logistic Regression, Elastic Net), K-Means, PCA, Stacked Ensembles...)
cccnam5158/incubator-beam
Mirror of Apache Beam (Incubating)
cccnam5158/incubator-toree
Mirror of Apache Toree (Incubating)
cccnam5158/JSAT
Java Statistical Analysis Tool, a Java library for Machine Learning
cccnam5158/kylo
Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on Apache Hadoop and Spark. Kylo is licensed under Apache 2.0 and contributed by Think Big, A Teradata Company
cccnam5158/nifi
Mirror of Apache NiFi
cccnam5158/OpenRefine
OpenRefine is a free, open source power tool for working with messy data and improving it
cccnam5158/pleaserun
An attempt to abstract this "init" script madness.
cccnam5158/prometheus
The Prometheus monitoring system and time series database.
cccnam5158/quasar
A NoSQL analytics engine that embraces post-relational analytics and pushes computation to the data.
cccnam5158/slamdata
The web-based front-end for SlamData.
cccnam5158/spark
Mirror of Apache Spark
cccnam5158/spark-notebook
Interactive and Reactive Data Science using Scala and Spark.
cccnam5158/spring-cloud-dataflow
Spring Cloud Data Flow provides orchestration for data microservices, including both stream and task processing
cccnam5158/spring-flo
JavaScript angular based embeddable graphical component for pipeline/graph building and editing
cccnam5158/streamium
Decentralized trustless video streaming using bitcoin payment channels.
cccnam5158/Sysmon
A lightweight platform monitoring tool for Java VMs