Pinned Repositories
AnomalyDetection
Anomaly Detection with R
autoscale
Master thesis work of Andreas Baakind - University of Oslo
awesome-analytics
A curated list of analytics frameworks, software and other tools.
awesome-bigdata
A curated list of awesome big data frameworks, ressources and other awesomeness.
awesome-public-datasets
An awesome list of (large-scale) public datasets on the Internet. (On-going collection)
awesome-selfhosted
A list of Free Software network services and web applications which can be hosted locally. Selfhosting is the process of hosting and managing applications instead of renting from Software-as-a-Service providers
BreakoutDetection
Breakout Detection via Robust E-Statistics
okcoin_client
A python trade client for OKCoin.com. OKCoin比特币交易平台的Python客户端,支持下单交易,取消下单,查看账户信息,查看市场深度等OKCoin平台API提供的功能。
spark-csv
CSV data source for Spark SQL and DataFrames
gchen's Repositories
gchen/awesome-bigdata
A curated list of awesome big data frameworks, ressources and other awesomeness.
gchen/awesome-public-datasets
An awesome list of (large-scale) public datasets on the Internet. (On-going collection)
gchen/spark-csv
CSV data source for Spark SQL and DataFrames
gchen/AnomalyDetection
Anomaly Detection with R
gchen/awesome-selfhosted
A list of Free Software network services and web applications which can be hosted locally. Selfhosting is the process of hosting and managing applications instead of renting from Software-as-a-Service providers
gchen/BreakoutDetection
Breakout Detection via Robust E-Statistics
gchen/cassandra
Mirror of Apache Cassandra
gchen/CassandraPerformanceTests
A variety of tests showing good patterns and anti-patterns
gchen/datastax-spark-streaming-demo
Counting Twitter hashtags using Spark Streaming and Cassandra
gchen/deep-learning-from-scratch
『ゼロから作る Deep Learning』のリポジトリ
gchen/druid-io.github.io
Druid Project Website
gchen/Eagle
Apache Eagle - Secure Hadoop in Real Time
gchen/fourinone
Automatically exported from code.google.com/p/fourinone
gchen/incubator-eagle
Mirror of Apache Eagle (Incubating)
gchen/incubator-zeppelin
Mirror of Apache Zeppelin (Incubating)
gchen/kafka-spark-consumer
gchen/kafka-storm-starter
Code examples that show to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streaming 1.1+, while using Apache Avro as the data serialization format.
gchen/killrvideo-sample-schema
Sample Cassandra CQL Schema.
gchen/killrweather
KillrWeather is a reference application (in progress) showing how to easily leverage and integrate Apache Spark, Apache Cassandra, and Apache Kafka for fast, streaming computations on time series data in asynchronous Akka event-driven environments.
gchen/learning-spark
Example code from Learning Spark book
gchen/pyspark-cassandra
PySpark Cassandra brings back the fun in working with Cassandra data in PySpark.
gchen/scala.vim
scala.vim
gchen/sirius
Speech and Vision Based Intelligent Personal Assistant
gchen/spark
Mirror of Apache Spark
gchen/spark-cassandra-connector
If you write a Spark application that needs access to Cassandra, this library is for you
gchen/spark-notebook
Use Apache Spark straight from the Browser
gchen/spark-on-cassandra-quickstart
Spark on Cassandra QuickStart Project
gchen/SparkInternals
Notes talking about the design and implementation of Apache Spark
gchen/spas
Spas: Lamda Architecture based on Spark and Cassandra.
gchen/streamDM
Stream Data Mining Library for Spark Streaming