chengat1314
I am a Data geek, working on big data analytics. Data Engineering + Data Science
GrabSingapore
Pinned Repositories
2018.scalamatsuri.org
ScalaMatsuri 2018 のウェブサイト http://2018.scalamatsuri.org
android_sdk
This is the Android SDK of
awesome-engineering-team-management
👔 How to transition from software development to engineering management
awesome-opensource-data-engineering
An Awesome List of Open-Source Data Engineering Projects
aws-glue-samples
AWS Glue code samples
AWS_P3_Workshop
charts
Curated applications for Kubernetes
CoolplaySpark
酷玩 Spark: Spark 源代码解析、Spark 类库等
ekaf
A minimal, high-performance Kafka client in Erlang.
nus_modules
nus_modules_code
chengat1314's Repositories
chengat1314/amazon-dsstne
Deep Scalable Sparse Tensor Network Engine (DSSTNE) is an Amazon developed library for building Deep Learning (DL) machine learning (ML) models
chengat1314/folium
Python Data. Leaflet.js Maps.
chengat1314/DataX
DataX 是阿里巴巴集团内被广泛使用的离线数据同步工具/平台,实现包括 MySQL、Oracle、HDFS、Hive、OceanBase、HBase、OTS、ODPS 等各种异构数据源之间高效的数据同步功能。
chengat1314/magellan
Geo Spatial Data Analytics on Spark
chengat1314/PredictionIO
PredictionIO, a machine learning server for developers and ML engineers. Built on Apache Spark, HBase and Spray.
chengat1314/skill-map
StuQ 技能图谱
chengat1314/zeppelin-notebooks
Gallery of Apache Zeppelin notebooks
chengat1314/sklearn_tutorial
Materials for my scikit-learn tutorial
chengat1314/livy
Livy is an open source REST interface for interacting with Apache Spark from anywhere
chengat1314/SparkStreamingHBaseExample
Spark Streaming HBase Example
chengat1314/nyc-taxi-data
Import public NYC taxi and Uber trip data into PostgreSQL / PostGIS database, analyze with R
chengat1314/coursera-dl
Script for downloading Coursera.org videos and naming them.
chengat1314/caravel
Caravel is a data exploration platform designed to be visual, intuitive, and interactive
chengat1314/strata_data
A repo of sample data for our PyData Tutorial!
chengat1314/CoolplaySpark
酷玩 Spark: Spark 源代码解析、Spark 类库等
chengat1314/spark_streaming_kinesis_demo
Demo code for sending data to a Kinesis stream & processing it with Spark
chengat1314/rdds-dataframes-datasets-nescala-2016
Source for "RDDs, DataFrames and Datasets in Apache Spark" NEScala presentation
chengat1314/ds-for-telco
Source material for Data Science for Telecom Tutorial at Strata Singapore 2015
chengat1314/scala-redis
A scala library for connecting to a redis server, or a cluster of redis nodes using consistent hashing on the client side.
chengat1314/postgis_talk
PostGIS Talk for Maptime Seattle
chengat1314/fingerprintjs2
Modern & flexible browser fingerprinting library, a successor to the original fingerprintjs
chengat1314/vincent
A Python to Vega translator
chengat1314/my-git
Individual collecting material of learning git(有关 git 的学习资料)
chengat1314/geohash
Python module to decode/encode Geohashes to/from latitude and longitude. See http://en.wikipedia.org/wiki/Geohash
chengat1314/theano_exercises
Exercises for my tutorials on Theano
chengat1314/kafka-connect-blog
Demo for Kafka Connect with JDBC and HDFS Connectors
chengat1314/qqzeng-ip
最新IP地址数据库-多语言解析以及导入数据库脚本
chengat1314/strata-singapore
repo for use during workshop at strata singapore http://conferences.oreilly.com/strata/big-data-conference-sg-2015/public/schedule/detail/45383
chengat1314/Impala
Real-time Query for Hadoop
chengat1314/kudu
Kudu is the engine behind git/hg deployments, WebJobs, and various other features in Azure Web Sites. It can also run outside of Azure.