ddkongbb's Stars
alibaba/canal
阿里巴巴 MySQL binlog 增量订阅&消费组件
wesm/pydata-book
Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney, published by O'Reilly Media
debezium/debezium
Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.
alibaba/otter
阿里巴巴分布式数据库同步系统(解决中美异地机房)
Alluxio/alluxio
Alluxio, data orchestration for analytics and machine learning in the cloud
apache/gobblin
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.
dropbox/PyHive
Python interface to Hive and Presto. 🐝
miguno/kafka-storm-starter
[PROJECT IS NO LONGER MAINTAINED] Code examples that show to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streaming 1.1+, while using Apache Avro as the data serialization format.
twitter/bijection
Reversible conversions between types
gwenshap/kafka-examples
Snippets and small examples demonstrating kafka features and configs
hortonworks-spark/shc
The Apache Spark - Apache HBase Connector is a library to support Spark accessing HBase table as external data source or sink.
nerdammer/spark-hbase-connector
Connect Spark to HBase for reading and writing data with ease
ShifuML/shifu
An end-to-end machine learning and data mining framework on Hadoop
prestodb/presto-python-client
Python DB-API client for Presto
jpmml/jpmml-model
Java Class Model API for PMML
LongJunCai/DeepDriver
DeepDriver is a JAVA framework of Deep Learning, it supports ANN/CNN/DNN/RNN/LSTM now, hope it can be widely used for deep learning development.
wypb/spark-summit-2017-SanFrancisco
spark summit 2017 SanFrancisco
Kyligence/ssb-kylin
Star Schema Benchmark Tool for Apache Kylin
Teradata/presto
Teradata Distribution of Presto -- A Distributed SQL Query Engine for Big Data
aseigneurin/kafka-sandbox
mvalleavila/Kafka-Spark-Hbase-Example
albertoRamon/Kylin
See Apache Kylin Website for a complete description
skynyrd/kafka-connect-elastic-sink
Kafka connect Elastic sink connector, with just in time index/delete behaviour.
guofei1219/BinlogAnalysis
解析Mysql binlog日志并发至Kafka
cgivre/drillworkshop
Repository for the Apache Drill Workshop
aerospike-community/aerospike-hadoop
Aerospike Hadoop Connector
Aegeaner/kafka-connector-mysql
Kafka connector for MySQL
treasure-data/presto_legacy
Distributed SQL query engine for big data
lucrussell/kafka-tools
Collection of scripts for working with Kafka
ShifuML/shifu-spark
An Alternative Spark Implementation of Shifu 'Eval' Step