takashiyonebayashi's Stars
apache/kafka
Mirror of Apache Kafka
apache/zookeeper
Apache ZooKeeper
apache/beam
Apache Beam is a unified programming model for Batch and Streaming data processing.
apache/parquet-java
Apache Parquet Java
cwensel/cascading
Cascading is a feature rich API for defining and executing complex and fault tolerant data processing flows locally or on a cluster.
audienceproject/spark-dynamodb
Plug-and-play implementation of an Apache Spark custom data source for AWS DynamoDB.
LiveRamp/hank
(DEPRECATED. This project is no longer used or maintained at LiveRamp.) Hank is a high performance distributed key-value NoSQL database that we built and use at LiveRamp. It is designed for very large data stores that dwarf the amount of available main memory and for randomly distributed read/write workloads that far exceed the capacity of memory-based caches. More specifically, it is optimized for very low latency random read queries and for very high throughput incremental batch writes.
traviscrawford/spark-dynamodb
DynamoDB data source for Apache Spark
LiveRamp/jack
A set of scripts for generating fully functional Java database models from Ruby's ActiveRecord models and migrations.
LiveRamp/cascading_ext
cascading_ext is a collection of tools built on top of the Cascading platform which make it easy to build, debug, and run simple and high-performance data workflows.
LiveRampArchive/megadesk
Megadesk is a lightweight framework to synchronize and execute distributed workflows and manage corresponding steps and resources. It is built on top of ZooKeeper.
takashiyonebayashi/ramhdfs
An implementation of the FileSystem API in Hadoop that stores everything in memory.
takashiyonebayashi/jack
A set of scripts for generating fully functional Java database models from Ruby's ActiveRecord models and migrations.