Pinned Repositories
akela
A bunch of utility classes for Java, Hadoop, HBase, Pig, etc.
asynchbase
A fully asynchronous, non-blocking, thread-safe, high-performance HBase client.
avro-mr
Avro format for MapReduce
cassandra
Mirror of Apache Cassandra (incubating)
datafu
Hadoop library for large-scale data processing
elasticsearch
Open Source, Distributed, RESTful Search Engine
feathers
Java classes that can be useful for Dumbo programs that run on Hadoop Streaming.
finagle
A fault tolerant, protocol-agnostic RPC system
flume
WE HAVE MOVED to Apache Incubator. https://cwiki.apache.org/FLUME/ . Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data. It has a simple and flexible architecture based on streaming data flows. It is robust and fault tolerant with tunable reliability mechanisms and many failover and recovery mechanisms. The system is centrally managed and allows for intelligent dynamic management. It uses a simple extensible data model that allows for online analytic applications.
olap4cloud
HBase-based OLAP engine
yuanke's Repositories
yuanke/avro-mr
Avro format for MapReduce
yuanke/parquet-mr
Column storage on Hadoop
yuanke/brickhouse
Hive UDFs for the data warehouse
yuanke/cascading
Cascading is a feature rich API for defining and executing complex and fault tolerant data processing flows on a Hadoop cluster. See https://github.com/Cascading/cascading for the release repository.
yuanke/cn-clojure-meetup
cn-clojure meetup materials
yuanke/curator
ZooKeeper client wrapper and rich ZooKeeper framework
yuanke/discovery
yuanke/exchange
Play Framework + Cassandra + Astyanax + D3.js for viewing Euro exchange rates from the ECB
yuanke/Exchange-Simulator
An exchange system simulator that simulates basic performance
yuanke/HA-Monitor
yuanke/hazelcast
Open Source In-Memory Data Grid
yuanke/hazelcast-book-examples
yuanke/hbase_demos
HBase demonstration code as an addendum to the presentation
yuanke/IIS-0916043
Ivory: A Hadoop Toolkit for Distributed Text Retrieval
yuanke/IIS-1218043
Providing Relevant and Timely Results: Real-Time Search Architectures and Relevance Algorithms
yuanke/Mr.LDA
Scalable Topic Modeling using Variational Inference in MapReduce
yuanke/mupd8
Muppet
yuanke/netty
Netty project - an event-driven asynchronous network application framework
yuanke/NovaOrc
yuanke/pangool
Tuple MapReduce for Hadoop: Hadoop API made easy
yuanke/pangool-bootstrap
Example project for building apps with Pangool
yuanke/pangool-flow
Pangool-Flow is an experimental module on top of Pangool (http://pangool.net) which adds automatic flow building and management, parallel execution and high-level constructs.
yuanke/range-benchmark
yuanke/scala-meetup-spray
Spray example for the Scala Developers Barcelona meetup
yuanke/SparkFirst
A first Spark application written with sbt and Scala
yuanke/sparrow
Sparrow scheduling platform (U.C. Berkeley).
yuanke/storm-yarn
Storm for Yarn
yuanke/Turbine
Low latency high throughput aggregator for real time streams
yuanke/WeiboMsgBackupGUI
Weibo backup tool by 爬盟: backs up all posts of a specified Sina Weibo user
yuanke/zeno
Netflix's In-Memory Data Propagation Framework