Pinned Repositories
backyard
playground, just for fun
bixo
Bixo is an open source web mining toolkit that runs as a series of Cascading pipes on top of Hadoop. By building a customized Cascading pipe assembly, you can quickly create specialized web mining applications.
bobo
faceted search engine
chartsy
Mirror of the Chartsy - an open source stock charting, screening, and trading platform built on the NetBeans Platform.
chirper
distributed twitter search engine
cleo-primer
A restful web application for real-time typeahead and autocomplete
coweb
Open Cooperative Web Framework
metrics
Capturing JVM- and application-level metrics. So you know what's going on.
play-cookbook
Source code for most of the recipes featured in the play framework cookbook
search-perf
sguo's Repositories
sguo/coweb
Open Cooperative Web Framework
sguo/metrics
Capturing JVM- and application-level metrics. So you know what's going on.
sguo/play-cookbook
Source code for most of the recipes featured in the play framework cookbook
sguo/search-perf
sguo/backyard
playground, just for fun
sguo/bobo
faceted search engine
sguo/chartsy
Mirror of the Chartsy - an open source stock charting, screening, and trading platform built on the NetBeans Platform.
sguo/cleo-primer
A restful web application for real-time typeahead and autocomplete
sguo/datafu
Hadoop library for large-scale data processing
sguo/DirectMemory
DirectMemory is a cache implementation featuring off-heap memory storage (a-la BigMemory) to enable caching of large (or large numbers of) objects without degrading jvm performance. Its main purpose is to act as a second level cache (after a heap based one) to collect large amounts of data without filling up the java heap and thus avoiding long garbage collection cycles. Included in the box is a small set of utility classes to easily handle off-heap memory buffers
sguo/elasticsearch
Open Source, Distributed, RESTful Search Engine
sguo/java-memcached-client
A simple, asynchronous, single-threaded memcached client written in java.
sguo/jekyll
Jekyll is a blog-aware, static site generator in Ruby
sguo/kafka
A distributed publish/subscribe messaging service
sguo/kamikaze
DocId set compression and set operation library
sguo/monitor-core
Ganglia Monitoring core
sguo/nessDB
A very fast key-value,embedded Database Storage Engine (Using log-structured-merge (LSM) trees) with Level-LRU, Bloom-Filter,and supports Redis-Protocol(PING,SET,MSET,GET,MGET,DEL,EXISTS,INFO,SHUTDOWN).
sguo/pig-vector
Mahout vector encoding for pig
sguo/sensei
distributed realtime searchable database
sguo/sin
sguo/zoie
realtime search/indexing system
sguo/go
go
sguo/impala
sguo/katta
Katta - distributed Lucene
sguo/lsa
lsa research and source
sguo/LSH-Hadoop
Implementation of Tyler Neylon's Locality-Specific Hash based on simplex tesselations
sguo/netty
Netty project - an event-driven asynchronous network application framework
sguo/opencv
OpenCV GitHub Mirror
sguo/stream-lib
Stream summarizer and cardinality estimator.
sguo/tensorflow
Computation using data flow graphs for scalable machine learning