Pinned Repositories
akka
Akka Project
awesome-courses
List of awesome university courses for learning Computer Science!
Baseform-Epanet-Java-Library
A full port of the Epanet modeling library to Java, as a basis for our software platform's network analysis capabilities. This is a complete rewrite of the C language code to a portable, modern and extensible Java library that is hardware- and operating system-independent. The port is a full Epanet engine implementation, including the regular hydraulic and water quality simulation and fully integrating the MSX multi-species extension library.
Car-Price-Prediction-System
Technologies: Nutch 1.6, MapReduce in Java, Mahout. For selling a used car, its price can be predicted by giving some attributes. e.g. Car Model, total miles, engine type. Trained data will be provided to our system to predict the price for new data. Used Nutch to crawl car data from 52 states which is on craigslist.org. Over 0.3 million records were fetched. The content was then pruned using two MapReduce Jobs. The first MapReduce cleaned the data removing unwanted unicode symbols and incomplete data (information without car model or total miles or engine or price). The Second MapReduce extracted the required attributes and emitted in tsv format. This tsv was then provided to a Naïve Based Classifier in Mahout. A classification model was built from the training data. This models predicts the price when attributes like car model model, miles, engine was provided.
Colt
Open Source Libraries for High Performance Scientific and Technical Computing
dataRetrieval
This R package is designed to obtain its water quality sample data, streamflow data, and metadata directly from either the USGS NWIS (National Water Information System), but it allows for user-supplied text files as inputs. The program is designed to ingest the data directly into R and structure them into file structures suited for EGRET analysis.
guangdi_myblog
for personal blog and notebook
ranking
Learning to Rank in TensorFlow
storm-applications
A collection of real-time applications built with Apache Storm.
yahoo-samoa-research
Research of Yahoo SAMOA
positivepsycho's Repositories
positivepsycho/storm-streaming
Sample Storm Topology for stream processing sensor data
positivepsycho/trident-tutorial
A practical Storm Trident tutorial
positivepsycho/first-stories-twitter
How to spot first stories on Twitter using Storm.
positivepsycho/twitter-storm-weibo
Twitter-Storm部署完整资料
positivepsycho/storm-java
write some topologies of storm
positivepsycho/LogEventsProcessing
real time log event processing using storm, kafka, logstash & cassandra
positivepsycho/storm-trident-tutorial
Storm Trident examples
positivepsycho/storm-twitter-word-count
This is a sample project demonstrating real-time computation storm framework integration with twitter.
positivepsycho/PRML
Pattern recognition and machine learning toolbox
positivepsycho/trident-memcached
Trident state implementation for Memcached
positivepsycho/machine-learning
testing and learning machine learning
positivepsycho/streaming-storm
Proof of concept StreamingKMeans + BallKMeans implementation with Storm
positivepsycho/Car-Price-Prediction-System
Technologies: Nutch 1.6, MapReduce in Java, Mahout. For selling a used car, its price can be predicted by giving some attributes. e.g. Car Model, total miles, engine type. Trained data will be provided to our system to predict the price for new data. Used Nutch to crawl car data from 52 states which is on craigslist.org. Over 0.3 million records were fetched. The content was then pruned using two MapReduce Jobs. The first MapReduce cleaned the data removing unwanted unicode symbols and incomplete data (information without car model or total miles or engine or price). The Second MapReduce extracted the required attributes and emitted in tsv format. This tsv was then provided to a Naïve Based Classifier in Mahout. A classification model was built from the training data. This models predicts the price when attributes like car model model, miles, engine was provided.
positivepsycho/storm-contrib
A collection of spouts, bolts, serializers, DSLs, and other goodies to use with Storm
positivepsycho/storm-twitter
Simple example of using Storm to process real-time tweets and store it in mongoDB.
positivepsycho/trident-hackaton
Base project for 10/04/2013 Trident's Hackaton @ Berlin Big Data Beers
positivepsycho/storm-topologies
Storm Topologies (ETL,WordCount,etc.)
positivepsycho/tweetness
Tweetness parses live tweet stream from Twitter and finds the top 10 hashtags using Storm (using RedStorm for Ruby)
positivepsycho/PredictiveAnalyticsCND
Predictive Analytics CND Utilizing Open Source Tool Sets Such as Hadoop & Twitter Storm
positivepsycho/storm-wiki
Twitter Storm wiki as a repo
positivepsycho/stormML
paper on twitter storm and machine learning
positivepsycho/Baseform-Epanet-Java-Library
A full port of the Epanet modeling library to Java, as a basis for our software platform's network analysis capabilities. This is a complete rewrite of the C language code to a portable, modern and extensible Java library that is hardware- and operating system-independent. The port is a full Epanet engine implementation, including the regular hydraulic and water quality simulation and fully integrating the MSX multi-species extension library.
positivepsycho/tweitgeist
realtime Twitter trending hashtags computation using RedStorm / Storm
positivepsycho/machinelearning-with-storm
A storm topology for training Vowpal Wabbit
positivepsycho/kestrel
simple, distributed message queue system
positivepsycho/Colt
Open Source Libraries for High Performance Scientific and Technical Computing
positivepsycho/online-k-clustering
This project implements online-k-clustering algorithm as mentioned in this paper(http://cseweb.ucsd.edu/~dasgupta/291/lec6.pdf). It produces REALTIME k-clustering on an infinite stream of data. It is implemented on top of twitter storm and uses cassandra as database. It deals with 2-dimensional matrices and clusters in Euclidean space.
positivepsycho/akka
Akka Project
positivepsycho/storm-feeds-example
This is a toy example for illustrating the usefulness of Storm in two use cases: stream processing and continuous computation.