vsingh58's Repositories
vsingh58/spatial
Neo4j Spatial is a library of utilities for Neo4j that faciliates the enabling of spatial operations on data. In particular you can add spatial indexes to already located data, and perform spatial operations on the data like searching for data within specified regions or within a specified distance of a point of interest. In addition classes are provided to expose the data to geotools and thereby to geotools enabled applications like geoserver and uDig.
vsingh58/geotrellis
GeoTrellis is a geographic data processing engine for high performance applications.
vsingh58/SparkLogsAnalyzer
vsingh58/SparkSampleProject
Sample Spark Twitter Sentiment Analysis
vsingh58/zmPDSwR
Example R scripts and data for "Practical Data Science with R" by Nina Zumel and John Mount (Manning Publications)
vsingh58/python-geotrellis
Python utilities around working with GeoTrellis data.
vsingh58/opinion-mining
Aspect-based opinion mining on Yelp reviews
vsingh58/awesome-datascience
An awesome Data Science repository to learn and apply for real world problems.
vsingh58/awesome-machine-learning
A curated list of awesome Machine Learning frameworks, libraries and software.
vsingh58/cdh-twitter-example
Example application for analyzing Twitter data using CDH - Flume, Oozie, Hive
vsingh58/MiA
Mahout in Action Example Code
vsingh58/blog-spark-streaming-log-aggregation
Example of use of Spark Streaming with Kafka
vsingh58/pytenn2014_tutorial
PyTennessee 2014: Statistical Data Analysis in Python
vsingh58/scipy2014_tutorial
Tutorial: Bayesian Statistical Analysis in Python
vsingh58/datastax-userinteractions-demo
Demo which shows how to insert and query user interactions. The demo data is based on a banks applications
vsingh58/spark
Mirror of Apache Spark
vsingh58/vowpal_wabbit
John Langford's original release of Vowpal Wabbit -- a fast online learning algorithm
vsingh58/cascading.avro
Cascading Scheme for the Apache Avro data serialization format
vsingh58/statistical-analysis-python-tutorial
Statistical Data Analysis in Python
vsingh58/mongo-hadoop
MongoDB Connector for Hadoop
vsingh58/UnoExample
MapReduce/Hadoop example that uses regular playing cards to show mapping and reducing.
vsingh58/Impatient
source examples to support the "Cascading for the Impatient" blog post series
vsingh58/h2o
h2o = fast statistical, machine learning & math runtime for bigdata
vsingh58/searchanalytics-bigdata
Customer Product search clicks analytics using big data Hadoop, Hive, Oozie, ElasticSearch, Akka, Spring Data
vsingh58/courses
Course materials for the Data Science Specialization: https://www.coursera.org/specialization/jhudatascience/1
vsingh58/airline_fleets
A neo4j database of airline fleets.
vsingh58/kaggle-allstate
Allstate Purchase Prediction Challenge on Kaggle
vsingh58/ago-tools
A Python package to assist with administering ArcGIS Online Organizations.
vsingh58/Kaggle_Walmart_Recruiting_Sales_Forecasting
Use historical markdown data to predict store sales
vsingh58/pattern
Machine Learning for Cascading