Pinned Repositories
120-Data-Science-Interview-Questions
Answers to 120 commonly asked data science interview questions.
AD_Prediction
Alzheimer's disease prediction using ResNet and AlexNet
algods
Algorithms & Data Structures collection
dropwizard-cassandra
Dropwizard support for Cassandra
FirstRepo
ml
The Cloudera Data Science Team's Tools for Data Preparation, Machine Learning, and Model Evaluation.
harishraj's Repositories
harishraj/algorithms
Includes common algorithm questions.
harishraj/algorithms-java
Common Algorithms written in Java
harishraj/ansible_tutorial
A tutorial with video and code taking the user from AWS machine creation to their first deployment using Ansible
harishraj/cdh-twitter-example
Example application for analyzing Twitter data using CDH - Flume, Oozie, Hive
harishraj/code-problems
Common code and interview problems solved in multiple languages
harishraj/Conjecture
Scalable Machine Learning in Scalding
harishraj/DataStructureAndAlgorithmsMadeEasyInJava
Data Structure And Algorithms Made Easy In Java
harishraj/display-advertising-challenge
Criteo/Kaggle Competition of CTR prediction
harishraj/epibook.github.io
Publishes to GitHub Pages
harishraj/Exploratory_Data_Analysis
This is a repository for any and all code written for the Exploratory Data Analysis Coursera course through Johns Hopkins University.
harishraj/ganitha
Scalding-powered machine learning
harishraj/geeksforgeeks
Java Solutions to problems from geeksforgeeks.org
harishraj/Getting_and_Cleaning_Data
This is a repository for any and all code written for the Getting and Cleaning Data Coursera course through Johns Hopkins University.
harishraj/hiped2
Source code that accompanies the book "Hadoop in Practice, Second Edition".
harishraj/hive-udfs
Collection of useful Scala-based Hive UDFs.
harishraj/Impatient
Source examples to support the "Cascading for the Impatient" blog post series
harishraj/interview
Everything you need to kick ass on your coding interview
harishraj/java-algorithms-implementation
Algorithms and Data Structures implemented in Java
harishraj/kafka-meetup-demo
harishraj/kafka-spark-consumer
harishraj/kafka-storm-starter
Code examples that show how to integrate Apache Kafka 0.8+ with Apache Storm 0.9+, using Apache Avro as the data serialization format.
harishraj/LP01_DSWAC_0706
Lord's first big data demo project, learning from Cloudera's CCP-2013: "Data Scientist Web Analytics Challenge: Classification, Clustering, and Collaborative Filtering".
harishraj/marseille
A real-time streaming implementation of Markov-chain-based fraud detection
harishraj/RoaringBitmap
A better compressed bitset in Java
harishraj/sifarish
Content-based and collaborative-filtering-based recommendation and personalization engine implemented on Hadoop and Storm
harishraj/stream-lib
Stream summarizer and cardinality estimator.
harishraj/streamparse
streamparse lets you run Python code against real-time streams of data. Integrates with Apache Storm.
harishraj/vowpal_wabbit
John Langford's original release of Vowpal Wabbit -- a fast online learning algorithm
harishraj/whirr-cm
harishraj/wirbelsturm
Wirbelsturm is a Vagrant and Puppet based tool to perform 1-click local and remote deployments, with a focus on big data related infrastructure.