Pinned Repositories
Adhoc_Big_data_Internship_Assignments
Assignments for Adhoc Big Data Internship
Analaysis-of-Utility-of-Stack-Overflow-for-programming-languages
This project analyzes the usefulness of Stack Overflow using the K-means clustering algorithm (a BSP algorithm)
Analysis-of-Hollywood-Movies-Using-HDFS-and-Mapreduce-APIs
This project analyzes the number of Hollywood movies made from 1913 to 2014, using HDFS as the distributed file system and the MapReduce framework as the execution engine.
Analysis_of_Time_usage_using-_SparkSQL
This project analyzes time usage using Spark SQL and other Spark APIs
anand
Clustering_Weather_Dataset
Cluster analysis of a weather dataset to identify different weather patterns, using the K-means clustering algorithm on the Spark execution engine over a Databricks cluster
Competitive_Programming
Solutions to competitive programming problems and previously solved practice questions
CSIR_scrapping
Data scraping work using BeautifulSoup (Python) from the CSIR-CDRI internship. It also includes cleansing and visualization of data supplied as Excel sheets, which is first cleansed using the xlrd module and then visualized using Matplotlib.
hadoop_cricket_analysis
Cricket Data Analytics Using HDFS and MapReduce APIs
Kafka_Spark_Streaming
This repository contains Scala code that takes in streaming data from a Kafka consumer client and is run with spark-submit to consume the stream
anaballe's Repositories
anaballe/hadoop_cricket_analysis
Cricket Data Analytics Using HDFS and MapReduce APIs
anaballe/Analysis-of-Hollywood-Movies-Using-HDFS-and-Mapreduce-APIs
This project analyzes the number of Hollywood movies made from 1913 to 2014, using HDFS as the distributed file system and the MapReduce framework as the execution engine.
anaballe/Kafka_Spark_Streaming
This repository contains Scala code that takes in streaming data from a Kafka consumer client and is run with spark-submit to consume the stream
anaballe/Adhoc_Big_data_Internship_Assignments
Assignments for Adhoc Big Data Internship
anaballe/Analaysis-of-Utility-of-Stack-Overflow-for-programming-languages
This project analyzes the usefulness of Stack Overflow using the K-means clustering algorithm (a BSP algorithm)
anaballe/Analysis_of_Time_usage_using-_SparkSQL
This project analyzes time usage using Spark SQL and other Spark APIs
anaballe/anand
anaballe/Clustering_Weather_Dataset
Cluster analysis of a weather dataset to identify different weather patterns, using the K-means clustering algorithm on the Spark execution engine over a Databricks cluster
anaballe/Competitive_Programming
Solutions to competitive programming problems and previously solved practice questions
anaballe/CSIR_scrapping
Data scraping work using BeautifulSoup (Python) from the CSIR-CDRI internship. It also includes cleansing and visualization of data supplied as Excel sheets, which is first cleansed using the xlrd module and then visualized using Matplotlib.
anaballe/Decision-Tree-Through-Spark
This project predicts weather conditions (low-humidity days, which indicate susceptibility to forest fires) by implementing a decision tree supervised learning algorithm on the Spark execution engine over a Databricks cluster (its Community Cloud service)
anaballe/EEFL_Hackathon_CCMS
This repository contains the prototype model for challenges 1 and 2 of the CCMS.
anaballe/Hive_Regex_Serde
This project uses Hive managed and external tables on HDFS to analyze the Stack Overflow website, taking its HTML pages directly as input
anaballe/Kafka_Consumer_Producer_Scripts
This repository contains custom Kafka consumer and producer clients written in Java using the Kafka APIs
anaballe/Peer_to_Peer_Chatbox
This project is a peer-to-peer chatbox backed by a MySQL database, built in Java with a Swing GUI.
anaballe/UpGrad_Big_Data_Task
anaballe/util
anaballe/zookeeper_Znode_Operations_Using_Java_APIs
This repository contains a set of operations performed on znodes using the ZooKeeper APIs.