anaballe

SDE-2 at AI@ShareChat

ShareChat

Pinned Repositories

Adhoc_Big_data_Internship_Assignments
Assignments for Adhoc Big Data Internship
Language:Jupyter Notebook01
Analaysis-of-Utility-of-Stack-Overflow-for-programming-languages
This project is an analysis of usefulness of Stack Overflow using K-means Cluster Algorithm(A BSP algorithm)
Language:Scala01
Analysis-of-Hollywood-Movies-Using-HDFS-and-Mapreduce-APIs
This project is an analysis of the number of hollywood movies made from 1913 to 2014 using as HDFS as file distribution system and using Mapreduce Framework as execution engine.
Language:Java11
Analysis_of_Time_usage_using-_SparkSQL
This project is an analysis of time usage using SparkSQL and other Spark APIs
Language:Scala02
anand
Language:CSS01
Clustering_Weather_Dataset
Cluster Analysis on a Weather Dataset to identify different Weather patterns using K-Means Clustering Algorithm Using Spark Execution Engine over a Databricks Cluster
Language:Jupyter Notebook02
Competitive_Programming
It consists of some competitive programming questions and questions for practice solved before
Language:Java00
CSIR_scrapping
It includes data scrapping work using BeautifulSoup(Python) in CSIR-CDRI internship.It also includes work involved of data cleansing and visualization of data given in form of excel sheets which is first cleansed using xlrd module and then visulaized using Matplotlib.
Language:Jupyter Notebook01
hadoop_cricket_analysis
Cricket Data Analytics Using HDFS and Mapreduce APIs
Language:Java34
Kafka_Spark_Streaming
This repository consists of code written in scala which takes in streaming data from kafka consumer client and runs using spark-submit to catch on the streaming data
Language:Scala10

anaballe's Repositories

anaballe/hadoop_cricket_analysis
Cricket Data Analytics Using HDFS and Mapreduce APIs
Language:Java34
anaballe/Analysis-of-Hollywood-Movies-Using-HDFS-and-Mapreduce-APIs
This project is an analysis of the number of hollywood movies made from 1913 to 2014 using as HDFS as file distribution system and using Mapreduce Framework as execution engine.
Language:Java11
anaballe/Kafka_Spark_Streaming
This repository consists of code written in scala which takes in streaming data from kafka consumer client and runs using spark-submit to catch on the streaming data
Language:Scala10
anaballe/Adhoc_Big_data_Internship_Assignments
Assignments for Adhoc Big Data Internship
Language:Jupyter Notebook01
anaballe/Analaysis-of-Utility-of-Stack-Overflow-for-programming-languages
This project is an analysis of usefulness of Stack Overflow using K-means Cluster Algorithm(A BSP algorithm)
Language:Scala01
anaballe/Analysis_of_Time_usage_using-_SparkSQL
This project is an analysis of time usage using SparkSQL and other Spark APIs
Language:Scala02
anaballe/anand
Language:CSS01
anaballe/Clustering_Weather_Dataset
Cluster Analysis on a Weather Dataset to identify different Weather patterns using K-Means Clustering Algorithm Using Spark Execution Engine over a Databricks Cluster
Language:Jupyter Notebook02
anaballe/Competitive_Programming
It consists of some competitive programming questions and questions for practice solved before
Language:Java00
anaballe/CSIR_scrapping
It includes data scrapping work using BeautifulSoup(Python) in CSIR-CDRI internship.It also includes work involved of data cleansing and visualization of data given in form of excel sheets which is first cleansed using xlrd module and then visulaized using Matplotlib.
Language:Jupyter Notebook01
anaballe/Decision-Tree-Through-Spark
This project is used to predict weather forecast(low humidity days - susceptitbility for forest fire) through implementation of a Decision-Tree Supervised learning algorithm on Spark Execution Engine over Databricks Cluster (its Community Cloud Service)
Language:Jupyter Notebook3
anaballe/EEFL_Hackathon_CCMS
This repository consists of challenges 1 and 2 prototype model for the CCMS .
Language:Jupyter Notebook1
anaballe/Hive_Regex_Serde
This project uses Hive managed and external tables on HDFS space and is a analysis of the stackoverflow website and uses its input directly as the html page
Language:Jupyter Notebook1
anaballe/Kafka_Consumer_Producer_Scripts
This repository consists of Kafka custom Consumer and Producer clients written using Kafka APIs in Java
Language:Java
anaballe/Peer_to_Peer_Chatbox
This project is a peer to peer chatbox using a mysql database and is built in Java using Swing for developing GUI.
Language:Java
anaballe/UpGrad_Big_Data_Task
Language:Java1
anaballe/util
anaballe/zookeeper_Znode_Operations_Using_Java_APIs
This repository consists of set of operations performed on Znodes using Zookeeper APIs.
Language:Java

anaballe

Pinned Repositories

Adhoc_Big_data_Internship_Assignments

Analaysis-of-Utility-of-Stack-Overflow-for-programming-languages

Analysis-of-Hollywood-Movies-Using-HDFS-and-Mapreduce-APIs

Analysis_of_Time_usage_using-_SparkSQL

anand

Clustering_Weather_Dataset

Competitive_Programming

CSIR_scrapping

hadoop_cricket_analysis

Kafka_Spark_Streaming

anaballe's Repositories

anaballe/hadoop_cricket_analysis

anaballe/Analysis-of-Hollywood-Movies-Using-HDFS-and-Mapreduce-APIs

anaballe/Kafka_Spark_Streaming

anaballe/Adhoc_Big_data_Internship_Assignments

anaballe/Analaysis-of-Utility-of-Stack-Overflow-for-programming-languages

anaballe/Analysis_of_Time_usage_using-_SparkSQL

anaballe/anand

anaballe/Clustering_Weather_Dataset

anaballe/Competitive_Programming

anaballe/CSIR_scrapping

anaballe/Decision-Tree-Through-Spark

anaballe/EEFL_Hackathon_CCMS

anaballe/Hive_Regex_Serde

anaballe/Kafka_Consumer_Producer_Scripts

anaballe/Peer_to_Peer_Chatbox

anaballe/UpGrad_Big_Data_Task

anaballe/util

anaballe/zookeeper_Znode_Operations_Using_Java_APIs