VipanchiKatthula
Data Scientist with 5+ years of experience in healthcare and retail. Interested in ML, AI, and data science.
Pinned Repositories
English-to-Telugu-Translator
We built a REST API that translates English sentences into Telugu using an LSTM model built with Keras, and deployed the model on AWS with Docker container orchestration.
Instagram-Depression-Detection
End-to-end project to predict the mental health status of Instagram users from their posts and images. Built unsupervised LDA and semi-supervised topic models using text features. Trained a Support Vector Classifier to predict the probability of depression, improving accuracy from 70% to 94.5% at 89% recall and 92% precision.
Product-Recommendation-System
Built a content-based recommendation system and a product-based search recommendation system using the ALS algorithm on big data.
VMWare-Customer-Engagement
Improving customer engagement at VMware through analytics. The data, from VMware's web analytics team, covers 700+ variables for 500+ potential customers.
DocumentSimilarity_and_Word_Embedding
Finding similarity between documents using Jaccard similarity, including the POS tags of the words to better capture the synchronous relationship between documents. Using word embeddings for visual representation of text data.
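As a rough illustration of the idea (a minimal sketch, not the code from this repository), Jaccard similarity can be computed over plain words or over (word, POS tag) pairs. The sample tokens and tags below are hypothetical and hand-written; in practice they would come from a POS tagger.

```python
def jaccard_similarity(a, b):
    """Jaccard similarity |A ∩ B| / |A ∪ B| between two collections of items."""
    set_a, set_b = set(a), set(b)
    if not set_a and not set_b:
        return 1.0  # convention: two empty documents are treated as identical
    return len(set_a & set_b) / len(set_a | set_b)


# Hypothetical pre-tagged documents: each token is a (word, POS tag) pair.
doc1 = [("they", "PRON"), ("book", "VERB"), ("a", "DET"), ("flight", "NOUN")]
doc2 = [("they", "PRON"), ("read", "VERB"), ("a", "DET"), ("book", "NOUN")]

# Comparing words only: "book" counts as a match in both documents.
words_only = jaccard_similarity([w for w, _ in doc1], [w for w, _ in doc2])

# Comparing (word, tag) pairs: "book" as a verb no longer matches "book" as a noun.
with_pos = jaccard_similarity(doc1, doc2)

print(f"words only: {words_only:.3f}, with POS tags: {with_pos:.3f}")
```

Including the tag in each set element means a word only counts as shared when it plays the same grammatical role in both documents.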
PySpark_OperationsOnAmazonTweets
This repository demonstrates the use of the Spark context and Spark SQL context on an Amazon Tweets dataset of 400k tweets. I worked with tweet_id (id_str), Tweet_created_time, Retweet_count, and Favourite_count to find the days with a high influx of tweets.
Mapper_Reducer_Implementation
Using the book "Pride and Prejudice" as the input file, I implemented mapper and reducer functionality in Python to mimic the Hadoop infrastructure. I also used multi-threading to run two mapper and two reducer functions simultaneously.
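The general pattern can be sketched as a word count with two mapper threads and two reducer threads. This is a minimal sketch under assumptions, not the repository's code: the function names (`mapper`, `reducer`, `word_count`) and the short sample text are hypothetical, and the reducers here produce partial counts that are merged at the end, which simplifies Hadoop's shuffle step.

```python
import re
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def mapper(chunk):
    """Map step: emit a (word, 1) pair for every word in the chunk."""
    return [(word, 1) for word in re.findall(r"[a-z']+", chunk.lower())]


def reducer(pairs):
    """Reduce step: sum the counts emitted for each word."""
    counts = Counter()
    for word, n in pairs:
        counts[word] += n
    return counts


def word_count(text, workers=2):
    """Split the text into `workers` chunks and run mappers and reducers in threads."""
    lines = text.splitlines()
    chunks = ["\n".join(lines[i::workers]) for i in range(workers)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        mapped = list(pool.map(mapper, chunks))    # mappers run concurrently
        partial = list(pool.map(reducer, mapped))  # reducers run concurrently
    total = Counter()
    for counts in partial:  # merge the partial counts from each reducer
        total.update(counts)
    return total


sample = (
    "It is a truth universally acknowledged,\n"
    "that a single man in possession of a good fortune,\n"
    "must be in want of a wife."
)
counts = word_count(sample)
print(counts.most_common(3))
```

Swapping the sample string for the contents of the book file gives the full word count.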
Titanic_survival_prediction
Repository for a Medium blog post showing how to deploy machine learning models in Tableau, giving the end user direct access.
VipanchiKatthula's Repositories
VipanchiKatthula/project-based-learning
Curated list of project-based tutorials
VipanchiKatthula/data-science-interviews
Data science interview questions and answers
VipanchiKatthula/VipanchiKatthula
Home page introduction file
VipanchiKatthula/Titanic_survival_prediction
Repository for a Medium blog post showing how to deploy machine learning models in Tableau, giving the end user direct access.
VipanchiKatthula/vipanchikatthula.github.io
Personal Portfolio Website
VipanchiKatthula/academic-kickstart
VipanchiKatthula/Jaccard_Cosine_Similarity
Repository to showcase my projects related to text analytics and Natural Language Processing (NLP)
VipanchiKatthula/Instagram-Depression-Detection
End-to-end project to predict the mental health status of Instagram users from their posts and images. Built unsupervised LDA and semi-supervised topic models using text features. Trained a Support Vector Classifier to predict the probability of depression, improving accuracy from 70% to 94.5% at 89% recall and 92% precision.
VipanchiKatthula/PySpark_OperationsOnAmazonTweets
This repository demonstrates the use of the Spark context and Spark SQL context on an Amazon Tweets dataset of 400k tweets. I worked with tweet_id (id_str), Tweet_created_time, Retweet_count, and Favourite_count to find the days with a high influx of tweets.
VipanchiKatthula/TwitterSentimentAnalysis
Twitter Sentiment Analysis
VipanchiKatthula/VMWare-Customer-Engagement
Improving customer engagement at VMware through analytics. The data, from VMware's web analytics team, covers 700+ variables for 500+ potential customers.
VipanchiKatthula/English-to-Telugu-Translator
We built a REST API that translates English sentences into Telugu using an LSTM model built with Keras, and deployed the model on AWS with Docker container orchestration.
VipanchiKatthula/Product-Recommendation-System
Built a content-based recommendation system and a product-based search recommendation system using the ALS algorithm on big data.
VipanchiKatthula/DocumentSimilarity_and_Word_Embedding
Finding similarity between documents using Jaccard similarity, including the POS tags of the words to better capture the synchronous relationship between documents. Using word embeddings for visual representation of text data.
VipanchiKatthula/Mapper_Reducer_Implementation
Using the book "Pride and Prejudice" as the input file, I implemented mapper and reducer functionality in Python to mimic the Hadoop infrastructure. I also used multi-threading to run two mapper and two reducer functions simultaneously.
VipanchiKatthula/Key-Predictors-of-Wearable-Devices
Code for the analysis of Health Information National Trends Survey data in Stata
VipanchiKatthula/open-neuroscience-website
New implementation of the Open Neuroscience website
VipanchiKatthula/connorrothschild.com
👨🏻‍💻 Personal website
VipanchiKatthula/Sentiment-analysis-of-Twitter-Data