Pinned Repositories
Arvato-Identifying-the-potential-customers
Identify potential customers among the crowd to reduce marketing spend using techniques like PCA, Segmentation (K-Means) and Classification (Random Forest, Gradient Boosting, AdaBoost Classifer, Light Gbm)
data_Analysis-H1B-visa
Disaster-Response-WebApplication
Built a disaster response Web Application that takes input messages during calamity and direct to a particular alleviation organization that can give speedy help
IST-687-INTRO-TO-DATA-SCIENCE
IST-707-DATA-ANALYTICS
IST-722-DATA-WAREHOUSE
Sms-spam-detector
Build the SMS spam detector using PySpark and Logistic Regression and NLP techniques like Tokenization, CountVectorizer, Tfidf
Text-Summarization-An-Extractive-Method-NLP
Utilizing NLTK package and techniques like Tokenization, Stemming, Lemmatization, Computing Similarity and Using Page-Rank Algorithm to summarize a 1-page paragraph in 10 sentences
Wage-Analysis
Utilized Random Forest Classifier to identify features (education, health, age, sex, race.. that will lead to high wages. Utilized techniques like Feature Engineering (StringIndexing, One-Hot Encoding, VectorAssembler) and Modeling (Random Forest Classifier with CV and hyper parameter optimization (MaxDepth, No of trees))
YaayBnb
harshdarji23's Repositories
harshdarji23/Text-Summarization-An-Extractive-Method-NLP
Utilizing NLTK package and techniques like Tokenization, Stemming, Lemmatization, Computing Similarity and Using Page-Rank Algorithm to summarize a 1-page paragraph in 10 sentences
harshdarji23/IST-722-DATA-WAREHOUSE
harshdarji23/data_Analysis-H1B-visa
harshdarji23/Arvato-Identifying-the-potential-customers
Identify potential customers among the crowd to reduce marketing spend using techniques like PCA, Segmentation (K-Means) and Classification (Random Forest, Gradient Boosting, AdaBoost Classifer, Light Gbm)
harshdarji23/IST-687-INTRO-TO-DATA-SCIENCE
harshdarji23/IST-707-DATA-ANALYTICS
harshdarji23/Wage-Analysis
Utilized Random Forest Classifier to identify features (education, health, age, sex, race.. that will lead to high wages. Utilized techniques like Feature Engineering (StringIndexing, One-Hot Encoding, VectorAssembler) and Modeling (Random Forest Classifier with CV and hyper parameter optimization (MaxDepth, No of trees))
harshdarji23/Data_Science-Bootcamp-ML
Udemy data science with python Bootcamp covering topics from linear regression to Nlp
harshdarji23/Disaster-Response-WebApplication
Built a disaster response Web Application that takes input messages during calamity and direct to a particular alleviation organization that can give speedy help
harshdarji23/IST-659-DATA-ADMIN-CONCEPTS-DB-MGMT
harshdarji23/Sms-spam-detector
Build the SMS spam detector using PySpark and Logistic Regression and NLP techniques like Tokenization, CountVectorizer, Tfidf
harshdarji23/YaayBnb
harshdarji23/Data_Science-Bootcamp-Python-Data-Wrangling-Data-Viz
Data Science Bootcamp on Udemy covering python libraries like pandas, NumPy and data visualization
harshdarji23/hugo-tranquilpeak-theme
A gorgeous responsive theme for Hugo blog framework
harshdarji23/Hypothesis-Testing-Recession-Housing-Prices
harshdarji23/IST-664-NLP
harshdarji23/IST-718-Big-Data-Analytics
harshdarji23/Linear-regression-Scikit-Learn
harshdarji23/map_reduce_gradient_descent
Gradient Descent is one of the most commonly used technique to minimize the loss function. Here, I have use pySpark and Map Reduce technique to demonstrate Gradient Descent Algorithm
harshdarji23/MBC-638-DATA-ANALYSIS-AND-DECSION-MAKING
harshdarji23/Movie-Recommender-System
harshdarji23/my-portfolio-old
harshdarji23/potfolio
This is my portfolio blogging website made with hugo and hosted on netlify
harshdarji23/R-Programming
harshdarji23/Random-Forest-Classifier-Scikit-Learn
harshdarji23/SCM-651-BUSINESS-ANALYTICS
harshdarji23/Twiiter-Sentiment-Analysis
harshdarji23/Windows-Process-Classifier
Pyspark project to classify Windows process logs as benign or malicious
harshdarji23/Yocket-Data-Analysis
harshdarji23/Zomata-EDA