Pinned Repositories
Assignment
Assignment-03-Q1--Hypothesis-Testing
Q1.A F&B manager wants to determine whether there is any significant difference in the diameter of the cutlet between two units. A randomly selected sample of cutlets was collected from both units and measured? Analyze the data and draw inferences at 5% significance level. Please state the assumptions and tests that you carried out to check validit
Assignment-03-Q2--Hypothesis-Testing
Anova ftest statistics A hospital wants to determine whether there is any difference in the average Turn Around Time (TAT) of reports of the laboratories on their preferred list. They collected a random sample and recorded TAT for reports of 4 laboratories. TAT is defined as sample collected to report dispatch. Analyze the data and determine wheth
Assignment-03-Q4-Hypothesis-Testing
Chi2 contengency independence test Q4. TeleCall uses 4 centers around the globe to process customer order forms. They audit a certain % of the customer order forms. Any error in order form renders it defective and has to be reworked before processing. The manager wants to check whether the defective % varies by centre. Please analyze the data at 5
Assignment-03-Q5--Hypothesis-Testing
Chi2 contengency independence test Q5. Fantaloons Sales managers commented that % of males versus females walking in to the store differ based on day of the week. Analyze the data and determine whether there is evidence at 5 % significance level to support this hypothesis. Assume Null Hypothesis as Ho: Independence of categorical variables (% of
Assignment-04-Simple-Linear-Regression-1
Q1) Delivery_time -> Predict delivery time using sorting time. Build a simple linear regression model by performing EDA and do necessary transformations and select the best model using R or Python. EDA and Data Visualization, Feature Engineering, Correlation Analysis, Model Building, Model Testing and Model Predictions using simple linear regressi
Healthcare_chatbot
Healthcare Chatbot for Symptom Checking Create a chatbot that can help users diagnose potential medical conditions based on symptoms. Use NLP for understanding user inputs, and train ML models on medical data to provide diagnostic suggestions.
LLM-Project-Chatbot-
A research tool where you can give bunch of articles, URLs and then can ask questions it will retreve answer based on those articles. Process article content through LangChain's UnstructuredURL Loader. Construct an embedding vector using OpenAI's embeddings and leverage FAISS, a powerful similarity search library, to enable swift and effective re
Project--Sentiment_Analysis
# Project--Sentiment_Analysis Developed Python script to extract comments data from Amazon and Official site. Performed NLP based Tokenization, Lemmatization, vectorization and processed data in Machine understandable language Have used VADERS, ROBERTA and BERT models to find the sentiment of the reviews and used the ratings on the source to chec
Project-Bankruptcy_Prediction
Using various machine learning models (Logistic Regression, Gaussian Naïve Bayes, KNN, Gradient Boosting Classifier, Decision Tree Classifier, Random Forest Classifier.) to predict whether a company will go bankrupt in the following years, based on financial attributes of the company; Addressed the issue of imbalanced classes, different importance
shwetapardhi's Repositories
shwetapardhi/Healthcare_chatbot
Healthcare Chatbot for Symptom Checking Create a chatbot that can help users diagnose potential medical conditions based on symptoms. Use NLP for understanding user inputs, and train ML models on medical data to provide diagnostic suggestions.
shwetapardhi/Project--Sentiment_Analysis
# Project--Sentiment_Analysis Developed Python script to extract comments data from Amazon and Official site. Performed NLP based Tokenization, Lemmatization, vectorization and processed data in Machine understandable language Have used VADERS, ROBERTA and BERT models to find the sentiment of the reviews and used the ratings on the source to chec
shwetapardhi/Project-Bankruptcy_Prediction
Using various machine learning models (Logistic Regression, Gaussian Naïve Bayes, KNN, Gradient Boosting Classifier, Decision Tree Classifier, Random Forest Classifier.) to predict whether a company will go bankrupt in the following years, based on financial attributes of the company; Addressed the issue of imbalanced classes, different importance
shwetapardhi/Assignment-03-Q1--Hypothesis-Testing
Q1.A F&B manager wants to determine whether there is any significant difference in the diameter of the cutlet between two units. A randomly selected sample of cutlets was collected from both units and measured? Analyze the data and draw inferences at 5% significance level. Please state the assumptions and tests that you carried out to check validit
shwetapardhi/Assignment-03-Q4-Hypothesis-Testing
Chi2 contengency independence test Q4. TeleCall uses 4 centers around the globe to process customer order forms. They audit a certain % of the customer order forms. Any error in order form renders it defective and has to be reworked before processing. The manager wants to check whether the defective % varies by centre. Please analyze the data at 5
shwetapardhi/Assignment-03-Q5--Hypothesis-Testing
Chi2 contengency independence test Q5. Fantaloons Sales managers commented that % of males versus females walking in to the store differ based on day of the week. Analyze the data and determine whether there is evidence at 5 % significance level to support this hypothesis. Assume Null Hypothesis as Ho: Independence of categorical variables (% of
shwetapardhi/Assignment-04-Simple-Linear-Regression-1
Q1) Delivery_time -> Predict delivery time using sorting time. Build a simple linear regression model by performing EDA and do necessary transformations and select the best model using R or Python. EDA and Data Visualization, Feature Engineering, Correlation Analysis, Model Building, Model Testing and Model Predictions using simple linear regressi
shwetapardhi/Assignment-04-Simple-Linear-Regression-2
Q2) Salary_hike -> Build a prediction model for Salary_hike Build a simple linear regression model by performing EDA and do necessary transformations and select the best model using R or Python. EDA and Data Visualization. Correlation Analysis. Model Building. Model Testing. Model Predictions.
shwetapardhi/Assignment-05-Multiple-Linear-Regression-1
Multiple-Linear-Regression-1. Consider only the below columns and prepare a prediction model for predicting Price of Toyota Corolla.p
shwetapardhi/Assignment-05-Multiple-Linear-Regression-2
Prepare a prediction model for profit of 50_startups data. Do transformations for getting better predictions of profit and make a table containing R^2 value for each prepared model. R&D Spend -- Research and devolop spend in the past few years Administration -- spend on administration in the past few years Marketing Spend -- spend on Marketing in t
shwetapardhi/LLM-Project-Chatbot-
A research tool where you can give bunch of articles, URLs and then can ask questions it will retreve answer based on those articles. Process article content through LangChain's UnstructuredURL Loader. Construct an embedding vector using OpenAI's embeddings and leverage FAISS, a powerful similarity search library, to enable swift and effective re
shwetapardhi/Assignment-10-Recommendation-System
shwetapardhi/Assignment-11-Text-Mining
shwetapardhi/Assignment-12-Naive-Bayes
shwetapardhi/Assignment-13-KNN
shwetapardhi/Assignment-14-Decision-Tree
shwetapardhi/Assignment-15-Random-Forests
shwetapardhi/Assignment-16-Neural-Networks
shwetapardhi/Assignment-17-Support-Vector-Machine
shwetapardhi/Assignment-18-Forecasting
shwetapardhi/Assignment-2-Set2-Q1--Basic-Statistic-Level-2
Q1. The time required for servicing transmissions is normally distributed with mean = 45 minutes and SD = 8 minutes. The service manager plans to have work begin on the transmission of a customer’s car 10 minutes after the car is dropped off and the customer is told that the car will be ready within 1 hour from drop-off. What is the probability tha
shwetapardhi/Assignment-2-Set2-Q2--Basic-Statistic-Level-2
Q2.The current age (in years) of 400 clerical employees at an insurance claims processing center is normally distributed with mean = 38 and Standard deviation =6. For each statement below, please specify True/False. If false, briefly explain why. A. More employees at the processing center are older than 44 than between 38 and 44. B. A training pr
shwetapardhi/Assignment-2-Set2-Q4--Basic-Statistic-Level-2
Q4. Let X ~ N(100, 20^2). Find two values, a and b, symmetric about the mean, such that the probability of the random variable taking a value between them is 0.99.
shwetapardhi/Assignment-2-Set3-Q5--Basic-Statistic-Level-2
In January 2005, a company that monitors Internet traffic (WebSideStory) reported that its sampling revealed that the Mozilla Firefox browser launched in 2004 had grabbed a 4.6% share of the market. I. If the sample were based on 2,000 users, could Microsoft conclude that Mozilla has a less than 5% share of the market? II. WebSideStory claims that
shwetapardhi/Assignment-2-Set4-Q3--Basic-Statistic-Level-2
Q3.Auditors at a small community bank randomly sample 100 withdrawal transactions made during the week at an ATM machine located near the bank’s main branch. Over the past 2 years, the average withdrawal amount has been $50 with a standard deviation of $40. Since audit investigations are typically expensive, the auditors decide to not initiate furt
shwetapardhi/Assignment-6-Logistic-Regression
shwetapardhi/Assignment-7-Clustering
shwetapardhi/Assignment-8-PCA
shwetapardhi/Assignment-9-Association-Rule
shwetapardhi/Virtula-Mouse
This project is a hand gesture mouse using OpenCV, Mediapipe and Python. It uses the cam to detect hand gestures and move the mouse accordingly. It also has fuctions to perform left and right clicks, and scroll up and down etc.. Right hand is used to control the mouse and left hand is used to perform other functions such as copy/paste, undo/redo e