knn-imputer

There are 27 repositories under knn-imputer topic.

miriamspsantos/heterogeneous-distance-functions
A collection of heterogeneous distance functions handling missing values.
Language:MATLAB6 1 02
SebastianRokholt/Data-Science-Projects
A repository for various Data Science projects I've worked on, both university-related and in my spare time.
Language:Jupyter Notebook4 1 00
MariaDimopoulou/Churn-Prediction-Customer-Segmentation-in-E-Commerce
This project focuses on predicting customer churn in an e-commerce setting using machine learning techniques.
Language:Jupyter Notebook3 1 01
SINGHxTUSHAR/Sensor-Fault-Detection
Data fetched by wafers is to be passed through the machine learning pipeline and it is to be determined whether the wafer at hand is faulty or not apparently obliterating the need and thus cost of hiring manual labour.
Language:Jupyter Notebook3 1 00
TheMrityunjayPathak/Feature-Engineering
Feature Engineering with Python
Language:Jupyter Notebook2 1 0
ZL63388/data-preparation-codes
This repository is a collection of basic code templates for Data Preparation. All codes I am sharing are from the practical exercises I did from the Data Science Infinity Program.
Language:Python2 1 01
mahnoorsheikh16/Credit-Card-Default-Prediction
This project focuses on predicting whether a customer will default on their credit card payment in the upcoming month. Utilizing historical transaction data and customer demographics, the project employs various machine learning algorithms to distinguish between risky and non-risky customers for better credit risk management.
Language:Jupyter Notebook1 1 00
zuhaib1214/Feature-Engineering
This repository is totally focused on Feature Engineering Concepts in detail, I hope you'll find it helpful.
Language:Jupyter Notebook1 1 00
Allen-Ho-0302/First-Time-Eligible-Arbitration-Salary-Prediction
Modelling the relationship between a player’s first-time eligible arbitration salary and multiple variables.
Language:Jupyter Notebook0 1 00
AyushTyagi1610/Credit-Risk-Modelling
Built a model to determine the risk associated with extending credit to a borrower. Performed Univariate and Bivariate exploration using various methods such as pair-plot and heatmap to detect outliers and to monitor the behaviour and correlation of the features. Imputed the missing values using KNN Imputer and implemented SMOTE to address the imbalanced data. Trained the model using KNN, Decision Trees, Logistic Regression and Random Forest to achieve the best accuracy of 93%.
Language:Jupyter Notebook0 1 00
bortch/second_hand_UK_car_challenge
Kaggle UK Used Car challenge
Language:Python0 2 00
dfavenfre/customer_deposit_classifier
Streamlit app developed for bank customer deposit prediction, using a fine-tuned XGBClassifier model.
Language:Jupyter Notebook0 1 00
nf-i/data-imputation-python
Data imputation is used when there are missing values in a dataset. It helps fill in these gaps with estimated values, enabling analysis and modeling. Imputation is crucial for maintaining dataset integrity and ensuring accurate insights from incomplete data.
Language:Python0 1 00
NMARGOS/HousePricePrediction
[Kaggle Submission] -Using XGBRegressor with shap, grid search and hyperopt to predict house prices
Language:Jupyter Notebook0 1 00
ntyblco/ML_Prediction_RF_KNN
Predicting employee burnout using machine learning algorithms: Random Forest and k-Nearest Neighbors.
0 1 00
SamKazan/fraud-detection-ml
Machine learning models for enhanced fraud detection in e-commerce transactions, exploring feature engineering, distance prediction, and clustering analysis.
Language:Jupyter Notebook0 1 00
sayukiusui/Capstone-IDSCP
My Capstone for the HarvardX Course "Introduction to Data Science with Python"
Language:Jupyter Notebook0 1 00
Seghelicious/Cars45
Language:Jupyter Notebook0 1 00
whoisksy/predict-home-loan-sanction-amount
Language:Jupyter Notebook0 1 00
YaserEleraky/Aviation-Accident-NTSB-The-National-Transportation-Safety-Board
Analysis about Accident Aviation from 1962 up to 2023
Language:Jupyter Notebook00
ZG3Z/bts-weather-clustering
Language:Jupyter Notebook0 1 00
AmbreenMahhoor/What-Is-Complete-Case-Analysis-Or-CCA
Language:Jupyter Notebook1 0
Gui-Sitton/Zyfra
The company develops efficiency solutions for heavy industry. The model should predict the amount of pure gold extracted from gold ore. You have the data on extraction and purification. The model will help optimize production and eliminate unprofitable parameters.
Language:Jupyter Notebook
HuzeyfeAyaz/Knn-Imputer-With-Hamming-Distance
Filling missed data-points with the most common values among nearest neighbors
Language:Python2 0
kritika755/wafer_circleci
This flask web app is used to detect if a wafer(sensor chip) is default or not based on sensor readings.
Language:Python1 0
nani757/multivariate-analysis
the multivariate analysis compares different rows and columns for beat accuracy eg:knn imputer in univariate analysis it only compares with the same columns eg mean or median for numbers
Language:Jupyter Notebook1 0
YD5463/TabularDataProject
we perpuse a method to fill nan values using clustering
Language:Jupyter Notebook1 0

knn-imputer

miriamspsantos/heterogeneous-distance-functions

SebastianRokholt/Data-Science-Projects

MariaDimopoulou/Churn-Prediction-Customer-Segmentation-in-E-Commerce

SINGHxTUSHAR/Sensor-Fault-Detection

TheMrityunjayPathak/Feature-Engineering

ZL63388/data-preparation-codes

mahnoorsheikh16/Credit-Card-Default-Prediction

zuhaib1214/Feature-Engineering

Allen-Ho-0302/First-Time-Eligible-Arbitration-Salary-Prediction

AyushTyagi1610/Credit-Risk-Modelling

bortch/second_hand_UK_car_challenge

dfavenfre/customer_deposit_classifier

nf-i/data-imputation-python

NMARGOS/HousePricePrediction

ntyblco/ML_Prediction_RF_KNN

SamKazan/fraud-detection-ml

sayukiusui/Capstone-IDSCP

Seghelicious/Cars45

whoisksy/predict-home-loan-sanction-amount

YaserEleraky/Aviation-Accident-NTSB-The-National-Transportation-Safety-Board

ZG3Z/bts-weather-clustering

AmbreenMahhoor/What-Is-Complete-Case-Analysis-Or-CCA

Gui-Sitton/Zyfra

HuzeyfeAyaz/Knn-Imputer-With-Hamming-Distance

kritika755/wafer_circleci

nani757/multivariate-analysis

YD5463/TabularDataProject