/ML_Help_Script

Data preparation & Exploration Scripts

Primary LanguagePython

Useful Python scripts for Data Analysis
Полезные скрипты на Python для первичного Анализа Данных

  • Data Cleansing.py – Templates of basic data cleansing techniques, like dropping duplicates and filling null values
  • Exploratory Analysis.py – Templates of data exploring techniques, like describing datatypes, grouping and aggregating data
  • Feature Engineering.py – Templates for basic Feature Engineering
  • Model_Scikit-learn.py – Templates for creating ML Models including train and test split, providing pipelines, applying cross-validations with GridSearch calculating Mean Absolute Error, R Squared, Classification problem with ROC and AUC, Clustering using KMeans
  • PCA.py – Templates for Principal Component Analysis implemention
  • Libraries used: pandas, numpy, matplotlib, seaborn, sklearn