/Human-Resources-Analytics

Primary LanguageJupyter NotebookApache License 2.0Apache-2.0

Human-Resources-Analytics

It contains the exploratory data analysis of the Human Resources Analytics dataset from Kaggle.

To know more about the dataset please follow the link: https://www.kaggle.com/ludobenistant/hr-analytics

Install:

I am using Python 3.6.1 for the project. You need to install the fllowing Python libraries:

  1. NumPy (for documentation:http://www.numpy.org/)
  2. Pandas (for documentation:http://pandas.pydata.org/)
  3. Matplotlib (for documentation: https://matplotlib.org/)
  4. Seaborn (for documentation: https://seaborn.pydata.org/)

I have used Jupyter Notebook for the data exploration.

Code:

The complete code is in the 'HR_notebook.ipynb' file.

Data:

You can see the data in 'HR_comma_sep.csv' file.

To download the data please follow the link:

https://www.kaggle.com/ludobenistant/hr-analytics/downloads/human-resources-analytics.zip

These above file is in the .zip format. Please extract the files to get the .csv file out of it.

Data Introduction:

*This dataset is simulated

Why are our best and most experienced employees leaving prematurely? Have fun with this database and try to predict which valuable employees will leave next. Fields in the dataset include:

  1. Satisfaction Level
  2. Last evaluation
  3. Number of projects
  4. Average monthly hours
  5. Time spent at the company
  6. Whether they have had a work accident
  7. Whether they have had a promotion in the last 5 years
  8. Departments
  9. Salary
  10. Whether the employee has left

Hope it helps, Regards - Nilay.