This repository contains an analytical project focused on the salaries of Data Scientists. The project involves extensive data analysis to identify key factors influencing salaries, such as location, industry, and experience.
The goal of this project is to perform a detailed analysis of Data Scientist salaries and uncover key influencing factors. The project highlights strong analytical skills and proficiency in Python-based data analysis, demonstrating the ability to derive meaningful insights from complex data.
The dataset used in this project consists of salary data for Data Scientists across various industries, locations, and levels of experience.
Please download the dataset from this link and place it in the root of the project directory.
- Data Preprocessing: Cleaning and preparing the data for analysis.
- Data Analysis: Analyzing the data to identify key influencing factors on salaries.
To run the scripts and notebooks, you'll need to have the following Python libraries installed:
numpy
pandas
matplotlib
seaborn
You can install these libraries using pip
:
pip install numpy pandas matplotlib seaborn
- Clone this repository to your local machine.
- Navigate to the project directory.
- Download and extract the dataset in the project directory.
- Run the Python scripts or Jupyter Notebooks.