/DS_Salaries

Primary LanguageJupyter Notebook

Data Scientist Salaries Analysis

This repository contains an analytical project focused on the salaries of Data Scientists. The project involves extensive data analysis to identify key factors influencing salaries, such as location, industry, and experience.

Project Overview

The goal of this project is to perform a detailed analysis of Data Scientist salaries and uncover key influencing factors. The project highlights strong analytical skills and proficiency in Python-based data analysis, demonstrating the ability to derive meaningful insights from complex data.

Dataset

The dataset used in this project consists of salary data for Data Scientists across various industries, locations, and levels of experience.

Please download the dataset from this link and place it in the root of the project directory.

Steps to follow

  1. Data Preprocessing: Cleaning and preparing the data for analysis.
  2. Data Analysis: Analyzing the data to identify key influencing factors on salaries.

Requirements

To run the scripts and notebooks, you'll need to have the following Python libraries installed:

  • numpy
  • pandas
  • matplotlib
  • seaborn

You can install these libraries using pip:

pip install numpy pandas matplotlib seaborn

Usage

  1. Clone this repository to your local machine.
  2. Navigate to the project directory.
  3. Download and extract the dataset in the project directory.
  4. Run the Python scripts or Jupyter Notebooks.