This project analyzes the job outcomes of students who graduated from college between 2010 and 2012. The dataset can be found here.
Each row represents a different major and contains information on gender diversity, employment rates, median salaries, and more. Here is a description of the dataset:
Rank
- Rank by median earnings (dataset is order by this columnMajor_code
- A unique code for each majorMajor
- Major's descriptionMajor_category
- Category of the majorTotal
- Total number of people with this majorSample_size
- Unweighted sample size of full-time studentsMen
- Number of male graduatesWomen
- Number of female graduatesSharewomen
- Share of female graduatesEmployed
- Number of employed graduatesMedian
- Median salary of full-time workersLow_wage_jobs
- Graduates in low-wage service jobsFull_time
- Number of graduates employed 35h or morePart_time
- Number of graduates employed less than 35h
Objective: The objective is to visualize different parts of the data based on the college major.
Techniques used:
- Pandas, Numpy, Matplotlib
- Scatter plot, histograms, bar plots, scatter matrix plots
- Grouped bar plot
- Box plot
- Hexagonal bin plot