/U.S.-Airport-Data-Analysis-with-SAS

A project for solving a real-world business problem by applying concepts in the SAS Programming.

Primary LanguageSAS

U.S.-Airport-Data-Analysis-with-SAS

A project for solving a real-world business problem by applying concepts in the SAS Programming.

Business Problem

The first project is to prepare and analyze Transportation Security Administration (TSA) Airport Claims data from 2002 through 2017. The TSA is an agency of the United States Department of Homeland Security that has authority over the security of the traveling public. A claim is filed if you are injured or your property is lost or damaged during the screening process at an airport. Here is what I need to do:

  • Prepare the data.
  • Create one PDF report that analyzes the overall data as well as the data for a dynamically specified state.

Data Information

The data that I used is TSAClaims2002_2017.csv, which was created from the following:

  • TSA Airport Claims data from https://www.dhs.gov/tsa-claims-data.
  • FAA Airport Facilities data from https://www.faa.gov/airports/airport_safety/airportdata_5010/. The TSAClaims2002_2017.csv file was created by concatenating each individual TSA Airport Claims table. After the concatenation, the data was joined with the FAA Airport Facilities data. Here are a few notes regarding the data:
  • All data is public data, and accuracy is not guaranteed.
  • The column Airport_Codes from the TSA Airport Claims data has been joined with Location_ID from the FAA Airports Facilities data. Some Airport_Codes values do not correspond to Location_ID values.
  • Columns in the TSA Airport Claims data have changed over the years. Because of this, some of the original columns were removed from the data for this case study.
  • The column Item_Category does not have consistent input values over the years. For this reason, I do not clean this column in this case study.

Resources

To attempt this case study, I downloaded the TSAClaims2002_2017.csv file.

image image image

Results

Here are the results for the overall analysis and a report with the selected state of California.

  • Date issues in the overall data

image

  • Claims per year of Incident_Date in the overall data

image

image

  • The frequency values for Claim_Type for the selected state

image

  • The frequency values for Claim_Site for the selected state

image

  • the frequency values for Disposition for the selected state

image

  • The mean, minimum, maximum, and sum of Close_Amount for the selected state to the nearest integer.

image