/hive-project

The objective of this project is to create hive tables, store data from local to HDFS if the files don't exist already and make dynamic partitions out of columns.

Primary LanguageJupyter NotebookApache License 2.0Apache-2.0

Dataset Description

The Human Activity Recognition 70+ (HAR70+) dataset is a professionally-annotated dataset containing 18 fit-to-frail older-adult subjects (70-95 years old) wearing two 3-axial accelerometers for around 40 minutes during a semi-structured free-living protocol. The sensors were attached to the right thigh and lower back.

Dataset Characteristics Multivariate, Time-Series

Subject Area Computer Science

Associated Tasks Classification

Feature Type Real

Instances 2259597

Features 6

DATASET INFORMATION

For what purpose was the dataset created?

The purpose was to train machine learning models for human activity recognition on professionally-annotated accelerometer data of fit-to-frail older adults.

Additional Information

The HAR70+ dataset contains recordings of 18 older adult participants wearing two 3-axial Axivity AX3 accelerometers for around 40 minutes in a semi-structured free-living setting. The participants' age was between 70 and 95 years. Five of the 18 subjects used walking aids during data recording. One sensor was attached to the right front thigh and the other to the lower back. The provided sampling rate is 50Hz. Video recordings of a chest-mounted camera were used to annotate the performed activities frame-by-frame.

Each subject's recordings are provided in a separate .csv file. One such .csv file contains the following columns:

  1. timestamp: date and time of recorded sample
  2. back_x: acceleration of back sensor in x-direction (down) in the unit g
  3. back_y: acceleration of back sensor in y-direction (left) in the unit g
  4. back_z: acceleration of back sensor in z-direction (forward) in the unit g
  5. thigh_x: acceleration of thigh sensor in x-direction (down) in the unit g
  6. thigh_y: acceleration of thigh sensor in y-direction (right) in the unit g
  7. thigh_z: acceleration of thigh sensor in z-direction (backward) in the unit g
  8. label: annotated activity code

The dataset contains the following annotated activities with the corresponding coding scheme: 1: walking 3: shuffling 4: stairs (ascending) 5: stairs (descending) 6: standing 7: sitting 8: lying

Has Missing Values?

No

Dataset Link: http://archive.ics.uci.edu/dataset/780/har70