Wearable sensors for Parkinson’s disease: which data are worth collecting for training symptom detection models

Michael J. Fox Foundation for Parkinson’s Research Clinician Input Study (CIS-PD) Wireless Adhesive Sensor Sub-Study

Overview
Repo Contents
System Requirements
Installation Guide
Demo
Issues

Overview

Machine learning algorithms that use data streams captured from soft wearable sensors have the potential to automatically detect PD symptoms and inform clinicians about the progression of disease. However, these algorithms must be trained with annotated data from clinical experts who can recognize symptoms, and collecting such data is costly. Understanding how many sensors and how much labeled data are required is key to successfully deploying these models outside of the clinic. Here we recorded movement data using 6 flexible wearable sensors in 20 individuals with PD over the course of multiple clinical assessments conducted on one day and repeated two weeks later. Participants performed 13 common tasks, such as walking or typing, while a clinician rated the severity of symptoms (bradykinesia and tremor). We then trained convolutional neural networks and statistical ensembles to detect whether a segment of movement showed signs of bradykinesia or tremor based on data from tasks performed by other individuals. Our results show that a single wearable sensor on the back of the hand is sufficient for detecting bradykinesia and tremor in the upper extremities, while using sensors on both sides does not improve performance. Increasing the amount of training data by adding other individuals can lead to improved performance, but repeating assessments with the same individuals - even at different medication states - does not substantially improve detection across days. Our results suggest that PD symptoms can be detected during a variety of activities and are best modeled by a dataset incorporating many individuals.

Repo Contents

code: Python 3.5 package code as Jupyter Notebook files. 'NewPaperAnalysis' contains the results shown in the paper, ordered by section. 'CNNModels' contains the code to build and train the Convolutional Neural network.
helperFcns: Accompanying helper functions to extract sensors data clips and compute features from the raw sensors data.
tests: Simulated test data for running the Jupyter Notebook files. Please see the Demo section for information about availability of the dataset used in this study.

System Requirements

Hardware Requirements

The code in the CIS_PD-NDM repo can be run on a standard computer with enough RAM to support processing of the complete dataset as defined by the user. For absolute minimum performance, a computer with 4 GB of RAM. For optimal performance, the following specifications are recommended:

RAM: 16+ GB

CPU: 4+ Cores, 2.8+ GHz/core

Additionally, CNNModels.ipynb uses the keras package in training the convolutional neural networks. Although keras can be run with a CPU, for optimal performance we recommend running CNNModels.ipynb on a computer with a dedicated GPU. The code has been tested on the following GPUs:

NVIDIA GeForce GTX TITAN X

NVIDIA GeForce GTX 1050

The runtimes were generated using a computer with 32 GB RAM, 4 cores @ 2.8 GHz (i7-7700HQ), an NVIDIA GeForce GTX 1050 GPU, and an internet speed of 20 Mbps.

Software Requirements

OS Requirements

The package development was tested on Mac OSX and Windows operating systems, and has been tested on the following versions:

Mac OSX: Sierra 10.12.6

Windows: 7 and 10

All of the packages used to run these Jupyter Notebook files were installed using either pip or Anaconda and should be compatible across all platforms. We ran the code on Anaconda using Python 3.6.2 or higher.

Four main packages used in the code were tested and developed using the versions listed below:

pandas: 0.20.3+
keras: 2.0.2
tensorflow: 1.8.0
theanos: 0.9.0

Installation Guide

Users should first install the latest version of Anaconda using the following link and downloading the installer or appropriate version for Python 3.6 version. This will automatically install a majority of the default Python packages needed, along with Jupyter Notebook.

Package Installation

There are a number of additional packages and software required to fully run all of the code in the CIS_PD-NDM repository.

Features Calculation

For features calculation on the raw data, the nolds package is required and can be installed using the following command through pip:

pip install nolds

and will complete in no longer than 1 minute on a recommended machine.

Additionally, the nolds .whl file has been provided and is located here and can be installed using the following command:

sudo pip install --upgrade nolds-0.4.1-py2.py3-none-any.whl

CNN Calculation

In order to run CNNModels.ipynb, an installation of keras and a backend of either tensorflow or theano is required.

TensorFlow

tensorflow can be installed using the following command on a dedicated GPU machine through pip

pip3 install --upgrade tensorflow-gpu

or for a CPU-only machine:

pip3 install --upgrade tensorflow

Additional installation instructions for tensorflow can be found here.

Theano

theano can be installed using conda with the following command:

conda install theano pygpu

Additional installation instructions for theano can be found here.

Keras

Finally, once a tensorflow or theano backend has been chosen and installed, the last package required to run all of the code is the installation of keras. This can be done through pip using the following command:

pip install keras

Alternatively, one can also use conda to install keras.

Additional installation instructions for keras can be found here.

GPU Drivers

Of note for the NVIDIA graphics cards that the code was tested on is the requirement of installing the appropriate CUDA drivers, if the graphics card is CUDA-capable. For the NVIDIA GeForce GTX 1050 used for a majority of the code, CUDA 9.2 was installed and tested when running the CNN operation code.

More information can be found on the CUDA Toolkit Documentation.

Installation Issues

If you encounter any issues with installing any of these required packages, or still encounter issues running the code after successfully installing them, please raise an Issue.

Demo

Last updated: 6/14/2018

The dataset used to support the findings of this publication are available from the Michael J. Fox Foundation but restrictions apply to the availability of these data, which were used under license for this study. The Michael J. Fox Foundation plans to release the dataset used in this publication alongside a significant, additional portion of related PD data from a separate smartwatch as part of a community analysis in the larger CIS-PD study timeline. Data are however available from the authors upon reasonable request and with permission from the Michael J. Fox Foundation.

Currently, a limited "toy" dataset containing simulated data using identical sensors and sensor placement is available here. The dataset is limited in that it does not contain all of the tasks performed in the actual study and does not encompass multiple trials across multiple days of data. Structurally, however, it is identical to the study dataset. The code in this reposititory has been commented / modified to run while using the limited "toy" dataset. These changes will be reverted once the main dataset is available.

If you encounter any issues with playing around with the "toy" dataset, please raise an Issue. It is probable that many of these issues will resolve themselves once the complete dataset used in the study is available, with permission from the Michael J. Fox Foundation as they follow their scheduled CIS-PD data release timeline.

bmswgnp/CIS_PD-NDM