This website contains datasets of Channel State Information (CSI) traces for sign language recognition using WiFi.
Yongsen Ma, Gang Zhou, Shuangquan Wang, Hongyang Zhao, and Woosub Jung. 2018. SignFi: Sign Language Recognition Using WiFi. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 2, 1, Article 23 (March 2018), 21 pages. DOI: https://doi.org/10.1145/3191755
A comparision of sign language recognition technologies is shown in the following figure.
This repository contains the following files. You agree to the Terms of Use for SignFi Dataset to download and use the files.
Files | Description | Size |
---|---|---|
dataset_lab_276_dl.mat | Segmented downlink CSI traces for 276 sign words in the lab environment. There are 5,520 instances of 276 sign gestures performed by one user. | 1.44GB |
dataset_lab_276_ul.mat | Segmented uplink CSI traces for 276 sign words in the lab environment. There are 5,520 instances of 276 sign gestures performed by one user. | 1.33GB |
dataset_home_276.mat | Segmented downlink and uplink CSI traces for 276 sign words in the home environment. There are 2,760 instances of 276 sign gestures performed by one user. | 1.37GB |
dataset_lab_150.mat | Segmented downlink CSI traces for 150 sign words in the lab environment. There are 7,500 instances of 150 sign gestures performed by five users. | 1.93GB |
signfi_cnn_example.m | MATLAB source code for training and testing using the dataset. | 3KB |
training_screen_shot.png | A screen shot of the training process | 541KB |
sign_labels.csv | Labels for 276 sign words used in the measurement. | 2KB |
videos/ | This folder contains the videos for 276 basic sign words. These videos are used for the participant to learn and perform sign gestures during data collection. | 105.9MB |
README.md | Readme | 5KB |
Both downlink and uplink CSI traces are included for the 276 sign words in the lab and home environments. Only downlink CSI traces are used in the paper. Please check the paper for more details about experiment setup, measurement procedure, WiFi settings, etc.
The following shows an example of how to train the dataset. Load the dataset csid_lab
and label_lab
into the Matlab workspace by running load('dataset_lab_276_dl.mat');
. Run [net_info, perf] = signfi_cnn_example(csid_lab,label_lab);
to train the neural network and get recognition results. This shows the training process in the following figure.
The following shows how signfi_cnn_example
works.
- Prepare data for training.
tic; % starting time
csi_abs = abs(csi); % CSI amplitude
csi_ang = angle(csi); % CSI phase
csi_tensor = [csi_abs,csi_ang];
word = categorical(label);
t0 = toc; % pre-processing time
tic
and toc
are used to measure time consumption. The size of csid_lab
is size(csid_lab)=(200,30,3,5520)
. csi_tensor
and word
are the CSI input and labels.
- Some parameter settings.
% [M,N,S]: CSI matrix for each instance
% T: the total number of instances
[M,N,S,T] = size(csi_tensor);
Nw = 276; % number of classes
rng(42); % For reproducibility
n_epoch = 10;
learn_rate = 0.01;
l2_factor = 0.01;
- Divide the dataset into training and testing subsets.
K = 5;
cv = cvpartition(T,'kfold',K); % 20% for testing
k = 1; % for k=1:K
trainIdx = find(training(cv,k));
testIdx = find(test(cv,k));
trainCsi = csi_tensor(:,:,:,trainIdx);
trainWord = word(trainIdx,1);
testCsi = csi_tensor(:,:,:,testIdx);
testWord = word(testIdx,1);
valData = {testCsi,testWord};
- Set neural network layers and training options. MATLAB 2017b or newer versions are needed to use
batchNormalizationLayer()
for network layers andValidationData
for training options. FortrainingOptions()
,'ExecutionEnvironmnet'
can be'cpu'
,'gpu'
, or'parallel'
.
% Neural network layer settings
layers = [imageInputLayer([M N S]);
convolution2dLayer(4,4,'Padding',0);
batchNormalizationLayer();
reluLayer();
maxPooling2dLayer(4,'Stride',4);
fullyConnectedLayer(Nw);
softmaxLayer();
classificationLayer()];
% Neural network training options
options = trainingOptions('sgdm','ExecutionEnvironment','parallel',...
'MaxEpochs',n_epoch,...
'InitialLearnRate',learn_rate,...
'L2Regularization',l2_factor,...
'ValidationData',valData,...
'ValidationFrequency',10,...
'ValidationPatience',Inf,...
'Shuffle','every-epoch',...
'Verbose',false,...
'Plots','training-progress');
- Train and test the neural network; calculate recognition accuracy.
[trainedNet,tr{k,1}] = trainNetwork(trainCsi,trainWord,layers,options);
t1 = toc; % training end time
[YTest, scores] = classify(trainedNet,testCsi);
TTest = testWord;
test_accuracy = sum(YTest == TTest)/numel(TTest);
t2 = toc; % testing end time
net_info = tr;
perf = test_accuracy;
- Plot the confusion matrix. Since there are too many classes, it is not reasonable to plot the whole confusion matrix. You can divide the classes into different categories and then plot the confusion matrix.
% plot confusion matrix
ttest = dummyvar(double(TTest))';
tpredict = dummyvar(double(YTest))';
[c,cm,ind,per] = confusion(ttest,tpredict);
plotconfusion(ttest,tpredict);
You need to read and agree to the following terms of use to download and use the SignFi dataset.
1. Definitions
The following terms, unless the context requires otherwise, have the following meanings:
“Data Team”: means the employees and students at the College of William and Mary who are working on the SignFi dataset;
“SignFi Dataset”: means the channel state information traces collected by the Data Team;
“Licensee”, “You”, “Your”, “Yours”: means the person or entity acquiring a license hereunder for access to and use of the SignFi Dataset.
2. Grant of License
The College of William and Mary hereby grants to You a non-exclusive, non-transferable, revocable license to use the SignFi Dataset solely for Your non-commercial, educational, and research purposes only, but without any right to copy or reproduce, publish or otherwise make available to the public or communicate to the public, sell, rent or lend the whole or any constituent part of the SignFi Dataset thereof. The SignFi Dataset shall not be redistributed without the express written prior approval of the College of William and Mary You agree to respect the privacy of those human subjects whose sign gesture data are included in the SignFi Dataset. Do not attempt to reverse the anonymization process to identify specific identifiers including, without limitation, names, postal address information, telephone numbers, e-mail addresses, social security numbers, and biometric identifiers. You agree not to reverse engineer, separate or otherwise tamper with the SignFi Dataset so that data can be extracted and used outside the scope of that permitted in this Agreement.
You agree to acknowledge the source of the SignFi Dataset in all of Your publications and presentations based wholly or in part on the SignFi Dataset. You agree to provide a disclaimer in any publication or presentation to the effect that the College of William and Mary does not bear any responsibility for Your analysis or interpretation of SignFi Dataset.
You agree and acknowledge that the College of William and Mary may hold, process, and store any personal data submitted by You for validation and statistical purposes and for the purposes of the administration and management of SignFi Dataset. You agree that any personal data submitted by You is accurate to the best of his or her knowledge.
The College of William and Mary provides the SignFi Dataset "AS IS," without any warranty or promise of technical support, and disclaims any liability of any kind for any damages whatsoever resulting from use of the SignFi Dataset .
THE COLLLEGE OF WILLIAM AND MARY MAKES NO WARRANTIES, EXPRESS OR IMPLIED WITH RESPECT TO THE SIGNFI DATASET, INCLUDING ANY IMPLIED WARRANTY OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE, WHICH ARE HEREBY EXPRESSLY DISCLAIMED.
Your acceptance and use of the SignFi Dataset binds you to the terms and conditions of this License as stated herein.