/KArSL

Primary LanguageJavaScript

KArSL

KArSL (KFUPM Arabic Sign Language) is an Arabic sign language (ArSL) database collected using Microsoft Kinect V2. The database consists of 502 sign words constituting the sign words of eleven chapters of ArSL dictionary (Letters, Numbers, Health, Common verbs, Family, Characteristics, Directions and places, Social relationships, In house, Religion, and Jobs and professions). Each sign of the database is performed by three professional signers. The signers involved in this database are all male and their age is between 30 and 40 years. Each signer repeated each sign 50 times which resulted in a total of 75,300 samples of the whole database (502 x 3 x 50) as shown in the table below.

KArSL is the first database for ArSL with such a large number of samples. Table 2 shows a comparison between our database and the top five largest databases used in the literature.

Setup and recording software

All signs of KArSL are recorded in an unconstrained environment. We didn’t use dedicated lights in the recording room as the room lights were adequate and no shadow is shown in the records. We used fixed background (green) to facilitate background removal for researchers who prefer using color video recording. In addition, the signers were not restricted to wear specific clothes or remove eye glasses or watches. Each sign is recorded by each signer in two sessions where the signer wearing different clothes in each session. To add more variety to the database, some signs, alphabets, are performed alternately between the left and right hands of the signer.

Samples of the data

All signs are available in three modalities: (a) RGB, (b) depth, and (c) skeleton joint points as shown the followin figure.

Citing

If you use KArSL dataset, we kindly ask you to cite KArSL: Arabic Sign Language Database paper:

@article{sidig2021karsl, 
  title={KArSL: Arabic Sign Language Database}, 
  author={Sidig, Ala Addin I and Luqman, Hamzah and Mahmoud, Sabri and Mohandes, Mohamed}, 
  journal={ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP)},  
  volume={20}, 
  number={1}, 
  pages={1--19}, 
  year={2021}, 
  publisher={ACM New York, NY, USA} 
}

Dataset download

There are three subsets of the dataset:

KArSL-100

This dataset consists of 100 dynamic signs of KArSL dataset (from signID 0071 to 0170). Please follow the links below to download it:

To download the raw video files of this data, please follow the links below:

  • RGB video files [Signer 01, Signer 02, Signer 03]
  • Depth data [Signer 01, Signer 02, Signer 03]

KArSL-190

This dataset consists of 190 static and dynamic signs of KArSL dataset (from signID 0001 to 0190). Please follow the links below to download it:

To download the raw video files of this data, please follow the links below:

  • RGB video files
  • Depth data

KArSL-502

This dataset consists of 502 static and dynamic signs (whole KArSL dataset signs) (from signID 0001 to 0502). Please follow the links below to download it:

To download the raw video files of this data, please follow the links below:

  • RGB video files
  • Depth data