The HDIN dataset: A Real-world Indoor UAV Dataset with Multi-task Labels for Visual-based Navigation
This repository contains the data preprocessing code for the original HDIN dataset.
If you use HDIN dataset in an academic context, please cite the following publication:
Paper: The HDIN dataset
@Article{drones6080202,
AUTHOR = {Chang, Yingxiu and Cheng, Yongqiang and Murray, John and Huang, Shi and Shi, Guangyi},
TITLE = {The HDIN Dataset: A Real-World Indoor UAV Dataset with Multi-Task Labels for Visual-Based Navigation},
JOURNAL = {Drones},
VOLUME = {6},
YEAR = {2022},
NUMBER = {8},
ARTICLE-NUMBER = {202},
URL = {https://www.mdpi.com/2504-446X/6/8/202},
ISSN = {2504-446X},
ABSTRACT = {Supervised learning for Unmanned Aerial Vehicle (UAVs) visual-based navigation raises the need for reliable datasets with multi-task labels (e.g., classification and regression labels). However, current public datasets have limitations: (a) Outdoor datasets have limited generalization capability when being used to train indoor navigation models; (b) The range of multi-task labels, especially for regression tasks, are in different units which require additional transformation. In this paper, we present a Hull Drone Indoor Navigation (HDIN) dataset to improve the generalization capability for indoor visual-based navigation. Data were collected from the onboard sensors of a UAV. The scaling factor labeling method with three label types has been proposed to overcome the data jitters during collection and unidentical units of regression labels simultaneously. An open-source Convolutional Neural Network (i.e., DroNet) was employed as a baseline algorithm to retrain the proposed HDIN dataset, and compared with DroNet’s pretrained results on its original dataset since we have a similar data format and structure to the DroNet dataset. The results show that the labels in our dataset are reliable and consistent with the image samples.},
DOI = {10.3390/drones6080202}
}
Due to limitations that the current public datasets are mostly collected from outdoor environments which lead to the limitations of indoor generalization capabilities, we proposed an HDIN indoor dataset by collecting data only based on the UAV and its onboard sensors with scaling factor labeling methods to overcome the data jitters and unidentical label units. This repository contains the code to preprocess the original HDIN dataset for visual-based navigation based on Multi-task supervised learning.
This code has been tested on Ubuntu 18.04, and on Python 3.6.
Dependencies:
- TensorFlow 1.5.0
- Keras 2.1.4 (Make sure that the Keras version is correct!)
- NumPy 1.12.1
- OpenCV 3.1.0
- scikit-learn 0.18.1
- Python gflags
- Python matplotlib
- h5py 2.10.0
The dataset can be downloaded from here: Dataset
Once Udacity dataset is downloaded, extract the contents. After extraction, the original processed structure of the entire dataset should look like this:
HDIN
training/
collision001/
images/
labels.txt
steer001
images/
label.txt
...
validation
collision006/
images/
labels.txt
steer009
images/
label.txt
...
testing
collision011
images/
labels.txt
steer010
images/
label.txt
...
Download this repository https://github.com/Yingxiu-Chang/HDIN-Dataset.git
and copy the HDIN/
folder into this repository path.
There are three scaling factor labeling methods to process the original dataset:
- Expected Steering labeling method
python dataset_preprocessing.py [methods]
Use [methods]
to set the labeling method.
Example:
python dataset_preprocessing.py expected_steering
- Fitting Angular Velocity labeling method Example:
python dataset_preprocessing.py fitting_angular_velocity
- Scalable Angular Velocity labeling method Example:
python dataset_preprocessing.py scalable_angular_velocity
The final structure of the steering dataset should look like this:
HDIN
training/
collision001/
images/
labels.txt
steer001
images/
(label.txt)
sync_steering.txt
...
validation
collision006/
images/
labels.txt
steer009
images/
(label.txt)
sync_steering.txt
...
testing
collision011
images/
labels.txt
steer010
images/
(label.txt)
sync_steering.txt
...
- Download the DroNet repository.
git clone https://github.com/uzh-rpg/rpg_public_dronet.git
- Copy the training, validation and testing sets from the processed final HDIN dataset to the DroNet path.
- Modify the 118-122 lines of
cnn.py
in DroNet:
Generate training data with real-time augmentation
train_datagen = utils.DroneDataGenerator(rotation_range = 0.2,
rescale = 1./255,
width_shift_range = 0.2,
height_shift_range=0.2)
to
train_datagen = utils.DroneDataGenerator(rescale = 1./255)
- Following the instructions of DroNet to train the dataset. Example (RGB images):
python cnn.py --experiment_rootdir='./model/xxx' --train_dir='./training' --val_dir='./validation' --batch_size=16 --epochs=150 --log_rate=25 --img_mode='rgb'
Example (Gray images):
python cnn.py --experiment_rootdir='./model/xxx' --train_dir='./training' --val_dir='./validation' --batch_size=16 --epochs=150 --log_rate=25
- Evaluate model performances. Example (RGB images):
python evaluation.py --experiment_rootdir='./model/xxx' --weights_fname='model_weights_xxx.h5' --test_dir='./testing' --img_mode='rgb'
Example (Gray images):
python evaluation.py --experiment_rootdir='./model/xxx' --weights_fname='model_weights_xxx.h5' --test_dir='./testing'