TGRS-HRRSD-Dataset: High Resolution Remote Sensing Detection (HRRSD)

I. NOTE: JPEG files are available on BaiduCloud, GoogleDrive, and an ipv6 site bt.byr.cn

HRRSD contains 21,761 images acquired from Google Earth and Baidu Map with the spatial resolution from 0.15-m to 1.2-m.
There are 55,740 object instances in HRRSD.
HRRSD contains 13 categories of RSI objects.

Moreover, this dataset is divided as several subsets, image numbers in each subset are 5401 for ‘train’, 5417 for ‘val’, and 10943 for ‘test’. And ‘train-val’ subset is a merge of ‘train’ and ‘val’.

II. Mean and Std

In most current object detection systems, means and std values of datasets are required.

You may refer to:

mean=[123.675, 116.28, 103.53], std=[58.395, 57.12, 57.375]

Moreover, you can compute the values by yourself with file OPT2017/avr_std_detection_sets.py:

$ cd your_HRRSD_path/OPT2017
$ python avr_std_detection_sets.py 500

III. Folders

Labels

/OPT2017/Annotations: *.xml
/OPT2017/labels: *.txt with the form of (class x y width height)

Images

/OPT2017/JPEGImages: *.jpg

Dataset Division

/OPT2017/ImageSets/Main: Division of the dataset.

IV. Statistics and Benchmark

Statistics

Label	Name	N_Train	N_Val	N_Trainval	N_Test	N_All	Mean Resized Scale /pixel	Resized Scale Std /pixel
1	ship	950	948	1898	1988	3886	167.44	110.37
2	bridge	1123	1121	2244	2326	4570	246.10	110.53
3	ground track field	859	856	1717	2017	3734	276.50	100.65
4	storage tank	1099	1092	2191	2215	4406	125.60	68.41
5	basketball court	923	920	1843	2033	3876	108.19	57.46
6	tennis court	1043	1040	2083	2212	4295	102.71	38.80
7	airplane	1226	1222	2448	2451	4899	113.21	67.98
8	baseball diamond	1007	1004	2011	2022	4033	231.61	117.85
9	harbor	967	964	1931	1953	3884	163.96	94.16
10	vehicle	1188	1186	2374	2382	4756	41.96	9.99
11	crossroad	903	901	1804	2219	4023	220.54	59.24
12	T junction	1066	1065	2131	2289	4420	198.71	54.88
13	parking lot	1241	1237	2478	2480	4958	122.85	54.45

In this table, N_* refers to numbers of objects. 'Train', 'Val', 'Test' are three subsets of the dataset. 'Mean Resized Scale' shows average scale of each category. 'Resized Scale Std' is the standard deviation of category scale.

Benchmark

Category	YOLO-v2 /%	Fast R-CNN /%	Fast_R-CNN_r50 + GACL-Net /%	Faster R-CNN /%	Faster_R-CNN_r50 + GACL-Net /%
Airplane	84.6	83.3	85.1	90.8	90.8
Baseball Diamond	62.2	83.6	82.6	86.9	87.2
Basketball Court	41.3	36.7	42.1	47.9	49.7
Bridge	79.0	75.1	76.7	85.5	85.6
Crossroad	43.4	67.1	68.7	88.6	88.2
Ground Track Field	94.4	90.0	89.6	90.6	90.7
Harbor	74.4	76.0	78.4	89.4	89.7
Parking Lot	45.8	37.5	39.5	63.3	65.3
Ship	78.5	75.0	74.3	88.5	88.5
Storage Tank	72.4	79.8	80.4	88.7	89.2
T Junction	46.8	39.2	38.8	75.1	75.0
Tennis Court	67.6	75.0	77.0	80.7	80.8
Vehicle	65.1	46.1	50.7	84.0	86.9
Mean AP	65.8	66.5	68.0	81.5	82.1

GACL-Net \cite{lu2020gated} is a method proposed to improve object localization performance. Title of this paper is "Gated and Axis-Concentrated Localization Network for Remote Sensing Object Detection". The suffix "r50" denotes using 50 layer Resnet.

V. FAQ

If any question is met, please contanct me with the e-mail: 1153463027@qq.com.

Qestion 1: AP for the "T junction" class is always NAN or 0, why?

Anwser Q1: In some object detection frameworks, there may be a piece of code like "cls_names = lower( cls_names )". This will set class names to lower case, but class names in xml files contain "T junction" where "T" is uppercase. This actually will cause several problems. The solution is using debug sofwares to find the code of changing word cases and correct it. For the dataset, I won't change the "T junction" labels in xmls currently for lacking time.

VI. Citation

If you find HRRSD dataset useful in your research, please consider citing:

@article{zhang2019hierarchical,
  title={Hierarchical and Robust Convolutional Neural Network for Very High-Resolution Remote Sensing Object Detection},
  author={Zhang, Yuanlin and Yuan, Yuan and Feng, Yachuang and Lu, Xiaoqiang},
  journal={IEEE Transactions on Geoscience and Remote Sensing},
  volume={57},
  number={8},
  pages={5535--5548},
  year={2019},
  publisher={IEEE}
}

For more comparative experimental results, please refer to:

@article{lu2020gated,
  title={Gated and Axis-Concentrated Localization Network for Remote Sensing Object Detection},
  author={Lu, Xiaoqiang and Zhang, Yuanlin and Yuan, Yuan and Feng, Yachuang},
  journal={IEEE Transactions on Geoscience and Remote Sensing},
  volume={58},
  number={1},
  pages={179--192},
  year={2020},
  publisher={IEEE}
}

Liujianhui1986/TGRS-HRRSD-Dataset