This open dataset from the ITU-T P.1203 standardization procedure (P.NATS) contains the following data:
- 2 training and 2 validation databases
- Feature data:
- Mode 0 (metadata-level)
- Mode 1 (packet-header-level)
- Mode 2 (bitstream-level, 2 percent)
- Mode 3 (bitstream-level)
- Subjective ratings
- Model output scores based on
itu-p1203
version 1.2.1
Note: A previous version of this dataset contained a bug in the calculation, which lead to the wrong stalling values being used for model input. The performance reported in the ACM MMsys paper thus appears worse. The correct values are listed here. Thanks to Tran Thi Thanh Huyen at the University of Aizu for observing this error.
The data was analyzed with the ITU-T P.1203 implementation from the same authors. Please carefully read the license of that software.
To re-calculate the data, you have to:
- Install Python 3 and run
pip3 install --user -r requirements.txt
- Extract the
.tar.bz2
feature files in thefeatures
folder - Run
python3 create_model_outputs.py -c
The performance of the model as calculated based on this dataset is as follows:
Context | Mode | Mean RMSE | Mean RMSE* | Mean PLCC | Mean SROCC |
---|---|---|---|---|---|
mobile | 0 | 0.370 | 0.181 | 0.917 | 0.893 |
mobile | 1 | 0.375 | 0.184 | 0.913 | 0.896 |
mobile | 2 | 0.403 | 0.202 | 0.900 | 0.888 |
mobile | 3 | 0.426 | 0.225 | 0.889 | 0.880 |
pc | 0 | 0.462 | 0.265 | 0.869 | 0.838 |
pc | 1 | 0.439 | 0.259 | 0.883 | 0.874 |
pc | 2 | 0.398 | 0.209 | 0.910 | 0.897 |
pc | 3 | 0.367 | 0.174 | 0.926 | 0.908 |
If you use this software and data, or derivates of it, in your research, you must
-
Include the link to this repository
-
Cite the following publication:
Robitza, W., Göring, S., Raake, A., Lindegren, D., Heikkilä, G., Gustafsson, J., List, P., Feiten, B., Wüstenhagen, U., Garcia, M.-N., Yamagishi, K., Broom, S. (2018). HTTP Adaptive Streaming QoE Estimation with ITU-T Rec. P.1203 – Open Databases and Software. In 9th ACM Multimedia Systems Conference. Amsterdam.
@inproceedings{Robitza2017d, address = {Amsterdam}, author = {Robitza, Werner and G{\"{o}}ring, Steve and Raake, Alexander and Lindegren, David and Heikkil{\"{a}}, Gunnar and Gustafsson, J{\"{o}}rgen and List, Peter and Feiten, Bernhard and W{\"{u}}stenhagen, Ulf and Garcia, Marie-Neige and Yamagishi, Kazuhisa and Broom, Simon}, booktitle = {9th ACM Multimedia Systems Conference}, doi = {10.1145/3204949.3208124}, isbn = {9781450351928}, title = {{HTTP Adaptive Streaming QoE Estimation with ITU-T Rec. P.1203 – Open Databases and Software}}, year = {2018} }
The following features are available:
features/features_mode0.csv
: Per-output-sample aggregated features used for Pv mode 0, as parsed by the measurement window, columns:bitrate_kbps_segment_size
: Actual segment bitratebitrate_kbps_target
: Encoding target bitratecoding_height
: Encoding height in pixelscoding_res
: Number of encoded pixelscoding_width
: Encoding width in pixelsdisplay_height
: Height of the display in pixelsdisplay_res
: Number of pixels in displaydisplay_width
: Width of the display in pixelsdts
: Frame decoding timestampsframerate
: Encoding frames per secondsample_index
: Index of the output sample (1 per second)pvs_id
: Identifier of the PVS
features/features_mode1.csv
: Per-output-sample aggregated features used for Pv mode 1, as parsed by the measurement window, columns:bitrate_kbps_segment_size
: Actual segment bitratebitrate_kbps_target
: Encoding target bitratecoding_height
: Encoding height in pixelscoding_res
: Number of encoded pixelscoding_width
: Encoding width in pixelsdisplay_height
: Height of the display in pixelsdisplay_res
: Number of pixels in displaydisplay_width
: Width of the display in pixelsdts
: Frame decoding timestampsframerate
: Encoding frames per secondgop_length
: GOP length in secondsi_sizes_average
: Average size of I frames in Bytesiframe_ratio
: Ratio between I and Non-I frame sizesnoni_sizes_average
: Average size of non-I framessample_index
: Index of the output sample (1 per second)pvs_id
: Identifier of the PVS
features/features_mode2.tar.bz2
: Per-output-sample aggregated features used for Pv mode 2, as parsed by the measurement window. One CSV file for each PVS, see column description below. IfBS_TwoPercentQP1
is not present, 2% of the bitstream are not enough to provide bitstream data, hence the features from mode 1 needs to be used for calculation.features/features_mode3.tar.bz2
: Per-output-sample aggregated features used for Pv mode 3, as parsed by the measurement window. One CSV file for each PVS, with columns:pvs_id
: Identifier of the PVSmode
: (fixed to 3)codec
: Video codec (fixed to H.264)index
dts
: Frame decoding timestampstypes
: Frame typessizes
: Frame sizesframerate
: Encoding frames per secondbitrate_kbps_target
: Encoding target bitratecoding_res
: Number of encoded pixelsdisplay_res
: Number of pixels in displayquant
: Model-internal parameterBS_DecodedMbs
: Number of decoded macroblocksBS_MbTypes
: Count of macroblock types, meaning of array indices:0
: Skipped1
: Forward2
: Backward3
: Bidirect4
: Direct5
: Intra46
: Intra16
BS_Av_QPBB
: Average QP of 100% of bitstream.BS_TwoPercentQP1
: Average QP of 2% of bitstream, used for running model in Mode 2.
The subjective data is contained in the following files:
subjective_scores/mos.csv
: Per-PVS subjective MOS with 95% CI, columns:pvs_id
: Identifier of the PVScontext
: "pc" or "mobile"mos
: MOSn
: Number of ratings considered for MOSsd
: Standard deviation of the MOSci
: 95% confidence interval
subjective_scores/ratings.csv
: Per-subject ratings for each PVS, columns:pvs_id
: Identifier of the PVScontext
: "pc" or "mobile"subject
: Subject identifier (unique only within context/database)rating
: The subject's rating from 1 to 5 (Bad to Excellent) according to P.910 ACR scale.
The following data relates to model output:
O21.csv
: Per-output-sample O21 (audio quality) data, valid for all modes, columns:pvs_id
: Identifier of the PVSsample_index
: Index of the output sample (1 per second)O21
: Audio quality score
O22.csv
: Per-output-sample O22 (video quality) data, for all modes, columns:pvs_id
: Identifier of the PVSmode
: Calculated modesample_index
: Index of the output sample (1 per second)O22
: Video quality score
O46.csv
: Per-PVS O46 (video quality) data, for all modes, columns:pvs_id
: Identifier of the PVSmode
: Calculated modecontext
: PC or mobileO46
: Integrated audiovisual quality score
Detailed model input for and output as produced by the P.1203 software:
mode0
: One JSON file for each PVS and context (PC/mobile), containing all output keys, as well as one JSON file for each PVS and context (PC/mobile) with the suffix-input.json
that can be used as input file for the P.1203 software.mode1
: same as abovemode2
: same as abovemode3
: same as above
Subjective test database design:
test_configs/*.yaml
: YAML file containing test configurationaudioVisualQualityLevels
: list of quality representations, with the following entries:- video height
- video target bitrate
- quality level identifier
- audio target bitrate
defaultVideoDurationInSec
: PVS durationdisplayHeight
: height of the display, fixed to 1080pffmpegCommonAudioParams
: common settings for ffmpeg audio encodingffmpegCommonVideoParams
: common settings for ffmpeg video encodinghrcList
: list of HRCs, with each key being the HRC IDeventList
: list of events in each HRC, each event being a tuple of (event type, duration in seconds), where event type can be a quality representation ID or a "stall" event
segmentDurationInSec
: default duration of video segmentssrcList
: list of SRC IDs and a hash of their nametestName
: database IDx264CommonParams
: common settings for x264 encoder
test_configs/*.svg
: SVG plot of database design
Copyright 2018 Deutsche Telekom AG, Ericsson AB, NETSCOUT Systems Inc.
Permission is hereby granted, free of charge, to use this dataset for non-commercial research purposes.
NO EXPRESS OR IMPLIED LICENSES TO ANY PARTY'S PATENT RIGHTS ARE GRANTED BY THIS LICENSE. THE DATASET IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE DATASET OR THE USE OR OTHER DEALINGS IN THE DATASET.
Main developers:
- David Lindero, Ericsson AB
- Werner Robitza, Deutsche Telekom AG / Technische Universität Ilmenau
Contributors:
- Marie-Neige Garcia, Technische Universität Berlin
- Steve Göring, Technische Universität Ilmenau
- Alexander Raake, Technische Universität Ilmenau
- Peter List, Deutsche Telekom AG
- Bernhard Feiten, Deutsche Telekom AG
- Ulf Wüstenhagen, Deutsche Telekom AG
- Jörgen Gustafsson, Ericsson AB
- Gunnar Heikkilä, Ericsson AB
- Junaid Shaikh, Ericsson AB
- Simon Broom, NETSCOUT Systems Inc.