/Ambiqual

Ambiqual full reference objective model for predicting listening quality and localization accuracy of ambisonic spatial audio.

Primary LanguagePythonApache License 2.0Apache-2.0

Ambiqual

Python implementation of Ambiqual full reference objective model as presented in the following paper:

Narbutt M, Skoglund J, Allen A, Chinen M, Barry D, Hines A. AMBIQUAL: Towards a Quality Metric for Headphone Rendered Compressed Ambisonic Spatial Audio. Applied Sciences. 2020; 10(9):3188. https://doi.org/10.3390/app10093188

Installation

This package can be installed using pip:

pip install git+https://github.com/QxLabIreland/Ambiqual

Usage

The program can be used using the command line tool:

python -m ambiqual --ref /path/to/dir/reference_signal --deg /path/to/dir/degraded_sginal [--level threshold] [--elc elc] [--ignorefreqbands freq_band]

Options

  • ref - Reference audio file.
  • deg - Degraded audio file.
  • level - Intensity binary mask threshold (in dB).
  • elc - Equal loudness correction mode:
    • 0 - No equal loudness correction.
    • 1 - Equal loudness correction by boosting low and high frequencies.
    • 2 - Equal loudness correction by attenuating low and high frequencies.
  • ignorefreqbands - Specifies high-frequency bands to ignore (range: 0 to 32):
    • 0 - All 32 frequency bands are considered.
    • k - Ignores from the k-th to the 32nd frequency bands in calculations.

Example

The intensity binary map threshold is set to -180dB and equal loudness contours are applied (attenuated low and high-frequency bands):

python -m ambiqual --ref validation/audiofiles/castanets_fixed_A60_E60_HOA_REF.wav --deg validation/audiofiles/castanets_fixed_A60_E60_HOA_512k.wav --level -180 --elc 2 --ignorefreqbands 0

Validation

To validate Ambiqual, we used ambiqual_test.py script located in the validation directory to run Ambiqual on a set of ambisonic audio files, as used in the paper's experiments. The resulting listening quality and localisation accuracy are then plotted against subjective scores, similar to Figures 11 and 12 in the paper. Note that due to a bug in the original figure plotting script, the scatter plots in Figure 11 of the paper are incorrect and the correct ones are depicted below.

Alt text
Results of experiment 1 (Figure 11 of the paper)



Alt text
Results of experiment 2 (Figure 12 of the paper)

Citation

If you use this code, please cite both the repository and the associated paper:

DOI

Narbutt M, Skoglund J, Allen A, Chinen M, Barry D, Hines A. AMBIQUAL: Towards a Quality Metric for Headphone Rendered Compressed Ambisonic Spatial Audio. Applied Sciences. 2020; 10(9):3188. https://doi.org/10.3390/app10093188

Licence

This project is licensed under the Apache 2.0 License.