R3VIVAL - Repository of Room Responses and 360 Videos of a Variable Acoustics Lab

SUMMARY OF THE DATA

This datset contains Spatial Room Impulse Responses (SRIR) and 360 stereoscopic video recordings of a variable acoustics lab. The dataset contains:

Channel	Position	Azimuth	Elevation	Microphone
1	Front	0	0	DPA 4060
2	Back	180	0	DPA 4060
3	Right Top	90	45	DPA 4060
4	Left Top	90	45	DPA 4060
5	Right Bottom	270	-45	DPA 4060
6	Left Bottom	270	-45	DPA 4060
7	Center	N/A	N/A	Earthworks M50

Render Binaural Room Impulse Responses
- Set up the Binaural SDM Toolbox and its dependencies²
- Run the example file
Generate video for a specific loudspeaker arrangement
Combine the data with the Audiovisual Speech Corpus³ to generate synthetic multi-talker scenes. Note that the link on the original publication is no longer available. A mirror source is provided in⁴

See the CONTRIBUTING file for how to help out.

R3VIVAL is CC-BY-4.0 licensed, as found in the LICENSE file.

Sebastia V. Amengual (samengual@meta.com)

Amengual Garí, S. V., Arend, J. M., Calamia, P. T., & Robinson, P. W. (2021). Optimizations of the spatial decomposition method for binaural reproduction. Journal of the Audio Engineering Society, 68(12), 959-976. https://doi.org/10.17743/jaes.2020.0063 ↩
https://github.com/facebookresearch/BinauralSDM/ ↩
Kishline, L. R., Colburn, S. W., & Robinson, P. W. (2020). A multimedia speech corpus for audio visual research in virtual reality (L). The Journal of the Acoustical Society of America, 148(2), 492-495. https://doi.org/10.1121/10.0001670 ↩
https://drive.google.com/drive/u/1/folders/1E_sC5SfjZCJOnkLkIN7MJ38-bwkAHQkY ↩