ERGL

The paper is available on arXiv: https://arxiv.org/abs/2210.15366

ERGL is still being uploaded.

Scene graphs consist of the top 25 events

The top 25 events are selected over the entire dataset rather than for each target scene individually, so some events may look out of place in a given scene graph. At the semantic level, these 25 event classes are slightly insufficient to describe the 10 scene classes. The top 25 events are chosen automatically by the classification model, without any human prior knowledge.
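As a rough illustration of dataset-level selection, the sketch below ranks events by their mean predicted probability over all clips and keeps the top k. The ranking criterion, the function name `top_k_events`, and the toy event names are assumptions made for illustration; this README does not state how the classification model scores events.

```python
from typing import List, Sequence


def top_k_events(
    clip_probs: Sequence[Sequence[float]],
    event_names: Sequence[str],
    k: int = 25,
) -> List[str]:
    """Rank events by mean predicted probability over all clips; keep the top k.

    Assumption: mean probability is the ranking criterion (hypothetical).
    """
    if not clip_probs:
        raise ValueError("clip_probs must contain at least one clip")
    n_events = len(event_names)
    totals = [0.0] * n_events
    for probs in clip_probs:
        if len(probs) != n_events:
            raise ValueError("each clip needs one probability per event")
        for i, p in enumerate(probs):
            totals[i] += p
    means = [t / len(clip_probs) for t in totals]
    # Sort event indices by descending mean; ties fall back to index order.
    ranked = sorted(range(n_events), key=lambda i: (-means[i], i))
    return [event_names[i] for i in ranked[:k]]


# Toy data: 3 clips, 4 hypothetical event classes.
toy_probs = [
    [0.9, 0.1, 0.4, 0.0],
    [0.8, 0.2, 0.5, 0.1],
    [0.7, 0.0, 0.6, 0.2],
]
toy_names = ["speech", "dog", "vehicle", "music"]
print(top_k_events(toy_probs, toy_names, k=2))
```

Because the ranking pools every clip regardless of scene, an event that is common overall can appear in the graph of a scene where it rarely occurs, which matches the caveat above.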

1. airport

2. bus

3. metro

4. metro station

5. park

6. public square

7. shopping mall

8. street pedestrian

9. street traffic

10. tram

Averaged multi-dimensional edge values between nodes in samples of different acoustic scenes

1. airport

2. bus

3. metro

4. metro station

5. park

6. public square

7. shopping mall

8. street pedestrian

9. street traffic

10. tram
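
The per-scene averaging of multi-dimensional edge values could be computed along the following lines. This is a minimal sketch under stated assumptions: each sample is taken to be a scene label plus a mapping from a node pair to an edge vector, and the function name `average_edges_by_scene`, the data layout, and the node names are hypothetical rather than taken from the ERGL code.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

Edge = Tuple[str, str]                # (source node, target node)
EdgeValues = Dict[Edge, List[float]]  # one multi-dimensional vector per edge


def average_edges_by_scene(
    samples: List[Tuple[str, EdgeValues]],
) -> Dict[str, EdgeValues]:
    """Average each edge vector element-wise over the samples of each scene."""
    sums: Dict[str, Dict[Edge, List[float]]] = defaultdict(dict)
    counts: Dict[str, Dict[Edge, int]] = defaultdict(lambda: defaultdict(int))
    for scene, edges in samples:
        for edge, vec in edges.items():
            if edge not in sums[scene]:
                sums[scene][edge] = [0.0] * len(vec)
            acc = sums[scene][edge]
            if len(acc) != len(vec):
                raise ValueError("edge vectors must share one dimensionality")
            for i, v in enumerate(vec):
                acc[i] += v
            counts[scene][edge] += 1
    # Divide each accumulated vector by the number of samples that had the edge.
    return {
        scene: {
            edge: [s / counts[scene][edge] for s in acc]
            for edge, acc in edge_sums.items()
        }
        for scene, edge_sums in sums.items()
    }


# Toy data: two "park" samples and one "tram" sample with 2-dimensional edges.
toy_samples = [
    ("park", {("bird", "wind"): [1.0, 3.0]}),
    ("park", {("bird", "wind"): [3.0, 5.0]}),
    ("tram", {("speech", "vehicle"): [2.0, 2.0]}),
]
averaged = average_edges_by_scene(toy_samples)
print(averaged["park"][("bird", "wind")])
```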