introlab/odas

What does the raw output mean?

Femtofirst opened this issue · 1 comments

I am using a minidsp 8-channel microphone for running ODAS, and below is some raw output from terminal - I know the first part is the result of SSL and the second part is the result of SST, the 'x,y,z ' in SSL result indicates the 3d vector of sound sources in the space and the 'E' stands for energy, indicating the intensity of sound.

{
"timeStamp": 2351,
"src": [
{ "x": -0.500, "y": 0.162, "z": 0.851, "E": 0.268 },
{ "x": -0.564, "y": -0.097, "z": 0.820, "E": 0.159 },
{ "x": -0.199, "y": 0.274, "z": 0.941, "E": 0.131 },
{ "x": -0.906, "y": 0.420, "z": 0.059, "E": 0.105 }
]
}
{
"timeStamp": 2351,
"src": [
{ "id": 0, "tag": "", "x": 0.000, "y": 0.000, "z": 0.000, "activity": 0.000 },
{ "id": 34, "tag": "dynamic", "x": -0.458, "y": -0.807, "z": 0.374, "activity": 0.000 },
{ "id": 0, "tag": "", "x": 0.000, "y": 0.000, "z": 0.000, "activity": 0.000 },
{ "id": 0, "tag": "", "x": 0.000, "y": 0.000, "z": 0.000, "activity": 0.000 }
]
}

And I am wondering:

  1. What do the "x,y,z" , the "ID" tag and the 'activity' in the SST result mean?
  2. Why there are only 4 lines of output for each?
  3. Which sound source is tracked in the result of SST? Is it the one with the highest energy?

Ahh, figured it out. The ODAS paper is very helpful and explains very well: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9131248/