showlab/EgoVLP

About the verb frequency

dreamerlin opened this issue · 0 comments

Thanks for your great work!

I calculated the the statistics on the created word annotations in EgoClip by myself:

#C C looks around 79405
#C C walks around 28064
#C C turns around 13345
#C C moves around 7310
#C C walks around the room 5806
#C C looks around the room 5531
#C C walks in the room 4641
#C C walks around the house 4136
#C C stands up 3907
#C C adjusts the camera 3762

It seems the most frequent verbs are look and walk. But in the Fig. 7(a) of your paper, they are put and take.
image


I also notice that in B.1 (iv) you removed narrations less than 3 words like #C C looks, but #C C looks around has 4 words, so it should not be excluded. Have you filtered some sentences like #C C looks around ?