shamanez/Self-Supervised-Embedding-Fusion-Transformer

2-class problem

Chenpuh opened this issue · 12 comments

Is 2-class the following code in fairseq/data/raw_audio_text_video_dataset.py?
If so, why are the sentiment_score 0 and 1?

            self.emotion_dictionary = {   #modei senti 2 class
            
                '0': 0,
                '1':1
            }

Sorry, I don't understand what you mean.

in the paper MSAF Multimodal Split Attention Fusion:
For binary classification, we consider [-3, 0) labels as negative and (0, 3] as positive.

I want to know what 0 and 1 stand for in your repository?

In *.csv, the format is as follows:

FileName,sentiment_score
03bSnISJMiM_1,2.400000095367432
03bSnISJMiM_3,-1.0
03bSnISJMiM_4,-1.75
03bSnISJMiM_5,0.0
03bSnISJMiM_6,0.0
03bSnISJMiM_7,0.8000000119209291
03bSnISJMiM_9,0.20000000298023224
03bSnISJMiM_11,-0.5
03bSnISJMiM_12,2.200000047683716
0h-zjBukYpk_1,-0.4000000059604645
0h-zjBukYpk_2,-2.799999952316284
0h-zjBukYpk_3,-0.20000000298023224
0h-zjBukYpk_4,-1.399999976158142
0h-zjBukYpk_5,0.8000000119209291
0h-zjBukYpk_6,-1.399999976158142
0h-zjBukYpk_8,0.8000000119209291

They are all floating point numbers. How can we make them 0 or 1?

In mosei/BA senti/label_file_*.csv, the format is as follows:

FileName,SEVEN,TWO
5Fs_7A_V2kA_0,1,1
5Fs_7A_V2kA_1,1,1
5Fs_7A_V2kA_2,1,1
5Fs_7A_V2kA_3,1,1
5Fs_7A_V2kA_4,1,1
Wfn7XvSEwUA_0,-2,0
Wfn7XvSEwUA_1,0,non
Wfn7XvSEwUA_2,2,1
Wfn7XvSEwUA_3,1,1
Wfn7XvSEwUA_4,0,non
Wfn7XvSEwUA_5,1,1
Wfn7XvSEwUA_6,1,1
Wfn7XvSEwUA_7,1,1
47472_0,-2,0
47472_1,0,non
47472_2,-1,0
47472_3,-2,0
47472_4,-2,0

How are they generated? Ceiling function?

Both sentiment_score or BA senti can be used in this repository, right?
Or need to modify some code to use it?

In other words, 7-classes and 2-classes are the same, but the evaluation criteria are different. Is 7-classes more detailed?

Thank you very much for your help.