cannot run the IEMOCAP dataset on windows

Question

cannot run the IEMOCAP dataset on windows

nijaaouikhalil opened this issue 3 years ago · 24 comments

Hello, could you show the data folder architecture so I understand the way you organised the dataset.
I kept getting errors to segment the data.
I extracted the IEMOCAP_full_release in the data folder the renamed it as IEMOCAP, however, I kept getting errors of files not found.

Answer 1 · 2022-05-20T12:00:19.000Z

Hello,
To avoid taking up too much storage space, folders containing video and dialog audio were removed, and the database structure is as follows (plotted by tree command with depth 3). If you encounter more problems, feel comfortable and state them again.
Of course, you must change the path separator ( "/" for Linux) to ( "\" for Windows) anywhere needed

./IEMOCAP/
├── Session1
│   ├── dialog
│   │   └── EmoEvaluation
│   └── sentences
│       └── wav
├── Session2
│   ├── dialog
│   │   └── EmoEvaluation
│   └── sentences
│       └── wav
├── Session3
│   ├── dialog
│   │   └── EmoEvaluation
│   └── sentences
│       └── wav
├── Session4
│   ├── dialog
│   │   └── EmoEvaluation
│   └── sentences
│       └── wav
└── Session5
    ├── dialog
    │   └── EmoEvaluation
    └── sentences
        └── wav

Answer 2 · 2022-05-24T19:47:59.000Z

Hello @AryaAftab,
Thanks a lot for responding.
I've changed the path separator also the pattern in the read_dataset file (line 14) to '{}.\t(.?)\t[' because it wouldn't match as is
I have also changed the label in the same file to match the name of the wavefile. . ( was throwing file not found errors )
lable = label_format.format(wavename[4], wavename[:].replace("/", "\"))
and yet I still getting errors related to the segmentation step.
Can you confrim that the current main version works ?
Thanks a lot

Answer 3 · 2022-05-24T20:07:56.000Z

you're welcome @nijaaouikhalil,
Yes, the program works properly. The screenshot below shows the correct operation of the function (on Linux).
Why did you change the pattern? You just need to change the path separator.

Answer 4 · 2022-05-24T20:21:41.000Z

Thanks a lot for the quick reply.
I changed the label to the commented one here because otherwise the file doesn't exist.
also for the pattern I accounted for the F000 that follows the name of the wave file

Answer 5 · 2022-05-24T20:40:04.000Z

Don't mention it @nijaaouikhalil,
I think there is a problem with the iemocap_before_segment function in Windows OS. This problem does not exist in Linux OS. From the error, it can be concluded that there is a problem when extracting the filenames of txt (lines 32 to 39).

Answer 6 · 2022-05-25T14:09:25.000Z

can you elaborate what should be the label , the filename and wavename at that stage ( I am not sure why you remove the last 5 characters)
label = label_format.format(wavename[4], wavename[:-5])

also what should the regex match ? my understanding is that it should match the first occurrence of the wave filename (Ses01F_impro01) so we don't care about the (_F000) part .

Answer 7 · 2022-05-25T14:29:38.000Z

Assume you have a wav with Ses01F_impro01_F001.wav name:
Line 32 removes .wav from file:

wavename = filename[:-4] ---> Output: Ses01F_impro01_F001

Our label format is :

label_format = 'Session{}/dialog/EmoEvaluation/{}.txt'

In line 33, you separate session numbers and txt filename:

wavename[4] ---> Output: 1
wavename[:-5] ---> Output: Ses01F_impro01
label = label_format.format(wavename[4], wavename[:-5]) ---> Output: 'Session1/dialog/EmoEvaluation/Ses01F_impro01.txt'

In the end, you find Ses01F_impro01_F001 in your text and extract your label(in this example: neu)

Answer 8 · 2022-05-25T14:41:03.000Z

Thanks a lot @AryaAftab I fixed it, turns out the issue was that I didn't delete the wave files inside the dialog folder and didn't escape them when reading the filenames.

Answer 9 · 2022-05-25T14:58:37.000Z

Think nothing of it @nijaaouikhalil,
If the repository was useful to you, thank you for your support by starring.

Answer 10 · 2022-06-27T07:24:47.000Z

I dont konw why I cant train this model with windows and using 3080, always raise type valueError

and I run this exactly in colab raise the same error when i do in the pycharm

Answer 11 · 2022-06-27T12:07:08.000Z

I dont konw why I cant train this model with windows and using 3080, always raise type valueError and I run this exactly in colab raise the same error when i do in the pycharm

I solve the problem,and the reason is I made some mistakes with the [contents]

Answer 12 · 2022-06-27T14:57:01.000Z

Hi @STU-ECHO,
I was trying to debug your problem, but fortunately, you solved the problem yourself.
If the repository was useful to you, thank you for your support by starring.

Answer 13 · 2022-10-15T11:35:01.000Z

@AryaAftab
Hello, I can't find the file when I run “python train.py -dn "IEMOCAP" -id 7 -at "all" -ln "focal" -v 1 -it "mfcc" -c "disk" -m false”. I want to ask what may be the reason

Answer 14 · 2022-10-15T15:23:39.000Z

Hi @He-jia-wen
The problem created for you in the previous question was created for someone else. Did you read the previous questions and answers?

Answer 15 · 2022-10-15T15:30:00.000Z

Hi @He-jia-wen The problem created for you in the previous question was created for someone else. Did you read the previous questions and answers?

Yes, I have read the previous question. It seems that this question is' - 'and' _ ' I modified the code myself, but there is a new question I want to ask

Answer 16 · 2022-10-15T19:45:33.000Z

@He-jia-wen
I did not understand what exactly your question is. Your error says "you don't have the Ses01M_script02_0.txt file". Are you sure you didn't delete the label files or change their directory?

Answer 17 · 2022-11-11T14:09:50.000Z

I dont konw why I cant train this model with windows and using 3080, always raise type valueError and I run this exactly in colab raise the same error when i do in the pycharm

I solve the problem,and the reason is I made some mistakes with the [contents]

Hello,I meet the same problem with you,How did you solve it exactly in the end?

Answer 18 · 2022-12-06T12:24:42.000Z

Hello,I meet the same problem with you,How did you solve it exactly in the end?

Hello,I meet the same problem with you,How did you solve it exactly in the end?

Answer 19 · 2022-12-06T12:26:10.000Z

I dont konw why I cant train this model with windows and using 3080, always raise type valueError and I run this exactly in colab raise the same error when i do in the pycharm

I solve the problem,and the reason is I made some mistakes with the [contents]
how to fix?? thanks for reply

Answer 20 · 2022-12-06T12:30:59.000Z

you're welcome @nijaaouikhalil, Yes, the program works properly. The screenshot below shows the correct operation of the function (on Linux). Why did you change the pattern? You just need to change the path separator.

hey tanks for great job
i have some ques for this
tensorflow.python.framework.errors_impl.InvalidArgumentError: Value for attr 'Tindices' of double is not in the list of allowed values: int32, int64
; NodeDef: {{node GatherV2}}; Op<name=GatherV2; signature=params:Tparams, indices:Tindices, axis:Taxis -> output:Tparams; attr=batch_dims:int,default=0; attr=Tparams:type; attr=Tindices:type,allowed=[DT_INT32, DT_INT64]; attr=Taxis:type,allowed=[DT_INT32, DT_INT64]> [Op:GatherV2]

my tensor version is tensorflow 2.4.0rc0

Answer 21 · 2022-12-07T08:21:18.000Z

I dont konw why I cant train this model with windows and using 3080, always raise type valueError and I run this exactly in colab raise the same error when i do in the pycharm

I solve the problem,and the reason is I made some mistakes with the [contents]
how to fix?? thanks for reply

I feel sorry Cause I totally forget how I solve the problem,and unfortunately ,those code are saved in another PC which in a location I cant reach recently .I do have a little memorary semms that one of the audio file went wrong from session1,(the name of the file is wrong,).delete it. Hope its a memory from this code,maybe you can check it

Answer 22 · 2022-12-07T08:29:07.000Z

I dont konw why I cant train this model with windows and using 3080, always raise type valueError and I run this exactly in colab raise the same error when i do in the pycharm

I solve the problem,and the reason is I made some mistakes with the [contents]
how to fix?? thanks for reply

I feel sorry Cause I totally forget how I solve the problem,and unfortunately ,those code are saved in another PC which in a location I cant reach recently .I do have a little memorary semms that one of the audio file went wrong from session1,(the name of the file is wrong,).delete it. Hope its a memory from this code,maybe you can check it

Seems like its the IEMOCAP‘s directory have to be same with author's ,"https://github.com/AryaAftab/LIGHT-SERNET/issues/3#issuecomment-1132817523",those are my Memory fragments，try weather if it works

Answer 23 · 2022-12-07T14:49:03.000Z

Thanks a lot @AryaAftab I fixed it, turns out the issue was that I didn't delete the wave files inside the dialog folder and didn't escape them when reading the filenames.

hey，I'm so excited that you can reply me.Iam Chinese but my English is poor.I am try to solve the problem,can I have you wechat . I wanna discuss technology. My Wechat number is CM995958609

Answer 24 · 2022-12-08T02:20:36.000Z

you're welcome @nijaaouikhalil,
Yes, the program works properly. The screenshot below shows the correct operation of the function (on Linux).
Why did you change the pattern? You just need to change the path separator.

i fix it。is a fool question，thanks you reply 。Have a good day