cannot run the IEMOCAP dataset on windows
nijaaouikhalil opened this issue · 24 comments
Hello, could you show the data folder architecture so I understand the way you organised the dataset.
I kept getting errors to segment the data.
I extracted the IEMOCAP_full_release in the data folder the renamed it as IEMOCAP, however, I kept getting errors of files not found.
Hello,
To avoid taking up too much storage space, folders containing video and dialog audio were removed, and the database structure is as follows (plotted by tree command with depth 3). If you encounter more problems, feel comfortable and state them again.
Of course, you must change the path separator ( "/" for Linux) to ( "\" for Windows) anywhere needed
./IEMOCAP/
├── Session1
│ ├── dialog
│ │ └── EmoEvaluation
│ └── sentences
│ └── wav
├── Session2
│ ├── dialog
│ │ └── EmoEvaluation
│ └── sentences
│ └── wav
├── Session3
│ ├── dialog
│ │ └── EmoEvaluation
│ └── sentences
│ └── wav
├── Session4
│ ├── dialog
│ │ └── EmoEvaluation
│ └── sentences
│ └── wav
└── Session5
├── dialog
│ └── EmoEvaluation
└── sentences
└── wav
Hello @AryaAftab,
Thanks a lot for responding.
I've changed the path separator also the pattern in the read_dataset file (line 14) to '{}.\t(.?)\t[' because it wouldn't match as is
I have also changed the label in the same file to match the name of the wavefile. . ( was throwing file not found errors )
lable = label_format.format(wavename[4], wavename[:].replace("/", "\"))
and yet I still getting errors related to the segmentation step.
Can you confrim that the current main version works ?
Thanks a lot
you're welcome @nijaaouikhalil,
Yes, the program works properly. The screenshot below shows the correct operation of the function (on Linux).
Why did you change the pattern? You just need to change the path separator.
Don't mention it @nijaaouikhalil,
I think there is a problem with the iemocap_before_segment function in Windows OS. This problem does not exist in Linux OS. From the error, it can be concluded that there is a problem when extracting the filenames of txt (lines 32 to 39).
can you elaborate what should be the label , the filename and wavename at that stage ( I am not sure why you remove the last 5 characters)
label = label_format.format(wavename[4], wavename[:-5])
also what should the regex match ? my understanding is that it should match the first occurrence of the wave filename (Ses01F_impro01) so we don't care about the (_F000) part .
Assume you have a wav with Ses01F_impro01_F001.wav name:
Line 32 removes .wav from file:
wavename = filename[:-4] ---> Output: Ses01F_impro01_F001
Our label format is :
label_format = 'Session{}/dialog/EmoEvaluation/{}.txt'
In line 33, you separate session numbers and txt filename:
wavename[4] ---> Output: 1
wavename[:-5] ---> Output: Ses01F_impro01
label = label_format.format(wavename[4], wavename[:-5]) ---> Output: 'Session1/dialog/EmoEvaluation/Ses01F_impro01.txt'
In the end, you find Ses01F_impro01_F001 in your text and extract your label(in this example: neu)
Thanks a lot @AryaAftab I fixed it, turns out the issue was that I didn't delete the wave files inside the dialog folder and didn't escape them when reading the filenames.
Think nothing of it @nijaaouikhalil,
If the repository was useful to you, thank you for your support by starring.
Hi @STU-ECHO,
I was trying to debug your problem, but fortunately, you solved the problem yourself.
If the repository was useful to you, thank you for your support by starring.
@AryaAftab
Hello, I can't find the file when I run “python train.py -dn "IEMOCAP" -id 7 -at "all" -ln "focal" -v 1 -it "mfcc" -c "disk" -m false”. I want to ask what may be the reason
Hi @He-jia-wen
The problem created for you in the previous question was created for someone else. Did you read the previous questions and answers?
Hi @He-jia-wen The problem created for you in the previous question was created for someone else. Did you read the previous questions and answers?
Yes, I have read the previous question. It seems that this question is' - 'and' _ ' I modified the code myself, but there is a new question I want to ask
@He-jia-wen
I did not understand what exactly your question is. Your error says "you don't have the Ses01M_script02_0.txt file". Are you sure you didn't delete the label files or change their directory?
I dont konw why I cant train this model with windows and using 3080, always raise type valueError and I run this exactly in colab raise the same error when i do in the pycharm
I solve the problem,and the reason is I made some mistakes with the [contents]
Hello,I meet the same problem with you,How did you solve it exactly in the end?
you're welcome @nijaaouikhalil, Yes, the program works properly. The screenshot below shows the correct operation of the function (on Linux). Why did you change the pattern? You just need to change the path separator.
hey tanks for great job
i have some ques for this
tensorflow.python.framework.errors_impl.InvalidArgumentError: Value for attr 'Tindices' of double is not in the list of allowed values: int32, int64
; NodeDef: {{node GatherV2}}; Op<name=GatherV2; signature=params:Tparams, indices:Tindices, axis:Taxis -> output:Tparams; attr=batch_dims:int,default=0; attr=Tparams:type; attr=Tindices:type,allowed=[DT_INT32, DT_INT64]; attr=Taxis:type,allowed=[DT_INT32, DT_INT64]> [Op:GatherV2]
my tensor version is tensorflow 2.4.0rc0
I dont konw why I cant train this model with windows and using 3080, always raise type valueError and I run this exactly in colab raise the same error when i do in the pycharm
I solve the problem,and the reason is I made some mistakes with the [contents]
how to fix?? thanks for reply
I feel sorry Cause I totally forget how I solve the problem,and unfortunately ,those code are saved in another PC which in a location I cant reach recently .I do have a little memorary semms that one of the audio file went wrong from session1,(the name of the file is wrong,).delete it. Hope its a memory from this code,maybe you can check it
I dont konw why I cant train this model with windows and using 3080, always raise type valueError and I run this exactly in colab raise the same error when i do in the pycharm
I solve the problem,and the reason is I made some mistakes with the [contents]
how to fix?? thanks for replyI feel sorry Cause I totally forget how I solve the problem,and unfortunately ,those code are saved in another PC which in a location I cant reach recently .I do have a little memorary semms that one of the audio file went wrong from session1,(the name of the file is wrong,).delete it. Hope its a memory from this code,maybe you can check it
Seems like its the IEMOCAP‘s directory have to be same with author's ,"https://github.com/AryaAftab/LIGHT-SERNET/issues/3#issuecomment-1132817523",those are my Memory fragments,try weather if it works
Thanks a lot @AryaAftab I fixed it, turns out the issue was that I didn't delete the wave files inside the dialog folder and didn't escape them when reading the filenames.
hey,I'm so excited that you can reply me.Iam Chinese but my English is poor.I am try to solve the problem,can I have you wechat . I wanna discuss technology. My Wechat number is CM995958609
you're welcome @nijaaouikhalil,
Yes, the program works properly. The screenshot below shows the correct operation of the function (on Linux).
Why did you change the pattern? You just need to change the path separator.
i fix it。is a fool question,thanks you reply 。Have a good day