YAMNET Training for Bird Dataset

Question

Maanikya opened this issue 2 years ago · 5 comments

Maanikya commented 2 years ago

Dear Sir,

First of all, thank you very much for your email and for replying so promptly.

My problems are as follows:

I'm a beginner in the ML and DL field.
I'm having trouble replacing the dataset you used with the dataset I want to use, without using a CSV file as metadata.
I'm finding it hard to wrap my head around the functions you used to create the "train_ds" variable that is fed to the model for training.

My goal:
To develop a YAMNET model for bird audio classification and to implement it as an Android App.

I'm doing this as a team for our major project.

Link for the Dataset: https://www.kaggle.com/datasets/maanikya/yamnet-dataset-v2

Sir, can you please help us in tweaking the code to fit our requirements?

Answer 1 · 2023-04-18T17:50:32.000Z

I'm not able to look into all the details right now, but here are some recommendations.

1. tensorflow io (tfio) does not seem to work for resampling. Please use scipy instead, as in this notebook:
   https://www.tensorflow.org/hub/tutorials/yamnet
2. The original notebook (where tfio is used):
   https://www.tensorflow.org/tutorials/audio/transfer_learning_audio
3. Follow the dataset conventions in the notebook. Use the same conventions in your CSV file (which holds the paths to the audio files).
4. For training, I believe the embeddings are extracted first, and these are then used as features to fine-tune the classifier.

I can revisit the notebook later and update you on point 4.
Hope this was useful.
Also, please paste any specific errors you get while running the notebooks.
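
For the "no CSV" case raised in the question, one possible approach is to derive the same path and label columns from the folder structure. The sketch below assumes the audio is arranged as one sub-folder per bird species under a root directory; that layout, the function name `list_labeled_files`, and its parameters are illustrative assumptions, not taken from the notebook or from the linked dataset.

```python
from pathlib import Path
import random


def list_labeled_files(root, extensions=(".wav",), val_fraction=0.2, seed=0):
    """Return class names plus (path, label_index) pairs split into train/val.

    Assumes a hypothetical layout of root/<species_name>/<clip>.wav.
    """
    root = Path(root)
    # Each sub-folder name is treated as one class label.
    class_names = sorted(p.name for p in root.iterdir() if p.is_dir())
    examples = []
    for index, name in enumerate(class_names):
        for path in sorted((root / name).rglob("*")):
            # Keep only audio files; skip directories and other file types.
            if path.suffix.lower() in extensions:
                examples.append((str(path), index))
    # Shuffle deterministically so the split is reproducible.
    random.Random(seed).shuffle(examples)
    num_val = int(len(examples) * val_fraction)
    return class_names, examples[num_val:], examples[:num_val]
```

The returned paths and integer labels play the role of the CSV's filename and class columns, so they could be passed to `tf.data.Dataset.from_tensor_slices((paths, labels))` and then mapped through the same loading and embedding steps the tutorial uses.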

Answer 2 · 2023-04-18T18:36:37.000Z

Sure sir, I will try to incorporate your suggestions. I was able to train a simple YAMNET model using the dataset that I wanted to. I will try and improve it. Thank you.

Answer 3 · 2023-04-18T18:37:50.000Z

I will also share the notebook with you tomorrow.

Answer 4 · 2023-04-18T20:04:20.000Z

Also, tfio apparently works only with a specific TensorFlow version.
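
A minimal sketch of the scipy-based resampling recommended earlier in the thread, which avoids depending on a tfio build that matches the installed TensorFlow version. YAMNet expects mono float32 audio at 16 kHz in the range [-1, 1]; the helper name `to_yamnet_waveform` and the constant `YAMNET_SAMPLE_RATE` are hypothetical, and the integer scaling assumes signed PCM such as the int16 arrays returned by `scipy.io.wavfile.read`.

```python
from math import gcd

import numpy as np
from scipy.signal import resample_poly

YAMNET_SAMPLE_RATE = 16000  # YAMNet expects 16 kHz mono input


def to_yamnet_waveform(samples, sample_rate):
    """Convert signed-PCM or float samples to mono float32 at 16 kHz."""
    samples = np.asarray(samples)
    # Scale signed integer PCM (e.g. int16) to roughly [-1, 1].
    if np.issubdtype(samples.dtype, np.integer):
        samples = samples / float(np.iinfo(samples.dtype).max)
    samples = samples.astype(np.float32)
    # Average the channels if the clip is stereo: shape (num_samples, channels).
    if samples.ndim == 2:
        samples = samples.mean(axis=1)
    # Polyphase resampling by the reduced ratio target_rate / source_rate.
    if sample_rate != YAMNET_SAMPLE_RATE:
        divisor = gcd(YAMNET_SAMPLE_RATE, sample_rate)
        samples = resample_poly(
            samples, YAMNET_SAMPLE_RATE // divisor, sample_rate // divisor
        )
    return samples.astype(np.float32)
```

A clip read with `scipy.io.wavfile.read(path)` returns `(sample_rate, samples)`, which can be passed to this helper as `to_yamnet_waveform(samples, sample_rate)` before the waveform is given to the model.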

Answer 5 · 2023-04-19T04:04:25.000Z

Dear Sir, I have attached the notebook to this email. I have used a rather simple approach, and I'm satisfied with the results it provides for the time being, partly because of time constraints due to deadlines. However, I will try to implement your suggestions in the work, such as resampling the waveform when it is not at the desired sample rate.

Thank you for your quick replies and for helping me out, especially at the cost of your sleep. I'm concerned about your health, given how dedicated you have been to helping me without asking for anything in return. It is rare to see someone be this kind and help a stranger.

So sir, since I have got some satisfactory results, I think I can manage on my own from here on. Please focus on your own work, and if I need any assistance, I'll let you know by dropping an email. Once again, thank you so much for your help and support. 👏
