trainNetwork not working with transformedDatastore from audioDatastore

Question

0 votes

Hi,

I'm trying to train a CNN with a database of audio files. For that purpose I'm reading my database with an audioDatastore and transforming it following this example so the net can read it.

The transformedDatastore seems to work well, but when training the net it seems to enter an infinite loop (the training window stays blank, without any accuracy or loss line).

UPDATE: I was wrong; I just wasn't patient enough. After about 10 minutes the program starts to plot the accuracy and loss values, but very slowly (about one iteration per minute for the first 10 iterations, then it speeds up). In the end the network completed its training in less than 6 hours. With spectrograms instead of raw audio files it took less than 1 hour, but the larger input size may account for the longer training time.

I guess the problem is now solved, but the training is still too slow for its purpose. Should I open a new question or keep updating this one?

Here is the code I'm using:

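% Read every .wav file under datafolder; subfolder names become the labels.
ads = audioDatastore(datafolder, ...
    'IncludeSubfolders',true, ...
    'FileExtensions','.wav', ...
    'LabelSource','foldernames');

% Pair each audio signal with its label so trainNetwork gets {predictor, response}.
myds = transform(ads, @myReadFunction, 'IncludeInfo',true);
myds = shuffle(myds);

net = trainNetwork(myds, layers, opts);

% Local functions must come at the end of a script file.
function [dataOut, info] = myReadFunction(dataIn, info)
    dataOut = {dataIn, info.Label};
end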

The net layers and options are known to work (I tested them with a smaller dataset loaded directly into memory).

Thank you all!

2 comments

kc on 22 Apr 2020

It's giving the following error:

Undefined function 'transform' for input arguments of type 'audioDatastore'.

How did you even get the output?

Manuel Lorenzo on 25 Apr 2020

Hi,

I'm not currently working on this project and I can't remember all the context exactly.

transform is a MATLAB datastore function, as far as I know (https://es.mathworks.com/help/matlab/ref/matlab.io.datastore.transform.html)

Maybe your problem is related to your MATLAB version. I think datastores didn't work well until the R2019a release.
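A quick way to test the version theory is sketched below. It assumes that transform for datastores first shipped in R2019a (MATLAB version 9.6); that cutoff is the assumption here, so treat the printed message as a hint rather than a diagnosis.

```matlab
% Assumption: transform on datastores requires R2019a (MATLAB 9.6) or newer.
if verLessThan('matlab', '9.6')
    fprintf('MATLAB %s predates R2019a; transform may be unavailable.\n', version);
else
    fprintf('MATLAB %s should provide transform for datastores.\n', version);
end

% List every transform implementation currently on the path.
which transform -all
```

If no datastore-related entry appears in the `which` listing, the "Undefined function 'transform'" error above is most likely a release or path problem rather than a problem with the code in the question.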


Answer 1

jibrahim on 24 Apr 2019

0 votes

Hi Manuel,

What might be happening is that trainNetwork is taking a long time doing an initial normalization of your data. If you have an imageInputLayer, can you try setting its Normalization property to 'none'? That will help us narrow down the issue.
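For reference, a minimal sketch of that setting. The input size is a made-up placeholder (one second of mono audio at 16 kHz, treated as a 16000-by-1 single-channel "image"); substitute the real dimensions of your data.

```matlab
% Hypothetical input size: 16000 samples x 1 x 1 channel of raw audio.
inputSize = [16000 1 1];

% 'none' skips the input normalization that trainNetwork would otherwise
% compute over the training data before the first iteration.
inputLayer = imageInputLayer(inputSize, 'Normalization', 'none');

disp(inputLayer.Normalization)
```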

HTH,

Jihad

2 comments

Manuel Lorenzo on 24 Apr 2019

Hi Jihad!

Thank you for the answer. Unfortunately, I had already tried disabling the input normalization, but the training keeps freezing in the first iteration.

I'm attaching an example of the output I get:

P.S.: I've already fixed the code in the description to match what I'm actually running.

Manuel Lorenzo on 24 Apr 2019

Hi again Jihad,

I've updated the question with new results: after a while the net begins to plot the values, but very slowly (it has now taken 20 minutes for 50 iterations).

I'm not sure whether I should close this question and open a new one about this speed problem.

Thank you for your time,

Manuel


Answer 2

jibrahim on 26 Apr 2019

0 votes

Hi Manuel,

Is this myReadFunction the actual function you are using? If so, it seems that you are sending raw audio data to the network. That can be perfectly valid, but if you are really using a CNN, your time-domain audio input should first be converted to some time-frequency, image-like representation (e.g. spectrogram, mel spectrogram, etc.). The slowness might be due to the large input sample size (which would be equal to the length of each audio signal you are sending in).
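As an illustration of that conversion, here is a minimal sketch of a read function that returns a log-mel spectrogram instead of raw samples. The name myMelReadFunction is hypothetical, melSpectrogram comes from Audio Toolbox and is called with its default window and band settings, and the sketch assumes the info struct returned by audioDatastore carries SampleRate and Label fields and that all files have the same duration (otherwise the spectrograms would differ in width).

```matlab
% Usage (hypothetical name): pass this function to transform in place of
% myReadFunction, e.g.
%   myds = transform(ads, @myMelReadFunction, 'IncludeInfo', true);

function [dataOut, info] = myMelReadFunction(dataIn, info)
    % Sample rate as reported by the datastore for this file (assumed field).
    fs = info.SampleRate;

    % Mel spectrogram of the first channel, using default settings.
    S = melSpectrogram(dataIn(:,1), fs);

    % Log scaling compresses the dynamic range; eps avoids log10(0).
    S = log10(S + eps);

    % Same {predictor, response} layout as the original read function.
    dataOut = {S, info.Label};
end
```

The network's input layer size would then need to match the spectrogram dimensions rather than the raw signal length.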

Would it be possible for you to give me more information on the problem you're trying to solve, and the network structure you are using? Please note that some of the featured examples in Audio Toolbox might be of help (they do not use transform, but they should give an idea of the setup). For example:

Speech command recognition: https://www.mathworks.com/help/audio/examples/Speech-Command-Recognition-Using-Deep-Learning.html

Gender classification: https://www.mathworks.com/help/audio/examples/classify-gender-using-long-short-term-memory-networks.html

HTH,

Jihad

1 comment

Manuel Lorenzo on 29 Apr 2019

Edited: Manuel Lorenzo on 29 Apr 2019

Hi again Jihad,

Yes, that's the function I'm actually using. I'm trying to feed the network raw audio data to compare its performance with the spectrogram-fed one.

I realize the input to the net is much bigger than with spectrograms (about 8 times bigger); I guess that might be enough to explain the time it takes to train the net.

Anyway, I'm using a net with three 2-D convolutional layers, each followed by a reluLayer and a max pooling layer. The goal is to classify two different events and the absence of both, so I'm classifying 3 classes, using 12 filters in each convolutional layer.

I can't remember the filter sizes right now, but I can check the code and send them to you later this week if necessary.
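A rough sketch of a layer array matching that description is shown below. Only the three conv/ReLU/max-pool stages, the 12 filters per convolution, and the 3 classes come from the post; the input size, filter size, pooling size, and the final fully connected, softmax, and classification layers are assumptions added to make the example complete.

```matlab
inputSize  = [16000 1 1];  % hypothetical: 1 s of mono audio at 16 kHz
filterSize = [9 1];        % hypothetical: the filter size is not stated in the thread
poolSize   = [2 1];        % hypothetical pooling window and stride
numFilters = 12;           % from the post
numClasses = 3;            % two events plus the absence of both

layers = [
    imageInputLayer(inputSize, 'Normalization', 'none')

    convolution2dLayer(filterSize, numFilters, 'Padding', 'same')
    reluLayer
    maxPooling2dLayer(poolSize, 'Stride', poolSize)

    convolution2dLayer(filterSize, numFilters, 'Padding', 'same')
    reluLayer
    maxPooling2dLayer(poolSize, 'Stride', poolSize)

    convolution2dLayer(filterSize, numFilters, 'Padding', 'same')
    reluLayer
    maxPooling2dLayer(poolSize, 'Stride', poolSize)

    fullyConnectedLayer(numClasses)
    softmaxLayer
    classificationLayer];
```

Calling analyzeNetwork(layers) on an array like this shows the activation size after each stage, which makes it easier to see how much larger the raw-audio input is than a spectrogram input.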

Thank you for your answer,

Manuel Porta
