How is it possible to use a validation set with an LSTM?

When I try to use a validation set with an LSTM layer, training fails with the error shown below the code:
options = trainingOptions('adam', ...
'ExecutionEnvironment','gpu', ...
'GradientThreshold',1, ...
'MaxEpochs',maxEpochs, ...
'ValidationData',{XTest,YTest},...
'MiniBatchSize',miniBatchSize, ...
'LearnRateSchedule','piecewise', ...
'SequenceLength','longest', ...
'Shuffle','never', ...
'Verbose',0, ...
'Plots','training-progress');
net = trainNetwork(XTrain,categorical(YTrain),layers,options);
Error:
Training with validation data is not supported for networks with LSTM layers.
Is there another way to use the validation set during training of the network?

Accepted Answer

Joss Knight
Joss Knight on 29 Apr 2018

1 vote

It's ugly, but if you use Checkpoints, then you can use an OutputFcn to (once per epoch) load the network from a checkpoint and run it against your validation data. It isn't very efficient, but it's okay if you're only doing it once per epoch. You won't get it on the training plot of course.
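A minimal sketch of this workaround, assuming a classification LSTM and that XTrain, YTrain, XVal, YVal, layers, and maxEpochs already exist in the workspace (the folder name, the function name validateFromCheckpoint, and the once-per-epoch bookkeeping are illustrative assumptions, not part of Joss's answer):

```matlab
% Checkpoint-based validation workaround: trainNetwork writes one
% checkpoint per epoch to checkpointDir; the OutputFcn loads the newest
% one and scores it on the held-out validation set.
checkpointDir = 'checkpoints';
if ~exist(checkpointDir, 'dir'), mkdir(checkpointDir); end

options = trainingOptions('adam', ...
    'MaxEpochs', maxEpochs, ...
    'CheckpointPath', checkpointDir, ...
    'OutputFcn', @(info) validateFromCheckpoint(info, checkpointDir, XVal, YVal));

net = trainNetwork(XTrain, categorical(YTrain), layers, options);

function stop = validateFromCheckpoint(info, checkpointDir, XVal, YVal)
% Runs once per epoch: load the newest checkpoint (a MAT-file holding a
% variable 'net') and report validation accuracy. Never halts training.
persistent lastEpoch
if isempty(lastEpoch), lastEpoch = 0; end
stop = false;
if info.Epoch > lastEpoch
    files = dir(fullfile(checkpointDir, 'net_checkpoint__*.mat'));
    if isempty(files), return; end          % no checkpoint written yet
    [~, idx] = max([files.datenum]);        % pick the newest checkpoint
    s = load(fullfile(checkpointDir, files(idx).name));
    YPred = classify(s.net, XVal);          % run the validation sequences
    fprintf('Epoch %d, validation accuracy: %.3f\n', ...
        info.Epoch, mean(YPred == YVal));
    lastEpoch = info.Epoch;
end
end
```

As Joss notes, the metric printed this way will not appear on the training-progress plot.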

10 comments

Thanks Joseph! Yes, I think it's the only viable solution...
Dear all, I have a similar problem. I want to run the pretrained network in order to validate it with a validation dataset. How do I use the checkpoints and the OutputFcn to run the network on the validation data?
Best regards
I also have the same problem as Alessio Izzo.
Could anyone tell me how to use the checkpoints and the OutputFcn to run the network on the validation data?
Best regards!
Do you have R2018b? ValidationData is supported for sequence networks in R2018b.
Wow,
thank you for your information Joss!!
Sadly, I only have access to 2018a with the Neural Network Toolbox at my university.
So I am still facing that problem. I already managed to set up the OutputFcn, but somehow the data gets lost after each epoch and is not saved. Also, I can't fetch the LSTM net after one epoch with a checkpoint.
After setting optimoptions I can access the OutputFcn:
opt = optimoptions(varargin);
options = trainingOptions( [....], ...
    'OutputFcn', @(info,output) outputFCN(info,opt,net));
Since info and net are output variables of 'trainNetwork', I had hoped that this variable would contain the network data (and that I wouldn't need checkpoints). Essentially it stays empty (if anyone was wondering...).
If I define output outside of the outputFCN as an empty struct (or an empty array), trainNetwork throws an error:
Error using nnet.internal.cnn.util.UserCallbackReporter>iCallbackWrapper (line 115) Not enough input arguments.
Error in nnet.internal.cnn.util.UserCallbackReporter>@(f)iCallbackWrapper(f,this.Info) (line 85)
stop = cellfun( @(f) iCallbackWrapper(f, this.Info), this.Callbacks );
Error in nnet.internal.cnn.util.UserCallbackReporter/callCallbacks (line 85)
stop = cellfun( @(f) iCallbackWrapper(f, this.Info), this.Callbacks );
Error in nnet.internal.cnn.util.UserCallbackReporter/start (line 48)
this.callCallbacks();
Error in nnet.internal.cnn.util.VectorReporter/computeAndReport (line 56)
feval( method, this.Reporters{i}, varargin{:} );
Error in nnet.internal.cnn.util.VectorReporter/start (line 16)
computeAndReport( this, 'start' );
Error in nnet.internal.cnn.Trainer/train (line 62)
reporter.start();
Error in trainNetwork>doTrainNetwork (line 250)
trainedNet = trainer.train(trainedNet, trainingDispatcher);
The error is only avoided if 'output' is empty. Even though the data in info is not of interest to me by itself (it is the same information that info holds after a regular 'trainNetwork' run), it is still a problem: even once I can access the weights of the network and calculate the validation RMSE, that data will be lost after each step too.
I'd really appreciate it if you could help me with both problems.
Firstly --> getting the neural network data after one epoch.
Secondly ---> how to get the data out of the OutputFcn without throwing an error as soon as I try to save it.
THANK you very much!
The idea is not that your OutputFcn is passed the network, it is that inside your OutputFcn you load your checkpointed network and then use that to do prediction on your validation data to report a validation metric. If you want to preserve this metric for later (rather than just plot it) you can use a mechanism such as an up-level variable defined in the outer scope of a nested function to store the output.
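The up-level-variable pattern Joss describes can be sketched as follows. All names here (trainWithValidationLog, recordValidation, valAcc) are assumptions for illustration: the OutputFcn is a nested function, so it can write into variables of the enclosing workspace, and those per-epoch metrics are still there after training finishes.

```matlab
function [net, valAcc] = trainWithValidationLog(XTrain, YTrain, layers, XVal, YVal)
% recordValidation is nested inside this function, so assignments to
% valAcc and lastEpoch persist in this outer workspace across calls.
valAcc = [];                       % filled in, one entry per epoch
lastEpoch = 0;
checkpointDir = tempname; mkdir(checkpointDir);
options = trainingOptions('adam', ...
    'CheckpointPath', checkpointDir, ...
    'OutputFcn', @recordValidation);
net = trainNetwork(XTrain, YTrain, layers, options);

    function stop = recordValidation(info)
        stop = false;
        if info.Epoch <= lastEpoch, return; end
        files = dir(fullfile(checkpointDir, 'net_checkpoint__*.mat'));
        if isempty(files), return; end
        [~, idx] = max([files.datenum]);      % newest checkpoint
        s = load(fullfile(checkpointDir, files(idx).name));
        YPred = classify(s.net, XVal);
        valAcc(end+1) = mean(YPred == YVal);  % survives via nested scope
        lastEpoch = info.Epoch;
    end
end
```

After training, valAcc holds the per-epoch validation accuracies, which answers the commenter's second question about getting data out of the OutputFcn without errors.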
Hi, can I use "OutputFcn" with a function (I created) that takes the softmax average score of several validation images (and creates a new predicted label) at every iteration? Thank you!
Hey M J, you should probably ask a new question and provide a bit more detail and code. Thanks.
M J
M J on 8 Oct 2020
Edited: M J on 8 Oct 2020
Hi, thank you for your answer. I did ask a new question (see link below):
I do not have a code for this, as I am really not sure where to even start. Also, I am not sure if it is okay to post a link to the question here, but if not, please let me know. Thank you.


More Answers (2)

Mads Bergholt
Mads Bergholt on 17 May 2018

0 votes

Dear Joss, will this be part of MATLAB R2018b? This is an aspect of LSTMs that is very important for validating these algorithms.
Best regards, Mads

3 comments

Joss Knight
Joss Knight on 17 May 2018
Edited: Joss Knight on 17 May 2018
Yes, 18b is the plan. Get hold of the prerelease when it comes available (early June I think), if you can.
I am sorry to say that this is still not included in MATLAB R2018b. Sigh. Maybe we have to turn to TensorFlow for deep learning.
There are some restrictions on the format of the data.
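For a sequence classification network in R2018b or later, a sketch of passing validation data directly looks like the following; the variable names are assumptions, and the exact format restrictions are listed in the trainingOptions documentation:

```matlab
% XVal: N-by-1 cell array of sequences (numFeatures-by-numTimeSteps each),
% YVal: N-by-1 categorical vector of labels matching XVal.
options = trainingOptions('adam', ...
    'MaxEpochs', maxEpochs, ...
    'ValidationData', {XVal, YVal}, ...     % validated during training
    'ValidationFrequency', 30, ...          % in iterations
    'Plots', 'training-progress');
net = trainNetwork(XTrain, YTrain, layers, options);
```

With this setup the validation loss and accuracy do appear on the training-progress plot, unlike the checkpoint workaround.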

