Training with trainNetwork failed. The value of 'ValidationData' is invalid. The datastore used for 'ValidationData' must return a table or cell array 2 columns.with at least

6 vues (au cours des 30 derniers jours)
Hello, I want to do classification with LSTM using Deep Network Designer. But my data consists of features in .xlsx format. I created a datastore with the code given below and did the necessary operations. After importing the data, when I click on the Training tab in the deep network designer section, "Training with trainNetwork failed. The value of 'ValidationData' is invalid. The datastore used for 'ValidationData' must return a table or cell array 2 columns with at least" error message. Can you help.
  3 commentaires
Matt J
Matt J le 17 Avr 2024
It is also preferred to copy/past your code rather than share a screenshot.
Yes, otherwise we cannot copy/paste it to demonstrate solutions.
Atakan Öztürk
Atakan Öztürk le 18 Avr 2024
% Load data
[numData, txtData, raw] = xlsread('veriseti2.xlsx');
% Separating numeric and text data
numericData = numData(:, 1:5); % The first 5 columns represent numeric data
textData = raw(:, 6); % Column 6 represents text data
% Convert text data to categorical variables
categoricalData = categorical(textData);
% Convert categorical variables to numeric values
encodedData = grp2idx(categoricalData);
% Separating input data and output labels
inputData = numericData; % Numeric properties
outputLabels = encodedData; % Converted categorical labels
% Create a partition that splits the data for training and validation
cv = cvpartition(size(numericData, 1), 'HoldOut', 0.2); % 80% training, 20% validation
% Get indices of training and validation data
trainingIdx = cv.training;
% Separate training and validation data
TrainingData = inputData(trainingIdx, :);
ValidationData = inputData(~trainingIdx, :);
TrainingLabels = outputLabels(trainingIdx, :);
ValidationLabels = outputLabels(~trainingIdx, :);
% Convert training and validation data to datastore
dstrain = arrayDatastore(TrainingData); % Training data and tags
dsvalidation = arrayDatastore(ValidationData); % Validation data and tags

Connectez-vous pour commenter.

Réponse acceptée

Cris LaPierre
Cris LaPierre le 18 Avr 2024
Because the input must be a datastore, you need to format your input so that your five features are a column vector. Here's code that I wrote that got the training to work in the Deep Network Designer
% Load data
data = readtable('veriseti2.xlsx');
% Format data
data = convertvars(data,"Var6",'categorical');
trainingData = mergevars(data,1:5);
trainingData.Properties.VariableNames = ["Predictors","Response"];
trainingData.Predictors = rowfun(@transpose,trainingData,"InputVariables","Predictors","OutputFormat","cell");
% Create a partition that splits the data for training and validation
cv = cvpartition(height(trainingData), 'HoldOut', 0.2); % 80% training, 20% validation
% Get indices of training and validation data
trainingIdx = cv.training;
validationIdx = cv.test;
% Separate training and validation data
TrainingData = trainingData(trainingIdx, :);
ValidationData = trainingData(validationIdx, :);
% Convert training and validation data to datastore
dstrain = arrayDatastore(TrainingData,'OutputType','same'); % Training data and tags
dsvalidation = arrayDatastore(ValidationData,'OutputType','same'); % Validation data and tags
I find that, at least with your data set, having to work with datastores makes it more difficult. I would reocmmend exporting your network using the Generate Network Code without Parameters and set up your training programmatically. Here's what that might look like.
% Create a partition that splits the data for training and validation
cv = cvpartition(height(data), 'HoldOut', 0.2); % 80% training, 20% validation
% Get indices of training and validation data
trainingIdx = cv.training;
validationIdx = cv.test;
% Separate training and validation data
TrainingData = data(trainingIdx, :);
ValidationData = data(validationIdx, :);
layers = [
featureInputLayer(5,'Normalization', 'zscore')
lstmLayer(128,"Name","lstm")
dropoutLayer(0.5,"Name","dropout")
fullyConnectedLayer(4,"Name","fc")
softmaxLayer("Name","softmax")
classificationLayer("Name","classoutput")];
options = trainingOptions('adam', ...
'MiniBatchSize',128, ...
'Shuffle','every-epoch', ...
'ValidationData',ValidationData, ...
'Plots','training-progress', ...
'Verbose',false);
net = trainNetwork(TrainingData,layers,options)
  1 commentaire
Cris LaPierre
Cris LaPierre le 18 Avr 2024
If you do want to train in the Deep Network Designer, if you add a flatten layer, then your formatting of the input could just be
data = readtable('veriseti2.xlsx');
data = convertvars(data,"Var6",'categorical');
data = mergevars(data,1:5);

Connectez-vous pour commenter.

Plus de réponses (1)

Catalytic
Catalytic le 17 Avr 2024
Modifié(e) : Catalytic le 17 Avr 2024
Your ValidationData is just an arrayDatastore of input data. The error message has told you that it must include input and output pairs, and that these must be in cell array or table form.

Catégories

En savoir plus sur Image Data Workflows dans Help Center et File Exchange

Produits


Version

R2023b

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by