SVM Model not working properly on test data

9 vues (au cours des 30 derniers jours)
Adam Hasse
Adam Hasse le 8 Jan 2018
Commenté : G.N. Cobbold le 14 Déc 2020
Hello all, I'm using fitcsvm to classify two sets of data, Class 1 and Class 2. When I train the model on a dataset and then check how it did using predict, it seems to be working perfectly. However, when I input a new dataset, it only ever classifies it in Class 2, even though I use the cost modifier to state that classifying Class 1 in Class 2 is three times worse than misclassifying Class 2 as a Class 1.
SVM Model:
SVMModel = fitcsvm(data, classLabel, 'KernelFunction', 'gaussian',...
'Standardize',true, 'ClassNames', {'Class 1','Class 2'},...
'Cost',[0 3; 1 0]);
Verification of Training set:
[svmLabel, score] = predict(SVMModel, data);
Test Set:
[newsvmLabel, score] = predict(SVMModel, testData);
I think I'm using it correctly, so any advice on what's going on would be much appreciated.
  1 commentaire
the cyclist
the cyclist le 8 Jan 2018
Can you upload the data, or a small sample that exhibits the problem? That way we can run your code and see for ourselves.

Connectez-vous pour commenter.

Réponses (3)

Bernhard Suhm
Bernhard Suhm le 23 Fév 2018
You are applying the predict function correctly, so there must be something wrong with your testData, probably the type in one column doesn't match the training data. Without providing an example - as "the cyclist" request - impossible to help you further.

Roushanak Haji Hassani
Roushanak Haji Hassani le 1 Oct 2018
did you solve the issue? I have a similar problem.
  1 commentaire
Bernhard Suhm
Bernhard Suhm le 1 Oct 2018
Without the data we are just guessing at what the problem may be. Actually you may need to look at your training data as well, maybe that's highly imbalanced and the cost function didn't work.

Connectez-vous pour commenter.


Roushanak Haji Hassani
Roushanak Haji Hassani le 1 Oct 2018
Modifié(e) : Roushanak Haji Hassani le 1 Oct 2018
I have attached the train data set and test here. The first column in the train dataset is the labels and the others are extracted features. I have applied Linear SVM classifier and I got 92.5 % accuracy on train data. But when I am applying on the test data the result is not good at all. I would appreciate if you let me know what is wrong with data.
  4 commentaires
SALWA BEN YOUNES
SALWA BEN YOUNES le 7 Oct 2020
Did you solve the issue? I have a similar problem
G.N. Cobbold
G.N. Cobbold le 14 Déc 2020
Hi Salwa
Did you solve the issue? I have a similar problem

Connectez-vous pour commenter.

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by