[YOLOv2] Train the feature layer instead of transfer learning

5 vues (au cours des 30 derniers jours)
Tai
Tai le 28 Nov 2023
Commenté : Venu le 5 Déc 2023
As explained in the example in https://de.mathworks.com/help/vision/ug/train-an-object-detector-using-you-only-look-once.html, most YOLO network in Matlab context is trained via transfer learning. The feature layer is pretrained, and using trainYOLOv2ObjectDetector I can only adapt the coefficients in detector layers. When I change the
[detector, info] = trainYOLOv2ObjectDetector(augmentedTrainingData,lgraph, opts);
with detector = trainNetwork(dsTrain,lgraph,opts);
I have the following error (info. I have increased the labels to 4 classes)
Error using trainNetwork
Invalid training data. The output size ([23 40 63]) of the last layer does not match the response size ([1 1 4]).
How can I realize training of the complete YOLOv2 network, not just part of it?
Thank you.
  3 commentaires
Tai
Tai le 4 Déc 2023
Hi @Venu:
Thank you very much for your reply. It seems that "trainYOLOv2ObjectDetector" will implicitly incorporate bounding box and anchor information into the training processing. If I want to use "tainNetwork" for training, I have to manually add these information into the "dsTrain", correct?
Actually, my intention is to train the complete YOLOv2 network instead of the layers only reponsible for detector. Is it possible to use "trainYOLOv2ObjectDetector" to realize my intention?
Besides, here are the answers to your question, in case it is impossible to do it by using "trainYOLOv2ObjectDetector":
Can you provide your network's output layer structure, some info about how your training data is organized, including how the bounding boxes and class labels are formatted?
  • I use Darknet19 as backbone network, and the feature network is till the layer "leaky_relu_13"
  • I have put the training data as mat-file in the attachment.
If you're using an "augmentedImageDataStore", can you provide details on how it was created and what preprocessing steps were applied?
  • I have resize the datainto [720 720 3]
  • Apply random horizontal flipping, and random X/Y scaling. Boxes that get scaled outside the bounds are clipped if the overlap is above 0.25. Also, jitter image color.
Venu
Venu le 5 Déc 2023
Hi @Tai,
Firstly I'll answer your queries.
"I have to manually add these information into the "dsTrain", correct? "YES.
"Actually, my intention is to train the complete YOLOv2 network instead of the layers only reponsible for detector. Is it possible to use "trainYOLOv2ObjectDetector" to realize my intention? "Unfortunately NO.The "trainYOLOv2ObjectDetector" function is designed for transfer learning and is not intended for training the entire network from scratch. It assumes that the backbone network (in your case, Darknet19 up to "leaky_relu_13") is pre-trained and its weights are frozen during training. This means that only the layers added beyond the backbone (i.e., the detection layers) are trained to adapt to your specific object detection task. If you want to train the complete YOLOv2 network, you need to either train from scratch or fine-tune the entire network, including the feature extraction layers (Darknet19). So you cannot use "trainYOLOv2ObjectDetector" and must proceed with "trainNetwork".
Coming to the data and info that you have provided I would suggest these steps:
1. Apply the same augmentations to both the images and the bounding boxes and add to "trainNetwork" manually. Some entries are empty [ ], which suggests that no object was detected or annotated in that particular image. You'll need to decide how to handle these cases during training - whether to include them as negative examples or to exclude them from the training process.
2. Preprocess the bounding box data into the format expected by YOLO. This involves assigning each bounding box to the appropriate cell in the 45x45 grid based on its coordinates, encoding the bounding box coordinates relative to the cell it's assigned to, encoding the objectness score and class probabilities.
Hope this helps!

Connectez-vous pour commenter.

Réponses (0)

Produits


Version

R2023b

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by