How to detect multiple objects in a video file and label these objects with bounding boxes.

Hi, I want to know how I can create my own object detection using existing convolution neural networks like alexnet and have this applied to a video file that shows the bounding boxes of those objects.
Where are good places and sources so I can understand how to do this?

 Réponse acceptée

Watch Joe Hicklin's demo video, that does exactly that: MathWorks Deep Learning in 11 lines of code

3 commentaires

This is object classification.
Is this not different to multiple object detection within a a video?
If not can you give me an example?
Well if all you need to do is to detect it, rather than figure out what it is, then it might be simpler. It might not even need a CNN. You might be able to just get by with simple thresholding. Who knows, because you dind't upload an image.
Like I said, any object would be fine, I simply want to understand how to get to the point that I can pass a video file and be able to detect multiple objects and their bounding boxes using pre-trained CNN's like alexnet.
Simple classification isnt enough, something like the documentation seen here:
However, without the requirement of the ROI of the image, that can create its own bounding boxes.

Connectez-vous pour commenter.

Plus de réponses (0)

Catégories

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by