Several papers on frame prediction use CNN-based, autoencoder-based, and similar approaches. You can pick any one of them and implement it in MATLAB. These approaches require several deep learning layers, such as convolution layers, activation functions, and pooling layers. You can refer to this documentation, which explains the various deep learning layers and their applications. It should help with the implementation.
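
As a rough starting point, here is a minimal sketch of how those layers could be stacked into a small convolutional encoder-decoder that maps one frame to the next. It is not taken from any particular paper. The frame size, filter counts, training settings, and the random placeholder data are all assumptions for illustration, and it assumes Deep Learning Toolbox is installed.

```matlab
% Minimal sketch (assumed sizes and settings), requires Deep Learning Toolbox.
frameSize = [64 64 1];          % assumed: 64x64 grayscale frames
numFrames = 20;                 % assumed: length of the placeholder clip

% Placeholder data: replace with real video frames scaled to [0, 1].
frames = rand([frameSize numFrames], 'single');
X = frames(:, :, :, 1:end-1);   % inputs:  frame t
Y = frames(:, :, :, 2:end);     % targets: frame t+1

layers = [
    imageInputLayer(frameSize, 'Normalization', 'none')
    % Encoder: convolution + activation + pooling
    convolution2dLayer(3, 16, 'Padding', 'same')
    reluLayer
    maxPooling2dLayer(2, 'Stride', 2)          % 64x64 -> 32x32
    convolution2dLayer(3, 32, 'Padding', 'same')
    reluLayer
    % Decoder: upsample back to the input resolution
    transposedConv2dLayer(2, 16, 'Stride', 2)  % 32x32 -> 64x64
    reluLayer
    convolution2dLayer(3, 1, 'Padding', 'same')
    regressionLayer                            % pixel-wise regression loss
];

options = trainingOptions('adam', ...
    'MaxEpochs', 2, ...
    'MiniBatchSize', 8, ...
    'Verbose', false);

% Image-to-image regression: X and Y are H-by-W-by-C-by-N arrays.
net = trainNetwork(X, Y, layers, options);

% Predict the frame that follows the first input frame.
nextFrame = predict(net, X(:, :, :, 1));
```

With random data the prediction is meaningless, so this only checks that the layer sizes line up. For a real experiment you would load consecutive frames from your video into `frames` and likely feed several past frames as input channels, depending on the paper you choose.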