
How are the two layers "sequenceInputLayer(num_channels) & bilstmLayer(HU,OutputMode="sequence")" connected to each other?

Hello, I would like to know how the connection between the sequenceInputLayer and an lstmLayer or bilstmLayer is implemented.
Usually BiLSTM layers have separate weight matrices for each channel of the input.
In a typical BiLSTM network with a sequenceInputLayer and a bilstmLayer, each unit of the bilstmLayer would be connected to each channel of the sequenceInputLayer. This means that each unit of the bilstmLayer receives input from all channels of the sequenceInputLayer. For example, each unit receives the entire number vector per time step.
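The "fully connected per channel" picture described above can be sketched numerically. This is an illustrative NumPy sketch, not MATLAB's implementation: the point is simply that one gate's input weights form a dense NumHiddenUnits-by-InputSize matrix, so each hidden unit's pre-activation combines all input channels at every time step.

```python
import numpy as np

H, C = 4, 3  # hidden units, input channels (illustrative sizes)
rng = np.random.default_rng(0)

W = rng.standard_normal((H, C))  # one gate's input weights: dense H-by-C
x_t = rng.standard_normal(C)     # the full feature vector at one time step

z = W @ x_t  # each of the H units receives a weighted sum of ALL C channels
assert z.shape == (H,)
```

Because `W` is dense, changing any single input channel changes the pre-activation of every hidden unit, which matches the connectivity described in the question.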
Is this correct? Please feel free to forward me the documentation on this topic.
Thank you very much and best regards
Chris

Answers (2)

Debadipto
Debadipto on 23 Apr 2024
Yes, your understanding is generally correct about how a BiLSTM (Bidirectional Long Short-Term Memory) layer connects to a sequence input layer in a neural network architecture. When you use a sequence input layer followed by a BiLSTM layer, the input sequence is fed into both the forward and backward LSTM layers of the BiLSTM. Each unit in these LSTM layers processes the entire input sequence (or the entire set of features at each time step) but in opposite directions; the forward LSTM processes the sequence from start to end, while the backward LSTM processes it from end to start.
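The forward/backward processing described above can be sketched as follows. This is a deliberately simplified single-gate recurrence in NumPy (not a full LSTM, and not MATLAB's kernel); it only shows the data flow: one pass runs start-to-end, the other runs over the reversed sequence, and the two hidden-state sequences are concatenated per time step.

```python
import numpy as np

def recurrent_pass(xs, H, seed=42):
    """Simplified single-gate recurrence (illustrative, not a real LSTM)."""
    C = xs.shape[1]
    rng = np.random.default_rng(seed)
    Wi = rng.standard_normal((H, C)) * 0.1  # input weights
    Wr = rng.standard_normal((H, H)) * 0.1  # recurrent weights
    h, out = np.zeros(H), []
    for x in xs:                      # process time steps in the given order
        h = np.tanh(Wi @ x + Wr @ h)  # each unit sees all channels and all units
        out.append(h)
    return np.array(out)

T, C, H = 5, 3, 4
xs = np.random.default_rng(0).standard_normal((T, C))

fwd = recurrent_pass(xs, H)               # start -> end
bwd = recurrent_pass(xs[::-1], H)[::-1]   # end -> start, re-aligned to time order
y = np.concatenate([fwd, bwd], axis=1)    # OutputMode="sequence": 2*H features/step
assert y.shape == (T, 2 * H)
```

The concatenated output having 2*H features per time step is why a bilstmLayer with NumHiddenUnits=H produces sequences of size 2*H in MATLAB.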
Regarding the documentation, the specifics of how these connections are implemented can vary depending on the software or framework you're using, so the relevant framework's documentation is the place to confirm the details.
  1 comment
Christian Holz
Christian Holz on 26 Apr 2024
Edited: Christian Holz on 26 Apr 2024
Thank you very much for your answer.
I agree with your description, but unfortunately I cannot find any reference in the MathWorks documentation. A corresponding description of the implementation would confirm our assumption.



Ieuan Evans
Ieuan Evans on 26 Apr 2024
Hi Christian,
For BiLSTM layers in MATLAB, for each of the input channels and for both the forward and backward parts of the layer, the weight matrices are concatenated into a single matrix. For example, the InputWeights property is an 8*NumHiddenUnits-by-InputSize matrix, where NumHiddenUnits and InputSize are the numbers of hidden units and input channels, respectively.
In this case, the input weight matrix is a concatenation of the eight input weight matrices for the components (gates) in the bidirectional LSTM layer. The eight matrices are concatenated vertically in this order:
  • Input gate (Forward)
  • Forget gate (Forward)
  • Cell candidate (Forward)
  • Output gate (Forward)
  • Input gate (Backward)
  • Forget gate (Backward)
  • Cell candidate (Backward)
  • Output gate (Backward)
For a diagram that shows how data flows through a BiLSTM layer, see https://uk.mathworks.com/help/deeplearning/ug/create-bilstm-function.html
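The eight-block layout described above can be sketched as follows. This NumPy illustration (not MATLAB code) shows how a vertically concatenated 8*NumHiddenUnits-by-InputSize matrix splits back into the eight per-gate H-by-C blocks in the stated order:

```python
import numpy as np

H, C = 4, 3  # NumHiddenUnits, InputSize (illustrative sizes)

# Stand-in for the layer's InputWeights property: 8*H rows, C columns.
InputWeights = np.arange(8 * H * C, dtype=float).reshape(8 * H, C)

gate_names = [
    "input (fwd)", "forget (fwd)", "cell candidate (fwd)", "output (fwd)",
    "input (bwd)", "forget (bwd)", "cell candidate (bwd)", "output (bwd)",
]

# Vertical concatenation means gate k occupies rows k*H : (k+1)*H.
gates = {name: InputWeights[k * H:(k + 1) * H] for k, name in enumerate(gate_names)}

assert all(g.shape == (H, C) for g in gates.values())
# Stacking the blocks back in order reproduces the original matrix.
assert np.array_equal(np.vstack(list(gates.values())), InputWeights)
```

The same slicing logic applies to the RecurrentWeights and Bias properties, which concatenate the per-gate blocks in the same order.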

Version

R2023b
