LSTMProjectedLayer - Long short-term memory (LSTM) projected layer for recurrent neural network

  (RNN) - MATLAB ([original](https://in.mathworks.com/help/deeplearning/ref/nnet.cnn.layer.lstmprojectedlayer.html)) ([raw](?raw))

Long short-term memory (LSTM) projected layer for recurrent neural network (RNN)

Since R2022b

Description

An LSTM projected layer is an RNN layer that learns long-term dependencies between time steps in time-series and sequence data using projected learnable weights.

To compress a deep learning network, you can use projected layers. A projected layer is a type of deep learning layer that enables compression by reducing the number of stored learnable parameters. The layer introduces learnable projector matrices_Q_, replaces multiplications of the form Wx, where W is a learnable matrix, with the multiplication WQQ⊤x, and stores Q and W′=WQ instead of storing W. Projecting x into a lower dimensional space using Q typically requires less memory to store the learnable parameters and can have similarly strong prediction accuracy.

Reducing the number of learnable parameters by projecting an LSTM layer rather than reducing the number of hidden units of the LSTM layer maintains the output size of the layer and, in turn, the sizes of the downstream layers, which can result in better prediction accuracy.

Creation

Syntax

Description

Properties

expand all

Projected LSTM

Number of hidden units (also known as the hidden size), specified as a positive integer.

The number of hidden units corresponds to the amount of information that the layer remembers between time steps (the hidden state). The hidden state can contain information from all the previous time steps, regardless of the sequence length. If the number of hidden units is too large, then the layer can overfit to the training data. The hidden state does not limit the number of time steps that the layer processes in an iteration.

The layer outputs data with NumHiddenUnits channels.

To set this property, use the numHiddenUnits argument when you create the LSTMProjectedLayer object. After you create aLSTMProjectedLayer object, this property is read-only.

Output projector size, specified as a positive integer.

The LSTM layer operation uses four matrix multiplications of the form Rht−1, where R denotes the recurrent weights and_ht_ denotes the hidden state (or, equivalently, the layer output) at time step t.

The LSTM projected layer operation instead uses multiplications of the from RQoQo⊤ht−1, where Qo is aNumHiddenUnits-by-OutputProjectorSize matrix known as the output projector. The layer uses the same projector_Qo_ for each of the four multiplications.

To perform the four multiplications of the form Rht−1, an LSTM layer stores four recurrent weights matrices_R_, which necessitates storing 4*NumHiddenUnits^2 learnable parameters. By instead storing the4*NumHiddenUnits-by-OutputProjectorSize matrix R′=RQo and Qo, an LSTM projected layer can perform the multiplication RQoQo⊤ht−1 and store only 5*NumHiddenUnits*OutputProjectorSize learnable parameters.

To set this property, use the outputProjectorSize argument when you create the LSTMProjectedLayer object. After you create aLSTMProjectedLayer object, this property is read-only.

Tip

To ensure that RQoQo⊤ht−1 requires fewer learnable parameters, set theOutputProjectorSize property to a value less than4*NumHiddenUnits/5.

Input projector size, specified as a positive integer.

The LSTM layer operation uses four matrix multiplications of the form Wxt, where W denotes the input weights and_xt_ denotes the layer input at time step_t_.

The LSTM projected layer operation instead uses multiplications of the from WQiQi⊤xt, where Qi is anInputSize-by-InputProjectorSize matrix known as the input projector. The layer uses the same projector_Qi_ for each of the four multiplications.

To perform the four multiplications of the form Wxt, an LSTM layer stores four weight matrices W, which necessitates storing 4*NumHiddenUnits*InputSize learnable parameters. By instead storing the4*NumHiddenUnits-by-InputProjectorSize matrix W′=WQi and Qi, an LSTM projected layer can perform the multiplication WQiQi⊤xt and store only(4*NumHiddenUnits+InputSize)*InputProjectorSize learnable parameters.

To set this property, use the inputProjectorSize argument when you create the LSTMProjectedLayer object. After you create aLSTMProjectedLayer object, this property is read-only.

Tip

To ensure that WQiQi⊤xt requires fewer learnable parameters, set theInputProjectorSize property to a value less than4*NumHiddenUnits*inputSize/(4*NumHiddenUnits+inputSize).

Output mode, specified as one of these values:

"sequence" — Output the complete sequence.
"last" — Output the last time step of the sequence.

The LSTMProjectedLayer object stores this property as a character vector.

To set this property, use the corresponding name-value argument when you create the LSTMProjectedLayer object. After you create a LSTMProjectedLayer object, this property is read-only.

This property is read-only.

Flag for state inputs to the layer, specified as 0 (false) or 1 (true).

If the HasStateInputs property is 0 (false), then the layer has one input with the name"in", which corresponds to the input data. In this case, the layer uses the HiddenState and CellState properties for the layer operation.

If the HasStateInputs property is 1 (true), then the layer has three inputs with the names"in", "hidden", and "cell", which correspond to the input data, hidden state, and cell state, respectively. In this case, the layer uses the values passed to these inputs for the layer operation. If HasStateInputs is 1 (true), then the HiddenState andCellState properties must be empty.

This property is read-only.

Flag for state outputs from the layer, specified as0 (false) or1 (true).

If the HasStateOutputs property is 0 (false), then the layer has one output with the name"out", which corresponds to the output data.

If the HasStateOutputs property is 1 (true), then the layer has three outputs with the names"out", "hidden", and"cell", which correspond to the output data, hidden state, and cell state, respectively. In this case, the layer also outputs the state values that it computes.

This property is read-only.

Input size, specified as a positive integer or "auto". IfInputSize is "auto", then the software automatically assigns the input size at training time.

If InputSize is "auto", then theLSTMProjectedLayer object stores this property as a character vector.

Data Types: double | char | string

Activations

This property is read-only.

Activation function to update the cell and hidden state, specified as one of these values:

"tanh" — Use the hyperbolic tangent function (tanh).
"softsign" — Use the softsign function softsign(x)=x1+|x|.
"relu" (since R2024a) — Use the rectified linear unit (ReLU) function ReLU(x)={x,x>00,x≤0.

The software uses this option as the function σc in the calculations to update the cell and hidden state.

For more information on how an LSTM layer uses activation functions, see Long Short-Term Memory Layer.

The LSTMProjectedLayer object stores this property as a character vector.

Activation function to apply to the gates, specified as one of these values:

"sigmoid" — Use the sigmoid function, σ(x)=(1+e−x)−1.
"hard-sigmoid" — Use the hard sigmoid function,

The software uses this option as the function σg in the calculations for the layer gates.

The LSTMProjectedLayer object stores this property as a character vector.

To set this property, use the corresponding name-value argument when you create the LSTMProjectedLayer object. After you create a LSTMProjectedLayer object, this property is read-only.

State

Cell state to use in the layer operation, specified as a NumHiddenUnits-by-1 numeric vector. This value corresponds to the initial cell state when data is passed to the layer.

After you set this property manually, calls to the resetState function set the cell state to this value.

If HasStateInputs is 1 (true), then the CellState property must be empty.

Data Types: single | double

Hidden state to use in the layer operation, specified as aNumHiddenUnits-by-1 numeric vector. This value corresponds to the initial hidden state when data is passed to the layer.

After you set this property manually, calls to the resetState function set the hidden state to this value.

If HasStateInputs is 1 (true), then the HiddenState property must be empty.

Data Types: single | double