
LSTM for Sequence Classification
Last Updated on January 18, 2025 by Editorial Team
Author(s): Sarvesh Khetan
Originally published on Towards AI.
Table of Contents :
- Single Layer Architecture
1. LSTM Architecture
2. Learning in LSTM
3. How LSTM solves issues in RNN
4. Issues with LSTM
5. Pytorch Code - Stacked Layer Architecture
1. Architecture Diagram
2. Pytorch Code

Single Layer Architecture

LSTM Architecture
This is similar to RNN architecture that we saw here just that now we will replace the RNN unit with an LSTM unit

Now letβs discuss what goes on inside the LSTM unit, following video clearly explains the same!!

Learning in LSTM
Since we are using LSTM to solve a classification task we can use cross entropy loss to train the network, as shown below

Now this optimization can be solved using any optimizer i.e. gradient descent / Adam / AdaGrad / β¦ (stochastic or mini batch version). Below lets try solving it using gradient descent

Now to calculate these derivates we will take help of computation graph for this RNN architecture which is shown below



Similarly calculate derivatives of other matrices !!

How LSTM solves issues with RNN
Here we discusses issues with RNN namely vanishing gradient and exploding gradient. Hence to see if LSTM faces with similar issue or not letβs consider the above derivative dE / dWf



Issues with LSTM
LSTMs solved the vanishing gradient problem but LSTMs are computationally very very heavy thus taking huge training time and hence we wanted something which can train much faster and also give at least as good results as LSTMs because LSTMs really gave very good results.

Pytorch Code
# Create a single LSTM cell
lstm_cell = nn.LSTMCell(input_size=10, hidden_size=10)

Stacked Architecture

Architecture Diagram


Pytorch Code
lstm_stack = nn.LSTM(input_size=10, hidden_size=10, num_layers=3)
# 3 single LSTM cells stacked on top of each other
Join thousands of data leaders on the AI newsletter. Join over 80,000 subscribers and keep up to date with the latest developments in AI. From research to projects and ideas. If you are building an AI startup, an AI-related product, or a service, we invite you to consider becoming aΒ sponsor.
Published via Towards AI