Video Reconstruction using Transformers
Author(s): Sarvesh Khetan. Originally published on Towards AI. Here we can use any type of generative model, such as Autoencoders / Variational Autoencoders / GANs / … / Diffusion Models / … Below I have shown how to reconstruct videos using Autoencoders and Diffusion …
Preference Alignment
Author(s): Sarvesh Khetan. Originally published on Towards AI. Preference Alignment: Reinforcement Learning with Human Feedback (RLHF). Table of Contents: Prerequisites; Method 1: Preference Alignment using Reinforcement Learning with Human Feedback (RLHF) Algorithm; 2.a. LLM as Reinforcement Learning Agent; 2.b. Reward Modelling using Supervised …
Efficient Training Engine (ETE) for Large Deep Learning Models
Author(s): Sarvesh Khetan. Originally published on Towards AI. Table of Contents: There are many ways to efficiently train a large DL model. 1. Parallel / Distributed Training: Distributed Data Parallelism (DDP): a. DDP Algorithm Intuition, b. DDP Algorithm, c. Code Implementation; Model Parallelism (MP): a. …
Actor Critic — Deep Reinforcement Learning
Author(s): Sarvesh Khetan. Originally published on Towards AI. I have introduced the problem statement here, wherein we are trying to build an agent capable of playing the Atari Ping Pong game. Before reading this article, I would suggest building up foundational knowledge by …
Policy Learning — Deep Reinforcement Learning
Author(s): Sarvesh Khetan. Originally published on Towards AI. A. Problem Statement: We have already seen the problem statement here and how to solve it using the Q learning approach, but since Q learning has its own drawbacks, researchers developed this approach called …
Q Learning — Deep Reinforcement Learning
Author(s): Sarvesh Khetan. Originally published on Towards AI. Table of Contents: Problem Statement; Value Functions; Q Learning (3.a. Theory, 3.b. Code, 3.c. Problems with Q Learning); V Learning; Resources. Problem Statement: Let’s say our model is trying to play ping pong and is currently in …
LSTM for Sequence Classification
Author(s): Sarvesh Khetan. Originally published on Towards AI. Table of Contents: Single Layer Architecture (1. LSTM Architecture, 2. Learning in LSTM, 3. How LSTM solves issues in RNN, 4. Issues with LSTM, 5. PyTorch Code); Stacked Layer Architecture (1. Architecture Diagram, 2. PyTorch Code). Single Layer Architecture: LSTM …
Deep Learning for Time Series Forecasting
Author(s): Sarvesh Khetan. Originally published on Towards AI. Table of Contents: 1. Feed Forward Neural Network 2. 1D Convolution Neural Network 3. Hidden Markov Models (HMM) 4. Conditional Random Fields (CRF) 5. Recursive Neural Network (RvNN) 6.1. [1990s] Recurrent Neural Network …
Transformers for Videos
Author(s): Sarvesh Khetan. Originally published on Towards AI. Note: this is a continuation of this blog, wherein I discussed different models for the video classification task. A video is nothing but a sequence of images, and hence to make use of …