
Two Whys Behind Positional Encoding in Transformers
First Why: Why do we need positional encoding in word embeddings instead of passing the embedding vectors directly to the attention mechanism? Second Why: Why do we use sinusoidal functions for positional encoding?
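To make the two questions concrete, here is a minimal NumPy sketch (my own illustration, not code from the article) of the sinusoidal positional encoding from "Attention Is All You Need": because attention itself is permutation invariant, the position signal is added to the word embeddings before they reach the attention mechanism. The values of max_len and d_model are placeholders.

```python
import numpy as np

def sinusoidal_positional_encoding(max_len: int, d_model: int) -> np.ndarray:
    """Return a (max_len, d_model) matrix of sinusoidal position encodings."""
    positions = np.arange(max_len)[:, None]                # (max_len, 1)
    dims = np.arange(0, d_model, 2)[None, :]               # (1, d_model/2)
    angle_rates = 1.0 / np.power(10000.0, dims / d_model)  # geometric frequency range
    angles = positions * angle_rates                       # (max_len, d_model/2)

    pe = np.zeros((max_len, d_model))
    pe[:, 0::2] = np.sin(angles)   # even dimensions: sine
    pe[:, 1::2] = np.cos(angles)   # odd dimensions: cosine
    return pe

# Without this step, attention sees a bag of embeddings with no notion of order.
max_len, d_model = 50, 512
token_embeddings = np.random.randn(max_len, d_model)       # placeholder embeddings
attention_input = token_embeddings + sinusoidal_positional_encoding(max_len, d_model)
```

One reason sinusoids are favoured is that the encoding of position pos + k can be written as a fixed linear function of the encoding of pos, so relative offsets are easy for the attention layers to exploit.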
Relative position bias, introduced in the Swin Transformer paper, improves on absolute positional encoding by capturing the relative positions of image patches directly inside each window's attention computation.
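As a rough PyTorch sketch of that idea (a single-head simplification with q = k = v, not the full Swin implementation; the class and variable names are my own), a learnable bias indexed by the relative offset between every pair of patches in a window is added to the attention logits before the softmax.

```python
import torch
import torch.nn as nn

class WindowAttentionWithRelativeBias(nn.Module):
    """Single-head window attention with a learned relative position bias."""

    def __init__(self, dim: int, window_size: int):
        super().__init__()
        self.dim = dim
        # One learnable bias per possible relative offset: (2M - 1) ** 2 entries.
        self.bias_table = nn.Parameter(torch.zeros((2 * window_size - 1) ** 2))

        # Precompute, for every pair of patches in an M x M window, the index
        # of their relative offset into the bias table.
        coords = torch.stack(torch.meshgrid(
            torch.arange(window_size), torch.arange(window_size), indexing="ij"))
        coords = coords.flatten(1)                        # (2, M*M)
        rel = coords[:, :, None] - coords[:, None, :]     # (2, M*M, M*M)
        rel = rel.permute(1, 2, 0) + (window_size - 1)    # shift offsets to start at 0
        index = rel[..., 0] * (2 * window_size - 1) + rel[..., 1]
        self.register_buffer("rel_index", index)          # (M*M, M*M)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (num_windows, M*M, dim); q = k = v = x for brevity.
        logits = x @ x.transpose(-2, -1) / self.dim ** 0.5
        bias = self.bias_table[self.rel_index]            # (M*M, M*M)
        attn = torch.softmax(logits + bias, dim=-1)       # bias enters before softmax
        return attn @ x
```

Because the bias depends only on the offset between two patches, it carries positional information without attaching an absolute coordinate to each patch.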
Demystifying the embedding layer in NLP, which transforms tokens - whether words, subwords, or characters - into dense vectors.
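A minimal PyTorch sketch of that transformation (vocabulary size, embedding dimension, and token ids are made-up values): the embedding layer is essentially a learnable lookup table from integer token ids to dense vectors.

```python
import torch
import torch.nn as nn

# Hypothetical vocabulary of 10,000 subword tokens mapped to 256-dimensional vectors.
vocab_size, embed_dim = 10_000, 256
embedding = nn.Embedding(vocab_size, embed_dim)

# Token ids for one tokenised sentence (the ids are arbitrary, for illustration only).
token_ids = torch.tensor([[12, 845, 7, 2031]])   # shape: (batch=1, seq_len=4)
dense_vectors = embedding(token_ids)             # shape: (1, 4, 256)
print(dense_vectors.shape)                       # torch.Size([1, 4, 256])
```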
This article series is divided into two parts. In the first part, I will explain the basics of Class Activation Maps (CAM) and how they are calculated. In the second part, I will delve into the w...
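Ahead of that series, a minimal NumPy sketch of the basic CAM calculation it refers to (function name and shapes are placeholders): in a network that ends with global average pooling and a linear classifier, the map for a class is the class-specific weighted sum of the final convolutional feature maps.

```python
import numpy as np

def class_activation_map(feature_maps: np.ndarray,
                         fc_weights: np.ndarray,
                         class_idx: int) -> np.ndarray:
    """CAM for one class: weighted sum of the last conv feature maps.

    feature_maps: (K, H, W) activations of the final convolutional layer.
    fc_weights:   (num_classes, K) weights of the linear layer after global average pooling.
    """
    weights = fc_weights[class_idx]                      # (K,)
    cam = np.tensordot(weights, feature_maps, axes=1)    # (H, W)
    cam = np.maximum(cam, 0)                             # keep only positive evidence
    return cam / (cam.max() + 1e-8)                      # normalise to [0, 1] for display

# Made-up shapes: 512 feature maps of size 7x7 and a 1000-class classifier.
heatmap = class_activation_map(np.random.rand(512, 7, 7),
                               np.random.rand(1000, 512),
                               class_idx=283)
```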