Post-Layer Normalization and Sublayer 2: Feedforward Network

Learn how post-layer normalization is performed and which components make up the feedforward network.

Layer normalization now processes the output of the attention sublayer.

Post-layer normalization

Each attention sublayer and each feedforward sublayer of the transformer is followed by post-layer normalization (Post-LN):
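To make the idea concrete, here is a minimal sketch of Post-LN in plain Python. It assumes the formulation from the original Transformer, LayerNorm(x + Sublayer(x)), in which the sublayer input is added back to the sublayer output (a residual connection) before normalization. The names `layer_norm`, `post_ln`, and `toy_sublayer` are hypothetical and chosen only for illustration; `toy_sublayer` is a stand-in for a real attention or feedforward sublayer, and the scale and shift parameters are left at their identity values rather than learned.

```python
import math


def layer_norm(v, gamma=None, beta=None, eps=1e-6):
    """Normalize one vector to zero mean and unit variance across its features.

    gamma (scale) and beta (shift) are learned in a real model; here they
    default to 1 and 0, which is an assumption made for illustration.
    """
    d = len(v)
    mean = sum(v) / d
    var = sum((x - mean) ** 2 for x in v) / d
    gamma = gamma if gamma is not None else [1.0] * d
    beta = beta if beta is not None else [0.0] * d
    return [
        g * (x - mean) / math.sqrt(var + eps) + b
        for x, g, b in zip(v, gamma, beta)
    ]


def post_ln(x, sublayer):
    """Post-LN: add the sublayer output to its input, then normalize the sum."""
    y = sublayer(x)
    return layer_norm([a + b for a, b in zip(x, y)])


def toy_sublayer(x):
    """Hypothetical stand-in for an attention or feedforward sublayer."""
    return [0.5 * a for a in x]


# One token represented by a 4-dimensional vector.
out = post_ln([1.0, 2.0, 3.0, 4.0], toy_sublayer)
print(out)
```

The normalized output has a mean of approximately zero and a variance of approximately one across the feature dimension, whatever the scale of the sublayer's output.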

