Layer Normalization
Learn layer normalization for transformers and sequence models: how normalizing across features enables batch-independent training with interactive visualizations.
7 min readConcept
Explore machine learning concepts related to normalization. Clear explanations and practical insights.
Learn layer normalization for transformers and sequence models: how normalizing across features enables batch-independent training with interactive visualizations.
Understand internal covariate shift in deep learning: why layer input distributions change during training, how it slows convergence, and how batch normalization fixes it.
Learn batch normalization in deep learning: how normalizing layer inputs accelerates training, improves gradient flow, and acts as regularization with interactive visualizations.