Tagged with

architectures

Explore machine learning concepts related to architectures. Clear explanations and practical insights.

Concepts Related to architectures

May 6, 2026

Batch Norm vs Layer Norm: When to Use Which

BatchNorm normalizes each channel over the batch and spatial axes; LayerNorm normalizes each sample over its channel and spatial axes. The choice determines whether your model trains stably with batch=1, whether its outputs depend on batch composition at inference, and whether it behaves consistently across train and eval.

deep-learning normalization training transformers architectures

8 min read · Concept
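
The axis distinction in the Batch Norm vs Layer Norm summary above can be sketched in a few lines of NumPy. This is a minimal illustration, not the article's own code: it assumes an `(N, C, H, W)` tensor, uses training-mode statistics only, and omits the learned scale/shift and running averages. The function names `batch_norm` and `layer_norm` are hypothetical helpers.

```python
import numpy as np

EPS = 1e-5  # small constant for numerical stability (assumed value)


def batch_norm(x):
    """Normalize each channel over the batch and spatial axes (N, H, W)."""
    mean = x.mean(axis=(0, 2, 3), keepdims=True)
    var = x.var(axis=(0, 2, 3), keepdims=True)
    return (x - mean) / np.sqrt(var + EPS)


def layer_norm(x):
    """Normalize each sample over its channel and spatial axes (C, H, W)."""
    mean = x.mean(axis=(1, 2, 3), keepdims=True)
    var = x.var(axis=(1, 2, 3), keepdims=True)
    return (x - mean) / np.sqrt(var + EPS)


rng = np.random.default_rng(0)
x = rng.normal(size=(4, 3, 2, 2))  # toy batch: N=4, C=3, H=W=2

bn_full = batch_norm(x)
ln_full = layer_norm(x)

# Re-run on the first sample alone, as if the batch size were 1.
bn_single = batch_norm(x[:1])
ln_single = layer_norm(x[:1])

# LayerNorm output for a sample is independent of the rest of the batch;
# BatchNorm output (with batch statistics) is not.
print(np.allclose(ln_full[:1], ln_single))
print(np.allclose(bn_full[:1], bn_single))
```

Because LayerNorm's statistics come from a single sample, its output for that sample is the same whether it is processed alone or in a batch. BatchNorm with batch statistics changes with the other samples present, which is why frameworks typically switch it to stored running statistics at eval time.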

August 5, 2025

Feature Pyramid Networks

Learn how Feature Pyramid Networks build multi-scale feature representations through top-down pathways and lateral connections for robust object detection.

deep-learning architectures object-detection computer-vision

6 min read · Concept

August 5, 2025

Convolution Operation: The Foundation of CNNs

Interactive guide to convolution in CNNs: visualize sliding windows, kernels, stride, padding, and feature detection with step-by-step demos.

deep-learning neural-nets architectures computer-vision

10 min read · Concept

August 5, 2025

Dilated Convolutions: Expanding Receptive Fields Efficiently

Understand dilated (atrous) convolutions: how dilation rates expand receptive fields exponentially without extra parameters and how to avoid gridding artifacts.

deep-learning neural-nets architectures optimization

10 min read · Concept
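
The "exponential growth without extra parameters" claim in the dilated convolution summary above can be checked with simple arithmetic. The sketch below assumes stride-1 convolutions stacked along one spatial axis, where each layer adds `(kernel_size - 1) * dilation` to the receptive field; `receptive_field` is a hypothetical helper, not a library function.

```python
def receptive_field(kernel_size, dilations):
    """Receptive field (along one axis) of stacked stride-1 convolutions.

    Each layer widens the field by (kernel_size - 1) * dilation.
    """
    rf = 1
    for d in dilations:
        rf += (kernel_size - 1) * d
    return rf


# Four 3-wide layers without dilation: the field grows linearly.
plain = receptive_field(3, [1, 1, 1, 1])

# Same four layers with doubling dilation rates: the field grows as 2^(L+1) - 1.
dilated = receptive_field(3, [1, 2, 4, 8])

print(plain, dilated)  # 9 31
```

Both stacks use the same number of weights per layer, since dilation only spaces out the kernel taps. With doubling rates, the receptive field after `L` layers of 3-wide kernels is `2^(L+1) - 1`, compared with `2L + 1` for the undilated stack.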

August 5, 2025

Receptive Field in CNNs

Understand receptive fields in CNNs: how convolutional layers expand their field of view and the gap between theoretical and effective receptive fields.

deep-learning neural-networks architectures computer-vision

7 min read · Concept

August 5, 2025

VAE Latent Space: Understanding Variational Autoencoders

Explore VAE latent space in deep learning. Learn variational autoencoder encoding, decoding, interpolation, and the reparameterization trick.

deep-learning architectures neural-nets training

6 min read · Concept

April 8, 2025

CLS Token in Vision Transformers

Learn how the CLS token acts as a global information aggregator in Vision Transformers, enabling whole-image classification through attention mechanisms.

deep-learning attention architectures vision-transformers

8 min read · Concept

April 8, 2025

Hierarchical Attention in Vision Transformers

Explore how hierarchical attention lets Vision Transformers (ViT) build multi-scale representations by attending locally at fine resolution and more globally at coarser stages.

deep-learning attention architectures optimization

6 min read · Concept

April 8, 2025

Multi-Head Attention in Vision Transformers

Explore how multi-head attention lets Vision Transformers (ViT) attend to several kinds of relationships between image patches in parallel, with each head working in its own learned subspace.

deep-learning attention architectures neural-nets

6 min read · Concept

April 8, 2025

Positional Embeddings in Vision Transformers

Explore how positional embeddings let Vision Transformers (ViT) keep track of where each image patch sits, since self-attention on its own ignores patch order.

deep-learning attention architectures neural-nets

5 min read · Concept

April 8, 2025

Interactive Look: Self-Attention in Vision Transformers

Explore how self-attention enables Vision Transformers (ViT) to understand images by capturing global context, with CNN comparison.

deep-learning attention architectures neural-nets

6 min read · Concept

January 21, 2025

Adaptive Tiling: Efficient Visual Token Generation

Learn adaptive tiling in vision transformers: dynamically partition images based on visual complexity to reduce token counts while preserving detail.

deep-learning architectures optimization attention

7 min read · Concept

April 1, 2024

Skip Connections in Neural Networks

Learn how skip connections and residual learning enable training of very deep neural networks. Understand the ResNet revolution with interactive visualizations.

deep-learning architectures neural-networks training

9 min read · Concept