Computer Vision

Object detection, feature pyramids, and visual recognition techniques.

8 concepts

All Computer Vision Concepts

August 5, 2025

Feature Pyramid Networks (FPN) Explained

Learn how Feature Pyramid Networks build multi-scale feature representations through top-down pathways and lateral connections for robust object detection.

deep-learning architectures object-detection computer-vision

No direct links0 refs

December 26, 2024

ASFF: Adaptive Spatial Feature Fusion

Learning where to fuse multi-scale features with per-pixel, per-level fusion weights. ASFF challenges FPN's uniform fusion assumption.

Object Detection Feature Fusion FPN Multi-Scale Spatial Attention YOLO

No direct links0 refs

December 26, 2024

RoI Pooling, RoI Align & Deformable RoI Pooling

Understanding region-based feature extraction for object detection, from quantized pooling to sub-pixel alignment and adaptive sampling

Object Detection RoI Pooling RoI Align Faster R-CNN Mask R-CNN Computer Vision

No direct links0 refs

December 24, 2024

Anchor-Based vs Anchor-Free Object Detection

Compare anchor-based vs anchor-free object detection: Faster R-CNN and RetinaNet anchors vs FCOS and CenterNet point-based methods.

Object Detection Anchors FCOS CenterNet Computer Vision Deep Learning

No direct links0 refs

December 24, 2024

NAS-FPN: Learning to Design Feature Pyramid Networks

Understanding how neural architecture search discovers optimal feature pyramid architectures that outperform hand-designed alternatives

Object Detection NAS Feature Pyramids Neural Architecture Search Computer Vision

No direct links0 refs

December 23, 2024

DETR Explained: Object Detection with Transformers

Understanding end-to-end object detection with transformers, from DETR's object queries to bipartite matching and attention-based localization

Object Detection DETR Transformers Computer Vision Deep Learning Attention Mechanisms

No direct links0 refs

December 23, 2024

NMS & Soft-NMS: Removing Duplicate Detections

Understanding Non-Maximum Suppression algorithms for object detection post-processing, from greedy NMS to soft variants

Object Detection NMS Soft-NMS Computer Vision Post-Processing

No direct links0 refs

December 20, 2024

Visual Complexity Analysis for Token Allocation

Learn how visual complexity analysis optimizes vision transformer token allocation using edge detection, FFT, and entropy metrics.

Computer Vision Vision Transformers Token Allocation Edge Detection FFT Information Theory

No direct links0 refs