ML Concepts - Page 3

Clear explanations of core machine learning concepts, from foundational ideas to advanced techniques. Understand attention mechanisms, transformers, skip connections, and more.

189

Total Concepts

357

Topics

Page 3

of 32

Concepts - Page 3

August 16, 2024

High Bandwidth Memory (HBM)

How HBM works: 3D-stacked DRAM, TSVs, and silicon interposers explained with interactive visualizations — from the memory wall to HBM4 and the roofline model.

hbm memory gpu bandwidth 3d-stacking tsv ai-hardware

No direct links0 refs

August 17, 2025

GPU Memory Hierarchy & Optimization

Master GPU memory hierarchy from registers to global memory, understand coalescing patterns, bank conflicts, and optimization strategies for maximum performance

GPU CUDA memory-optimization performance parallel-computing HBM cache

No direct links0 refs

January 29, 2025

Multi-GPU Communication: NVLink, PCIe, and NCCL

How GPUs talk: the bandwidth cliff from HBM to Ethernet, NVLink 5 and GB200 NVL72 topologies, ring AllReduce step by step, and choosing between NCCL, Gloo, and MPI.

multi-GPU NVLink PCIe NCCL distributed training GPU interconnect NVSwitch AllReduce InfiniBand

No direct links0 refs

March 15, 2026

Slurm GPU Allocation for Distributed Training

Complete guide to GPU allocation on Slurm — --gres flags, CUDA_VISIBLE_DEVICES remapping, GPU topology and NVLink binding, MIG partitioning, production job scripts, and debugging common GPU errors.

hpc slurm gpu-computing distributed-training

No direct links0 refs

January 6, 2025

Python Memory Management

A practical mental model for CPython memory management: names and references, object headers, PyMalloc arenas, reference counting, reuse paths, and memory profiling.

programming python memory-mgmt internals

No direct links0 refs

January 6, 2025

Filesystems: The Digital DNA of Data Storage

Explore Linux filesystems through interactive visuals. Learn VFS, compare ext4 vs Btrfs vs ZFS, and understand file operations.

linux filesystems storage

No direct links0 refs

Showing 13-18 of 189 articles