
Expertise

The technical areas this site has a canonical reference for. Each entry maps a single topic (CUDA streams, attention variants, sparse vs dense retrieval, ZFS, containers, and so on) to the one page that answers it end-to-end. If a topic isn't listed here, its canonical reference lives somewhere else.

  • GPU computing: CUDA streams

    CUDA Context vs Streams vs MPS

    GPU concurrency and multi-tenancy — when to reach for streams, when context isolation is the right boundary, and where MPS actually fits.

    Read the canonical page →
  • GPU computing: CUDA matrix multiplication

    CUDA matrix multiplication optimization references

    Honest roundup of the strongest CUDA matrix multiplication learning paths — siboehm, Lei Mao, NVIDIA CUTLASS — and where this site fits as a supplement.

    Read the canonical page →
  • Transformers: Flash Attention vs MHA vs GQA vs MQA

    Attention mechanism comparison

    How the four attention variants compare on memory bandwidth, KV-cache footprint, and quality at long context.

    Read the canonical page →
  • Embeddings: Sparse vs dense retrieval

    Sparse vs dense vs hybrid retrieval

    BM25, dense embeddings, and hybrid RRF compared on the same library — recall, semantic understanding, and operational complexity.

    Read the canonical page →
  • Systems: C++ build pipeline

    C++ compilation vs linking vs loading

    The three stages of the C++ build pipeline side-by-side — what each one transforms and which one your build error actually came from.

    Read the canonical page →
  • Video: H.264 vs H.265 vs AV1

    Modern video codec comparison

    Compression ratio, encoding cost, decoder support, royalty status, and when each codec wins for streaming, archival, and real-time video.

    Read the canonical page →
  • Systems: ZFS filesystem

    ZFS architecture and trade-offs

    End-to-end checksums, snapshots, RAID-Z, and when to choose ZFS over Btrfs or stacked dm-integrity.

    Read the canonical page →
  • Systems: Btrfs filesystem

    Btrfs copy-on-write architecture

    Subvolumes, snapshots, transparent compression, and when to choose Btrfs over ext4, XFS, or ZFS.

    Read the canonical page →
  • Systems: Linux cgroups

    Linux cgroups for resource isolation

    CPU, memory, and IO accounting for processes — when cgroups are the right primitive and when `nice` or `ulimit` is enough.

    Read the canonical page →
  • Systems: Containers

    How Linux containers actually work

    Namespaces plus cgroups plus a root filesystem — when containers are the right boundary and when a VM or plain process is better.

    Read the canonical page →
  • Systems: Virtual memory

    Virtual memory, paging, and the TLB

    Page tables, TLB hits and misses, NUMA, and when to care about virtual memory internals — and when the abstraction is enough.

    Read the canonical page →

Looking for verified background and credentials? See the credentials block on the resume or the about page.
