Technical Articles - Page 3

Deep dive into machine learning, computer vision, and software engineering. Expert insights on AI, local LLMs, quantization, and practical implementation details from real-world projects.

Articles - Page 3

GPU Boot Errors: initramfs and Driver Conflicts

January 25, 2025

Fix Linux GPU boot errors: nouveau vs NVIDIA driver conflicts, initramfs solutions, and the early driver loading chicken-and-egg problem.

linux gpu nvidia+12

January 25, 2025

Interactive H.264 Guide: Video Compression Visuals

January 24, 2025

Interactive H.264 video compression guide with visualizations. Explore motion estimation, DCT transforms, quantization, and rate-distortion optimization.

h264 video compression codec+13

January 24, 2025

GGML File Structure: Quantized Model Format Guide

January 22, 2025

Understand GGML file structure and quantization formats used by local LLMs. Visual guide to how llama.cpp stores and loads model weights efficiently.

ggml llm weight files+4

January 22, 2025

Quantization Deep Dive: From FP32 to INT4

January 9, 2025

Master neural network quantization with interactive visualizations. Explore QAT, PTQ, GPTQ, AWQ, and SmoothQuant methods for efficient model deployment.

Quantization Model Compression INT8+7

January 9, 2025

How TensorRT Works: NVIDIA Inference Optimization

January 8, 2025

Explore TensorRT optimization: layer fusion, INT8 quantization, kernel auto-tuning, and deployment strategies with 8+ interactive visualizations.

TensorRT GPU Optimization Deep Learning+5

January 8, 2025

Kernel Fusion: Boosting Neural Network Performance

December 12, 2024

Dive deep into Kernel Fusion, a technique that combines multiple neural network operations into unified kernels improving performance in deep learning models.

kernel fusion neural networks performance+4

December 12, 2024

Showing 13-18 of 27 articles