profiling expert
Production LLM Profiling with eBPF: Beyond nvidia-smi
Using BPFtrace and custom eBPF programs to trace CUDA runtime behavior, understand GPU scheduling latencies, and diagnose inference performance issues that nvidia-smi can't reveal.
profiling intermediate
GPU Memory Profiling: Finding Leaks and Fragmentation
Practical techniques for diagnosing GPU memory issues using PyTorch memory profiling APIs, including allocation tracking, fragmentation analysis, and memory snapshot debugging.