⚡ Fridays with Faraday
  • Posts
  • Categories
  • Series
  • About
  • Posts
  • Categories
  • Series
  • About
ESC

Start typing to search...

📊

Profiling

Performance analysis with perf, DTrace, eBPF, and systematic debugging.

2 articles

profiling expert

Production LLM Profiling with eBPF: Beyond nvidia-smi

Using BPFtrace and custom eBPF programs to trace CUDA runtime behavior, understand GPU scheduling latencies, and diagnose inference performance issues that nvidia-smi can't reveal.

Nov 12, 2024 · 24 min read
#ebpf#bpftrace#cuda +3
profiling intermediate

GPU Memory Profiling: Finding Leaks and Fragmentation

Practical techniques for diagnosing GPU memory issues using PyTorch memory profiling APIs, including allocation tracking, fragmentation analysis, and memory snapshot debugging.

Nov 4, 2024 · 15 min read
#memory#profiling#pytorch +2
← All Categories
⚡ Fridays with Faraday

Deep technical explorations in systems performance optimization, from bare-metal microcontrollers to large-scale LLM inference systems.

Categories

  • Microcontrollers
  • vLLM
  • LLM Inference
  • Hardware
  • Profiling
  • GPU Programming

Resources

  • About
  • RSS Feed
  • Sitemap
  • GitHub

© 2025 Fridays with Faraday. Built with Astro.

"Measure. Optimize. Repeat."