Posts | Sean Lobjoit

VRAM Swap, Two Weeks In: Multithreading and Killing the Deadlock

Sean Lobjoit·12 June 2026·5 min read

The nbd-vram daemon got a thread pool, ~4x concurrent 4K IOPS, and the swap-pressure freeze is finally gone. Fresh benchmarks against NVMe.

LinuxCUDANVIDIASystems

Read →

Retrieval Is the Hard Part of RAG, Not Generation

Sean Lobjoit·10 June 2026·4 min read

Most RAG accuracy problems are retrieval misses, not model failures. Here is how to measure and fix the part that actually breaks.

RAGAI InfrastructureVector DatabasesLLMs

Read →

Fitting an LLM Into VRAM That Isn't There

Sean Lobjoit·3 June 2026·3 min read

CUDA Unified Memory can extend effective VRAM for LLM inference. The catch is PCIe bandwidth, which makes it slower than just splitting layers to CPU for any meaningful overflow.

LinuxCUDANVIDIALLM

Read →

7GB of VRAM as Swap, No Kernel Module Required

Sean Lobjoit·1 June 2026·4 min read

NVIDIA's P2P API is silently blocked on consumer GPUs. I've found a path around it in the form of an NBD server over a Unix socket that turns VRAM into a real swap device.

LinuxCUDANVIDIASystems

Read →

Your Database Schema Is Not Zero-Downtime

Sean Lobjoit·25 May 2026·5 min read

Zero-downtime deployments don't protect you from schema migrations. Here's how table locks bring down production, and how to fix it.

DatabasesPostgresSRESystem Design

Read →

Page 1 of 6