nvidia.com

Command Palette

Search for a command to run...

KV Block Manager — NVIDIA Dynamo Documentation

Last updated: 12/12/2025

Title: KV Block Manager — NVIDIA Dynamo Documentation

URL Source: https://docs.nvidia.com/dynamo/archive/0.6.1/kvbm/kvbm_intro.html

Published Time: Sat, 08 Nov 2025 00:28:54 GMT

Markdown Content: Skip to main content

Back to top Ctrl+K

Image 1: NVIDIA Dynamo Documentation - HomeImage 2: NVIDIA Dynamo Documentation - Home NVIDIA Dynamo Documentation

latest

latest0.6.10.6.00.5.10.5.00.4.10.4.00.3.20.3.10.3.00.2.10.2.0

Search Ctrl+K

Search Ctrl+K

Image 3: NVIDIA Dynamo Documentation - HomeImage 4: NVIDIA Dynamo Documentation - Home NVIDIA Dynamo Documentation

latest

latest0.6.10.6.00.5.10.5.00.4.10.4.00.3.20.3.10.3.00.2.10.2.0

Table of Contents

Getting Started

Kubernetes Deployment

User Guides

Components

Design Docs

KV Block Manager#

The Dynamo KV Block Manager (KVBM) is a scalable runtime component designed to handle memory allocation, management, and remote sharing of Key-Value (KV) blocks for inference tasks across heterogeneous and distributed environments. It acts as a unified memory layer for frameworks like vLLM, SGLang, and TRT-LLM.

It offers:

  • A unified memory API that spans GPU memory(in future) , pinned host memory, remote RDMA-accessible memory, local or distributed pool of SSDs and remote file/object/cloud storage systems.

  • Support for evolving block lifecycles (allocate → register → match) with event-based state transitions that storage can subscribe to.

  • Integration with NIXL, a dynamic memory exchange layer used for remote registration, sharing, and access of memory blocks over RDMA/NVLink.

The Dynamo KV Block Manager serves as a reference implementation that emphasizes modularity and extensibility. Its pluggable design enables developers to customize components and optimize for specific performance, memory, and deployment needs.

Feature
BackendLocal
Kubernetes
LLM FrameworkvLLM
TensorRT-LLM
SGLang
Serving TypeAggregated
Disaggregated

previous SLA-based Plannernext Motivation behind KVBM

Image 5: NVIDIAImage 6: NVIDIA

Privacy Policy | Your Privacy Choices | Terms of Service | Accessibility | Corporate Policies | Product Security | Contact

Copyright © 2024-2025, NVIDIA CORPORATION & AFFILIATES.

Image 7Image 8Image 9

Links/Buttons:

Related Articles