Dynamo NIXL Connect — Dynamo
Title: Dynamo NIXL Connect — Dynamo
URL Source: https://docs.nvidia.com/dynamo/archive/0.4.0/API/nixl_connect/README.html?userAgent=PromptingBot%2F1.0.0
Published Time: Fri, 22 Aug 2025 17:35:03 GMT
Markdown Content: Dynamo NIXL Connect#
Dynamo connect provides utilities for using the NIXL base RDMA subsystem via a set of Python classes. The primary goal of this library to simplify the integration of NIXL based RDMA into inference applications. The dynamo.nixl_connect library can be imported by any Dynamo container hosted application.
import dynamo.nixl_connect
All operations using the NIXL Connect library begin with the Connector class and the type of operation required. There are four types of supported operations:
- Register local readable memory:
Register local memory buffer(s) with the RDMA subsystem to enable a remote worker to read from.
- Register local writable memory:
Register local memory buffer(s) with the RDMA subsystem to enable a remote worker to write to.
- Read from registered, remote memory:
Read remote memory buffer(s), registered by a remote worker to be readable, into local memory buffer(s).
- Write to registered, remote memory:
Write local memory buffer(s) to remote memory buffer(s) registered by a remote worker to writable.
By connecting correctly paired operations, high-throughput GPU Direct RDMA data transfers can be completed. Given the list above, the correct pairing of operations would be 1 & 3 or 2 & 4. Where one side is a “(read|write)-able operation” and the other is its correctly paired “(read|write) operation”. Specifically, a read operation must be paired with a readable operation, and a write operation must be paired with a writable operation.
Python Classes#
References#
Links/Buttons:
- Skip to main content
- document.write(
<img src="../../_static/nvidia-logo-horiz-rgb-wht-for-screen.svg" class="logo__image only-dark" alt="Dynamo - Home"/>); Dynamo - GitHub
- Support Matrix
- High Level Architecture
- Distributed Runtime
- Disaggregated Serving
- KV Block Manager
- Motivation
- KVBM Architecture
- Understanding KVBM components
- KVBM Further Reading
- KV Cache Routing
- Planner
- Pre-Deployment Profiling
- Load-based Planner
- SLA-based Planner
- Dynamo Architecture Flow
- Writing Python Workers in Dynamo
- Disaggregation and Performance Tuning
- Working with Dynamo Kubernetes Operator
- Dynamo Deploy Quickstart
- Dynamo Cloud Kubernetes Platform
- Manual Helm Deployment
- Minikube Setup Guide
- Model Caching with Fluid
- Hello World
- LLM Deployment Examples using VLLM
- LLM Deployment Examples using SGLang
- Multinode Examples using SGLang
- Planner Benchmark Example
- LLM Deployment Examples using TensorRT-LLM
- Glossary
- NIXL Connect API
- #
- Connector
- Descriptor
- Device
- ReadOperation
- ReadableOperation
- WritableOperation
- WriteOperation
- NVIDIA Dynamo
- NVIDIA Dynamo NIXL Connect
- NVIDIA Inference Transfer Library (NIXL)
- NVIDIA GPU Direct
- Privacy Policy
- Manage My Privacy
- Do Not Sell or Share My Data
- Terms of Service
- Accessibility
- Corporate Policies
- Product Security
- Contact