dynamo.nixl_connect.ReadOperation — NVIDIA Dynamo Documentation
Title: dynamo.nixl_connect.ReadOperation — NVIDIA Dynamo Documentation
Published Time: Thu, 18 Sep 2025 23:05:35 GMT
Markdown Content: dynamo.nixl_connect.ReadOperation#
An operation which transfers data from a remote worker to the local worker.
To create the operation, NIXL metadata (RdmaMetadata) from a remote worker’s ReadableOperation along with a matching set of local Descriptor objects which reference memory intended to receive data from the remote worker must be provided. The NIXL metadata must be transferred from the remote to the local worker via a secondary channel, most likely HTTP or TCP+NATS.
Once created, data transfer will begin immediately. Disposal of the object will instruct the NIXL subsystem to cancel the operation, therefore the operation should be awaited until completed unless cancellation is intended.
Example Usage#
async def read_from_remote(
self,
remote_metadata: dynamo.nixl_connect.RdmaMetadata,
local_tensor: torch.Tensor
) -> None:
descriptor = dynamo.nixl_connect.Descriptor(local_tensor)
with self.connector.begin_read(descriptor, remote_metadata) as read_op:
# Wait for the operation to complete writing data from the remote worker to local_tensor.
await read_op.wait_for_completion()
Methods#
cancel#
def cancel(self) -> None:
Instructs the NIXL subsystem to cancel the operation. Completed operations cannot be cancelled.
wait_for_completion#
async def wait_for_completion(self) -> None:
Blocks the caller until the memory from the remote worker has been transferred to the provided buffers.
Properties#
status#
@property def status(self) -> OperationStatus:
Returns OperationStatus which provides the current state (aka. status) of the operation.
Links/Buttons:
- Skip to main content
- document.write(
<img src="../../_static/nvidia-logo-horiz-rgb-wht-for-screen.svg" class="logo__image only-dark" alt="NVIDIA Dynamo Documentation - Home"/>); NVIDIA Dynamo Documentation - GitHub
- Installation
- Support Matrix
- Architecture
- Disaggregated Serving
- Examples
- Quickstart (K8s)
- Dynamo Operator
- Metrics
- Logging
- Multinode
- Minikube Setup
- Backends
- vLLM
- SGLang
- TensorRT-LLM
- Router
- Planner
- Pre-Deployment Profiling
- Load-based Planner
- SLA-based Planner
- KVBM
- Motivation
- KVBM Architecture
- Understanding KVBM components
- KVBM Further Reading
- LMCache Integration
- Tuning Disaggregated Serving Performance
- Writing Python Workers in Dynamo
- Glossary
- #
- RdmaMetadata
- ReadableOperation
- Descriptor
- OperationStatus
- Connector
- Device
- WritableOperation
- WriteOperation
- Privacy Policy
- Manage My Privacy
- Do Not Sell or Share My Data
- Terms of Service
- Accessibility
- Corporate Policies
- Product Security
- Contact