← 返回列表

@NVIDIAAI: Introducing Dynamo Snapshot, our approach for fast startup for inference workloads on Kubernetes, which reduces startup time from minutes to...

@NVIDIAAI 3 信息等级 3 1 噪音/剔除;2 较弱;3 普通事实;4 重要行业动态;5 极重大事件。该分数是信息显著性,不是投资建议。 发布:2026-05-27T23:56 抓取:2026-05-28 05:18
🔗 原文链接
摘要

NVIDIA 推出 Dynamo Snapshot 技术,用于 Kubernetes 上的推理工作负载快速启动,将启动时间从分钟级降至5秒以内。该技术利用 GMS 实现并发权重恢复,并加速 CRIU 恢复性能,旨在应对生产环境中推理部署的波动需求。

客观事实
  • Dynamo Snapshot 将启动时间从分钟级降至5秒内
  • 技术利用 GMS 实现并发权重恢复和加速 CRIU 恢复
  • 针对 Kubernetes 上推理工作负载的快速启动
NVIDIA Dynamo Snapshot Kubernetes

原文

Introducing Dynamo Snapshot, our approach for fast startup for inference workloads on Kubernetes, which reduces startup time from minutes to under 5 seconds.

In production inference deployments demand fluctuates over time. Cold-starting inference workloads can take minutes, leaving idle GPUs that generate no tokens and serve no requests.

Snapshot leverages GMS to enable concurrent weight restoration over a high-speed interconnect, while using Linux native AIO and parallel memfd restoration to accelerate CRIU restore performance.

likes: 188 | retweets: 33 | replies: 9 | views: 19382