Google为开源生产Kubernetes分布式推理工具llm-d添加了夜间CI。TPU在llm-d的CI和代码质量上正在追赶NVIDIA。AMD尚未将其GPU或NIC加入该CI。
TPU ALERT: For OSS production Kubernetes distributed inferencing, Google just added nightly CI for llm-d. Great step by Google to start enabling the wider ML community for TPUs. TPU is catching up to NVIDIA for llm-d CI & code quality. In comparison, although AMD's official recommended production kubernetes inferencing solution is llm-d, @AnushElangovan has yet to add any AMD GPUs or AMD NICs into the CI.
likes: 88 | retweets: 11 | replies: 2 | views: 11240