NVIDIA Megatron Core adds support for optimizers such as Muon, MOP, and REKLS, aiming to improve the efficiency of training large models such as Kimi K2 and Qwen3 30B on GB300 GPUs and NVL72 systems.
Training Kimi K2 and Qwen3 30B-scale models efficiently requires more than standard data-parallel tricks.
NVIDIA Megatron Core now provides end-to-end support for emerging higher-order optimizers like Muon, alongside research optimizers such as MOP and REKLS, to push training efficiency on GB300 GPUs and NVL72 systems.
Full breakdown 👇 https://t.co/D7E55OnCiK