← 返回列表

@teortaxesTex: Huawei has finally credibly (?) pretrained a big LLM on Ascends. "hyper-node optimized training" suggests 950s I guess. Builds on DSA ("with...

@teortaxesTex 3 信息等级 3 1 噪音/剔除;2 较弱;3 普通事实;4 重要行业动态;5 极重大事件。该分数是信息显著性,不是投资建议。 发布:2026-06-12T14:53 抓取:2026-06-12 17:20
🔗 原文链接
摘要

华为在昇腾芯片上成功预训练了一个大语言模型,采用超节点优化训练和DSA技术,旨在证明其硬件能力。

客观事实
  • 华为在昇腾芯片上预训练大语言模型
  • 采用超节点优化训练和DSA技术
  • 华为意在证明其硬件可完成大模型训练
华为 昇腾

原文

Huawei has finally credibly (?) pretrained a big LLM on Ascends. "hyper-node optimized training" suggests 950s I guess. Builds on DSA ("with SWA"). They want to prove it can be done on their hardware.
What is ModAttn?
(pics from Reddit, some translations are off) https://t.co/sxJJrBwb49

likes: 53 | retweets: 1 | replies: 3 | views: 5719