SGL Project和Radixark团队优化了DeepSeek V4在B200和B300上的推理性能,并在GB300上实现了4倍交互吞吐量提升。
Amazing work from the @sgl_project and @radixark team for their work optimizing DeepSeek V4 inference on B200, B300, and the recent 4x iso-interactivity throughput improvements on GB300 by @ChengWan17! As @elonmusk said, The GB300 is the best AI computer, and software optimizations like this show its true potential!
likes: 143 | retweets: 21 | replies: 4 | views: 12417