SemiAnalysis发推称,在Cerebras上运行深度编码模型需24个系统(2400万美元资本支出)仅支持256并发用户,而同等资金下标准GB300机架能提供更多内存带宽。
Running a single deep coding model at max context on Cerebras requires 24 systems ($24M Capex) just to support 256 concurrent users. At that scale, $100M gets you way more memory bandwidth in standard GB300 racks. https://t.co/ilwBz0GMUW
likes: 118 | retweets: 3 | replies: 4 | views: 12930