← 返回列表

@swyx: IMO DeepSeek v4 demonstrated utter confidence and competence by not benchmaxxing, not focusing on some BS final run cost, not even spending ...

@swyx 3 信息等级 3 1 噪音/剔除;2 较弱;3 普通事实;4 重要行业动态;5 极重大事件。该分数是信息显著性,不是投资建议。 发布:2026-04-29T19:21 抓取:2026-05-03 15:25
🔗 原文链接
摘要

DeepSeek v4 发布,展示长上下文效率技术 CSA、HCA、mHC 等,成本仅为 pro 版本的 8%,并推出最佳开源基础模型。

客观事实
  • DeepSeek v4 展示了长上下文效率技术
  • 其成本仅为 pro 版本的 8%
  • 发布了最佳开源基础模型
DeepSeek

原文

IMO DeepSeek v4 demonstrated utter confidence and competence by not benchmaxxing, not focusing on some BS final run cost, not even spending inference-optimal compute.

just showed up, demonstrated SOTA long context efficiency techniques (CSA, HCA, mHC, flash at 8% cost of pro, which itself is 14% cost of opus), dropped the best open base models in the world, peaced out.

BYO posttraining. leave that to the agent labs to pick up the scraps. bravo.

likes: 1356 | retweets: 71 | replies: 66 | views: 103683