推文提及DeepSeek V4项目,其目标之一是实现在极端序列长度下的极低推理成本。
RT @teortaxesTex: There are two parts of DeepSeek V4 project. - How do we make inference very cheap, even at extreme sequence lengths and…
likes: 23 | retweets: 1 | replies: 0 | views: 6839