Anthropic研究PM Alex Albert分享了构建下一代Claude模型的五个要点:模型与外部工具紧密耦合设计、Claude自我审查记忆的‘梦境’机制、基于真实用户问题生成评估、设有研究Claude意识的团队,以及写作文化为模型提供上下文。
My top 5 takeaways from @alexalbert__ on how Anthropic is building the next Claude model:
The model and the harness are coupled. Each surface wraps the model in a different prompt and tool setup, so the same model can give different responses depending on where it runs. As a research PM, Alex has to think through how the model will perform across Claude, Cowork, Claude Code, and more.
When an agent isn't running a task, it reviews its own memories, finds contradictions, and prunes them. This “dreaming” process was inspired by how sleep helps humans process memory.
The research team uses Claude to cluster the firehose of user feedback into top themes, then generates synthetic versions of each user problem to turn into an eval. It's not just about volume either - even a few dozen well-written test cases can produce an eval for the model.
Anthropic has people whose whole job is to think about what it means for Claude to be a conscious actor. There's no official position on whether it is or isn't, but the question is taken seriously as agents take on more autonomous work.
Every written word at Anthropic becomes context Claude can pull later. From Alex: "Get things written down, make them accessible to Claude, because that's just more context that it has."
📌 Watch now: https://t.co/CJAakWq9Nd
likes: 13 | retweets: 0 | replies: 3 | views: 6739