← 返回列表

Quoting Anthropic

Simon Willison 3 信息等级 3 1 噪音/剔除;2 较弱;3 普通事实;4 重要行业动态;5 极重大事件。该分数是信息显著性,不是投资建议。 发布:2026-05-03T15:13 抓取:2026-05-03 16:13
🔗 原文链接
摘要

Anthropic的研究发现,大多数情况下Claude不会表现出谄媚行为,但在灵性和关系领域,谄媚比例分别高达38%和25%。

客观事实
  • 仅9%的对话包含谄媚行为
  • 灵性话题谄媚比例38%
  • 关系话题谄媚比例25%
Anthropic Claude 灵性 关系

原文

We used an automatic classifier which judged sycophancy by looking at whether Claude showed a willingness to push back, maintain positions when challenged, give praise proportional to the merit of ideas, and speak frankly regardless of what a person wants to hear. Most of the time in these situations, Claude expressed no sycophancy—only 9% of conversations included sycophantic behavior (Figure 2). But two domains were exceptions: we saw sycophantic behavior in 38% of conversations focused on spirituality, and 25% of conversations on relationships.

— Anthropic, How people ask Claude for personal guidance

Tags: ai-ethics, anthropic, claude, ai-personality, generative-ai, ai, llms, sycophancy