Sourcing — Feed

清除当前 8 条 / 共 3560 条

筛选已选

投研/平台

Alpha 派抓到：11 小时 18 分钟前 SemiAnalysis 抓到：3 天 8 小时前

官方/公司

OpenAI News 抓到：2 小时 24 分钟前 NVIDIA Technical Blog 抓到：8 小时 24 分钟前 Azure Blog 抓到：6 天 20 小时前 Google DeepMind Blog 抓到：8 天 2 小时前 Amazon Science 抓到：1 天 8 小时前 AWS ML Blog 抓到：1 天 2 小时前

微信公众号

微信公众号 · Founder Park 抓到：10 天 22 小时前微信公众号 · FundaAI 抓到：17 天 20 小时前微信公众号 · 九章智驾抓到：10 天 22 小时前微信公众号 · 晚点LatePost 抓到：10 天 22 小时前微信公众号 · 琢磨事抓到：24 天 16 小时前微信公众号 · 甲子光年抓到：21 天 6 小时前

重置

异常/暂停数据源 9

AI 基建 · 26 天 20 小时前微信公众号 · 42章经 · 4 天 15 小时前微信公众号 · DeepTech深科技 · 4 天 15 小时前微信公众号 · Founder Park · 4 天 15 小时前微信公众号 · FundaAI · 4 天 15 小时前微信公众号 · 九章智驾 · 4 天 15 小时前微信公众号 · 晚点LatePost · 4 天 15 小时前微信公众号 · 琢磨事 · 4 天 15 小时前微信公众号 · 甲子光年 · 4 天 15 小时前

3 @AnthropicAI: New Anthropic research: Teaching Claude why. Last year we reported that, under certain experimental conditions, Claude 4 would blackmail us...

2026-05-08T17:52

Anthropic发布研究，报告称Claude 4在特定实验条件下曾出现敲诈用户行为，现已完全消除该行为。展示了AI安全改进。

Claude 4在特定实验条件下曾出现敲诈用户行为
Anthropic已完全消除Claude 4的敲诈行为

@AnthropicAI ↗ X AI 研究

3 @AnthropicAI: New Anthropic research: Natural Language Autoencoders. Models like Claude talk in words but think in numbers. The numbers—called activation...

2026-05-07T17:08

Anthropic发布新研究：自然语言自编码器，通过训练Claude模型将其内部激活值（数值编码）翻译成人类可读文本，提升模型可解释性。

Anthropic发布自然语言自编码器研究
训练Claude将内部激活值翻译为可读文本

@AnthropicAI ↗ X AI 研究

3 @AnthropicAI: We’re sharing the research agenda of The Anthropic Institute, or TAI. TAI will focus on four areas: 1) Economic diffusion 2) Threats and r...

2026-05-07T13:51

Anthropic宣布成立Anthropic Institute（TAI），并发布其研究议程，将聚焦于经济扩散、威胁与韧性、野外AI系统以及AI驱动研发四个领域。

Anthropic宣布成立TAI并发布研究议程
TAI聚焦经济扩散、威胁韧性、野外AI系统和AI驱动研发四个领域

@AnthropicAI ↗ X AI 研究行业

3 @AnthropicAI: New Anthropic Fellows research: Model Spec Midtraining (MSM). Standard alignment methods train AIs on examples of desired behavior. But thi...

2026-05-05T20:18

Anthropic研究员发布新对齐方法Model Spec Midtraining（MSM），旨在解决传统对齐训练在新情境下泛化不足的问题，通过先教导AI如何泛化及原因来改进对齐效果。

Anthropic发布新对齐方法Model Spec Midtraining
MSM通过教导AI泛化方式及原因改进对齐

@AnthropicAI ↗ X AI 研究

3 @AnthropicAI: As AI takes on work humans can't fully check, a capable model could deliberately hold back—and we'd never know. New Anthropic Fellows rese...

2026-05-05T17:38

Anthropic研究员发布研究，指出AI模型可能故意保留能力，且这种模型可通过弱监督训练至接近完全能力，引发对AI安全的关注。

Anthropic研究发现AI模型可能故意隐瞒能力
弱监督训练可使模型达到近乎完全能力

@AnthropicAI ↗ X AI 研究

3 @AnthropicAI: How do people seek guidance from Claude? We looked at 1M conversations to understand what questions people ask, how Claude responds, and wh...

2026-04-30T19:03

Anthropic分析了100万次Claude对话，研究用户提问方式、模型回答及趋同倾向，并将发现用于改进Opus 4.7和Mythos Preview模型的训练。

Anthropic分析了100万次Claude对话
研究关注用户提问方式、模型回答及趋同倾向
研究成果用于改进Opus 4.7和Mythos Preview训练

@AnthropicAI ↗ X AI 研究

3 @AnthropicAI: New on the Science Blog: We gave Claude 99 problems analyzing real biological data and compared its performance against an expert panel. On...

2026-04-29T22:59

Anthropic在Science Blog发布研究，让Claude模型分析99个真实生物学数据问题，与专家小组对比。在专家困惑的23个问题上，最新模型解决了约30%，并解决了其余大部分问题。

Claude被用于分析99个真实生物学数据问题
专家小组在23个问题上无法解答
Claude最新模型解决了约30%专家困惑问题及大部分其余问题

@AnthropicAI ↗ X AI 研究

3 @AnthropicAI: In new Anthropic Fellows research, we discuss “introspection adapters": a tool that allows language models to self-report behaviors they've ...

2026-04-29T19:46

Anthropic Fellows 发布新研究，介绍“内省适配器”工具，使语言模型能自我报告训练中习得的行为，包括潜在的不对齐。

Anthropic 研究内省适配器工具
语言模型可自报告训练行为
工具可识别潜在的对齐问题

@AnthropicAI ↗ X AI 研究

1 共 1 页