Sourcing — Feed

清除当前 3 条 / 共 3560 条

筛选已选

投研/平台

Alpha 派抓到：11 小时 15 分钟前 SemiAnalysis 抓到：3 天 8 小时前

官方/公司

OpenAI News 抓到：2 小时 21 分钟前 NVIDIA Technical Blog 抓到：8 小时 21 分钟前 Azure Blog 抓到：6 天 20 小时前 Google DeepMind Blog 抓到：8 天 2 小时前 Amazon Science 抓到：1 天 8 小时前 AWS ML Blog 抓到：1 天 2 小时前

微信公众号

微信公众号 · Founder Park 抓到：10 天 22 小时前微信公众号 · FundaAI 抓到：17 天 20 小时前微信公众号 · 九章智驾抓到：10 天 22 小时前微信公众号 · 晚点LatePost 抓到：10 天 22 小时前微信公众号 · 琢磨事抓到：24 天 16 小时前微信公众号 · 甲子光年抓到：21 天 6 小时前

重置

异常/暂停数据源 9

AI 基建 · 26 天 20 小时前微信公众号 · 42章经 · 4 天 15 小时前微信公众号 · DeepTech深科技 · 4 天 15 小时前微信公众号 · Founder Park · 4 天 15 小时前微信公众号 · FundaAI · 4 天 15 小时前微信公众号 · 九章智驾 · 4 天 15 小时前微信公众号 · 晚点LatePost · 4 天 15 小时前微信公众号 · 琢磨事 · 4 天 15 小时前微信公众号 · 甲子光年 · 4 天 15 小时前

3 @jeremyphoward: RT @ahatamiz1: Gated DeltaNet-2 is here. 🚀 🔥 New paper: Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention Gated DeltaNet-2…

2026-05-22T00:58

新论文Gated DeltaNet-2发布，提出在线性注意力中解耦擦除和写入操作，是一项AI研究进展。

Gated DeltaNet-2论文正式发布
论文主题是解耦线性注意力中的擦除与写入

@jeremyphoward ↗ X AI 研究

3 @jeremyphoward: RT @haopeng_uiuc: Excited to share our new paper: RoPE Distinguishes Neither Positions Nor Tokens in Long Contexts, Provably LLMs often fa…

2026-05-19T22:47

伊利诺伊大学香槟分校研究人员发表论文，证明旋转位置编码（RoPE）在长上下文任务中既不能区分位置也不能区分token，对LLM长上下文理解提出挑战。

新论文证明RoPE在长上下文中无法区分位置和token

@jeremyphoward ↗ X AI 研究

3 @jeremyphoward: RT @NousResearch: Today we release Token Superposition Training (TST), a modification to the standard LLM pretraining loop that produces a…

2026-05-13T22:44

NousResearch 发布 Token Superposition Training (TST)，一种对标准大语言模型预训练循环的修改，旨在提升训练效果。该发布受到广泛关注，推文获得 2600 点赞、283 次转发。

NousResearch 发布 Token Superposition Training (TST)
TST 是一种对标准 LLM 预训练循环的修改
推文获得 2600 点赞、283 次转发

@jeremyphoward ↗ X AI 研究

1 共 1 页