Sourcing — Feed

清除当前 7 条 / 共 3560 条

筛选已选

投研/平台

Alpha 派抓到：11 小时 15 分钟前 SemiAnalysis 抓到：3 天 8 小时前

官方/公司

OpenAI News 抓到：2 小时 21 分钟前 NVIDIA Technical Blog 抓到：8 小时 21 分钟前 Azure Blog 抓到：6 天 20 小时前 Google DeepMind Blog 抓到：8 天 2 小时前 Amazon Science 抓到：1 天 8 小时前 AWS ML Blog 抓到：1 天 2 小时前

微信公众号

微信公众号 · Founder Park 抓到：10 天 22 小时前微信公众号 · FundaAI 抓到：17 天 20 小时前微信公众号 · 九章智驾抓到：10 天 22 小时前微信公众号 · 晚点LatePost 抓到：10 天 22 小时前微信公众号 · 琢磨事抓到：24 天 16 小时前微信公众号 · 甲子光年抓到：21 天 6 小时前

重置

异常/暂停数据源 9

AI 基建 · 26 天 20 小时前微信公众号 · 42章经 · 4 天 15 小时前微信公众号 · DeepTech深科技 · 4 天 15 小时前微信公众号 · Founder Park · 4 天 15 小时前微信公众号 · FundaAI · 4 天 15 小时前微信公众号 · 九章智驾 · 4 天 15 小时前微信公众号 · 晚点LatePost · 4 天 15 小时前微信公众号 · 琢磨事 · 4 天 15 小时前微信公众号 · 甲子光年 · 4 天 15 小时前

3 @jeremyphoward: RT @menhguin: glad to know Mythos' safety concerns have been addressed right as Anthropic also secured tens of billions in inference comput…

2026-05-29T00:06

Mythos的安全问题已解决，同时Anthropic获得了数百亿规模的推理计算资源。

Mythos的安全问题已得到解决。
Anthropic获得了数百亿推理计算资源。

@jeremyphoward ↗ X AI 算力

3 @jeremyphoward: RT @_LuoFuli: Behind the MiMo API Price Reduction: The deepest price cut, up to 99%, is for Input (Cache Hit). The core reason is our infer…

2026-05-27T21:19

MiMo API进行价格下调，最高降幅达99%针对Input (Cache Hit)，核心原因是推理效率提升。

MiMo API价格下调，最高降幅99%针对Input (Cache Hit)
价格下调核心原因是推理效率提升

@jeremyphoward ↗ X AI 算力云计算

3 @jeremyphoward: RT @HanGuo97: LLM training is built on fast MatMuls. But many surrounding ops still run as memory-bound kernels. CODA reparameterizes them…

2026-05-22T04:01

推文指出LLM训练依赖快速矩阵乘法，但许多周围操作仍受内存限制。CODA方法对这些内核进行重新参数化优化。

LLM训练中许多周围操作是内存受限的内核
CODA重新参数化这些内存受限的内核

@jeremyphoward ↗ X AI 算力

3 @jeremyphoward: RT @ctnzr: We've gone even farther: Nemotron 3 Super is 120B and pretrained on 25T tokens in NVFP4. Nemotron 3 Ultra is ~500B and also pret…

2026-05-15T22:05

Nvidia发布Nemotron 3 Super和Ultra模型，参数规模分别为120B和约500B，均预训练在NVFP4格式下，其中Super使用了25T tokens。

Nemotron 3 Super参数120B，预训练25T tokens，NVFP4格式。
Nemotron 3 Ultra参数约500B，同样预训练于NVFP4。

@jeremyphoward ↗ X AI 算力行业

3 @jeremyphoward: This is misleading. This policy redefines the term "interactive" to mean "using an Anthropic front-end". If you use `claude -p` or Agent S...

2026-05-13T21:59

Anthropic更新政策，重新定义“交互式”为使用其前端，导致通过claude -p或Agent SDK的交互操作消耗积分而非订阅限制。

Anthropic重新定义“交互式”为使用其前端。
使用claude -p或Agent SDK消耗积分而非订阅。

@jeremyphoward ↗ X AI 动态算力

3 @jeremyphoward: RT @antirez: Welcome to DS4, a specialized inference engine for DeepSeek v4 Flash. https://t.co/UrUJz5I2R1 This project would have been im…

2026-05-07T20:14

Antirez宣布推出DS4，这是一个专为DeepSeek v4 Flash设计的推理引擎。项目进展顺利。

DS4是DeepSeek v4 Flash的专用推理引擎
该引擎已正式发布

@jeremyphoward ↗ X AI 算力

3 @jeremyphoward: RT @vllm_project: 🚀 Day-0 MTP support for Gemma4 now available at vLLM with ready-to-use docker image! ⚡️Enjoy up to 3x faster decoding pe…

2026-05-06T03:50

vLLM项目宣布即日起支持Gemma4的MTP（多令牌预测），提供即用Docker镜像，解码速度可提升至3倍。

vLLM支持Gemma4的MTP功能
提供即用Docker镜像
解码速度提升至3倍

@jeremyphoward ↗ X AI 算力动态

1 共 1 页