
@levie: Gemini 3.5 Flash is out, and it's a major jump over Gemini 3 Flash in model capability for knowledge work. We've been evaluating it on our B...

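@levie | Information level: 3 (scale: 1 noise/discard; 2 weak; 3 ordinary fact; 4 important industry development; 5 extremely significant event). This score measures information salience and is not investment advice. Published: 2026-05-19T18:29 | Fetched: 2026-05-19 23:19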
Summary

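Google has released Gemini 3.5 Flash. On the Box AI Complex Work Eval, it scored 12 percentage points higher than Gemini 3 Flash on complex document tasks. Scores rose in every industry reported, with healthcare up 22 percentage points and life sciences up 20. The model will soon be available in Box AI Studio and through the Box API, and the Box MCP Server will soon be available in the Gemini app.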

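Objective facts
  • Gemini 3.5 Flash has been released; Box reports a major capability jump over Gemini 3 Flash for knowledge work
  • 12 percentage point gain on Box AI complex document tasks
  • Healthcare up 22 percentage points; public sector up 17 percentage points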
Gemini 3.5 Flash, Box AI, Gemini 3 Flash, Box AI Studio, Box API, Box MCP Server

Original post

Gemini 3.5 Flash is out, and it's a major jump over Gemini 3 Flash in model capability for knowledge work. We've been evaluating it on our Box AI Complex Work Eval in early release, and the model delivers a 12 percentage point jump on complex document tasks.

For testing this model, we give the Box AI Agent (using Gemini 3.5) complex problems to solve that represent common but difficult knowledge worker tasks in banking, consulting, public sector, healthcare, and other industries. These tasks can be things like drafting reports, doing due diligence, and more, given a set of relevant documents.

In our tests, Gemini 3.5 Flash delivered jumps across every industry, including:

  • Financial services: 81% vs 73% (+8pp)
  • Public sector: 76% vs 59% (+17pp)
  • Healthcare: 73% vs 51% (+22pp)
  • Life Sciences: 67% vs 47% (+20pp)
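
The figures above are percentage-point (pp) differences, which are easy to confuse with relative improvements. A minimal sketch, not part of the original post, that recomputes both from the quoted scores; the `scores` dictionary and the `gains` helper are illustrative names, and the relative-gain figures are derived here rather than reported by Box:

```python
# Scores quoted in the post, in percent: (Gemini 3.5 Flash, Gemini 3 Flash).
scores = {
    "Financial services": (81, 73),
    "Public sector": (76, 59),
    "Healthcare": (73, 51),
    "Life Sciences": (67, 47),
}


def gains(new, old):
    """Return (percentage-point gain, relative gain in percent)."""
    pp = new - old                  # absolute difference between two percentages
    relative = 100.0 * pp / old     # improvement relative to the older score
    return pp, relative


# Hypothetical helper output: industry -> (pp gain, relative gain).
results = {name: gains(new, old) for name, (new, old) in scores.items()}

for name, (pp, rel) in results.items():
    print(f"{name}: +{pp}pp ({rel:.1f}% relative)")
```

A lower baseline makes the same pp gain a larger relative improvement, so the +22pp healthcare gain from 51% is proportionally much bigger than the +8pp financial services gain from 73%.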

Incredible to see the continued performance gains.

Gemini 3.5 Flash will be available soon in Box AI Studio and through the Box API. The Box MCP Server will soon be available in the Gemini app with more details to come.

likes: 144 | retweets: 11 | replies: 23 | views: 14052