NVIDIA宣布其Nemotron 3 Super模型在EnterpriseOps-Gym排行榜开源类别中排名第一。该排行榜通过1150项任务和512个功能工具评估企业级AI代理性能。
Benchmarks should reflect real-world performance.
That’s why we’re excited to share that Nemotron 3 Super has topped the open source category on the EnterpriseOps-Gym leaderboard.
This agentic gauntlet evaluates performance across 1,150 tasks in fully interactive environments with 512 functional tools, requiring agents to coordinate across multiple enterprise systems and tools to complete a single workflow.
📊 https://t.co/wt54NRNgeK
likes: 153 | retweets: 21 | replies: 23 | views: 12357