NVIDIA在首个智能体AI基准测试AA-AgentPerf中取得领先性能。该基准由Artificial Analysis推出,为行业首个多供应商开放基准,旨在衡量推理系统在智能体任务下的表现。
AI agents have fundamentally changed the complexity of inference workloads. Until now, the industry has struggled to define a standard for measuring how...AI agents have fundamentally changed the complexity of inference workloads. Until now, the industry has struggled to define a standard for measuring how inference systems perform under these conditions. Artificial Analysis AgentPerf (AA-AgentPerf) offers the industry’s first multi-vendor open benchmarks profiling trajectories that are representative of real-world AI agent coding tasks.
Source