Allen AI发布ArtifactLinker新系统,用于预测模型应该评估哪些基准,旨在解决当前模型只在部分基准上评估的问题。
RT @allen_ai: Most models are only evaluated on a fraction of the benchmarks out there.
ArtifactLinker, our new system, predicts which one…
likes: 70 | retweets: 10 | replies: 5 | views: 20907