Hugging Face 发布 physics-intern 科学问题测试框架,该框架使 Gemini 3.1 Pro 模型在科学问题上的性能从 17.7 提升至 31。
RT @lvwerra: We released physics-intern: a simple harness for science problems!
It gets models like Gemini 3.1 Pro to go from 17.7 -> 31.…
likes: 449 | retweets: 58 | replies: 24 | views: 69648