OpenAI宣布向ChatGPT Plus和Team用户以及API Tier 5开发者推出o1-preview和o1-mini模型。这些模型通过私有思维链进行训练,在回答前先思考,从而提升复杂推理能力,并带来安全性和对齐方面的新进展。
Today we rolled out OpenAI o1-preview and o1-mini to all ChatGPT Plus/Team users & Tier 5 developers in the API.
o1 marks the start of a new era in AI, where models are trained to "think" before answering through a private chain of thought. The more time they take to think, the better they handle complex reasoning. We're no longer limited by pretraining paradigm; now, we can scale through inference compute, opening up new possibilities for capabilities and alignment.
Chain-of-thought offers new opportunities for advances in safety and alignment research by making the model's reasoning transparent—allowing us to observe its thought process step by step—and enabling it to actively reason about safety rules, which makes it more resilient in unexpected or novel situations.
Looking forward to seeing what you build, create, and discover.
likes: 0 | retweets: 0 | replies: 0 | views: 401192