Google DeepMind 发布 Gemini Omni 模型,这是首个能从任何输入生成任何输出的模型,首先从视频开始。该模型结合了 Gemini 的智能与生成媒体系统,代表了世界理解、多模态和编辑能力的飞跃。
We’re dropping Gemini Omni: our first step towards a model that can create anything from anything - starting with video.
It combines Gemini’s intelligence with our generative media systems - representing a leap forward in world understanding, multimodality, and editing 🧵
likes: 6053 | retweets: 851 | replies: 197 | views: 536535