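NVIDIA announced that the Step 3.7 Flash model has been released: a 198B-parameter MoE architecture with 11B active parameters, 256K context, and native image and video processing. It is available now on GPU-accelerated endpoints, can be deployed with NVIDIA NIM inference microservices, and can be fine-tuned with the NVIDIA NeMo framework.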
Step 3.7 Flash is here
ICYMI: 198B MoE with 11B active params, 256K context, native image + video support.
Day 0 support is live on https://t.co/6T0R9P778k with GPU-accelerated endpoints. Deploy with NVIDIA NIM inference microservices and fine-tune with the NVIDIA NeMo framework.
Congrats to the @stepfun_ai team!