The vLLM project announced day-0 support for MTP (multi-token prediction) on Gemma4, with a ready-to-use Docker image and up to 3x faster decoding.
RT @vllm_project: 🚀 Day-0 MTP support for Gemma4 now available at vLLM with ready-to-use docker image!
⚡️Enjoy up to 3x faster decoding pe…
likes: 453 | retweets: 47 | replies: 9 | views: 37361