Cohere 的 Command A+ 模型在 Hugging Face 上线,支持 W4A4 量化,可大幅降低服务占用且几乎无性能损失。
RT @cohere: Command A+ is available on @huggingface with W4A4 quantization 🤗
Cut your serving footprint dramatically with virtually zero p…
likes: 123 | retweets: 20 | replies: 8 | views: 26024