Hugging Face的TRL库发布v1.4版本,新增chunked NLL损失用于监督微调,使用更少显存且速度更快,并提及Qwen3模型。
RT @QGallouedec: TRL v1.4 is out! two things I'm excited about:
→ chunked NLL loss for SFT. Way less VRAM, same loss, often faster. Qwen3-…
likes: 43 | retweets: 8 | replies: 4 | views: 13865