开源了一个名为Marlin-2B的小型视觉语言模型,用于从视频中提取结构化信息。
RT @HappyyPablo: open sourcing Marlin-2B 🐟 a tiny VLM to extract structured information from videos
Marlin is finetuned for two questions…
likes: 130 | retweets: 33 | replies: 6 | views: 5261