Open-sourcing Marlin-2B, a vision-language model for extracting structured information from videos. The model is fine-tuned for two questions.
RT @HappyyPablo: open sourcing Marlin-2B 🐟 a tiny VLM to extract structured information from videos
Marlin is finetuned for two questions…
likes: 2840 | retweets: 347 | replies: 91 | views: 151010