HuggingFaceTB/SmolVLM-Synthetic
Image-Text-to-Text
•
2B
•
Updated
•
131
•
12
Exploring smol models (for text, vision and video) and high quality web and synthetic datasets