Multimodal - MLX Collection Language Models that takes vision input and/or audio input, hand picked by Nexa Team. • 9 items • Updated Nov 25, 2025 • 3
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 11 items • Updated about 20 hours ago • 550