SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics Paper β’ 2506.01844 β’ Published Jun 2 β’ 143
DeepSeek-V3.1 Collection DeepSeek's new 3.1 update to their V3 models! β’ 6 items β’ Updated 9 days ago β’ 7
Comment on The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity Paper β’ 2506.09250 β’ Published Jun 10 β’ 27
Unsloth Dynamic 2.0 Quants Collection New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. β’ 59 items β’ Updated 9 days ago β’ 260
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities Paper β’ 2505.02567 β’ Published May 5 β’ 80
Discovering symbolic expressions with parallelized tree search Paper β’ 2407.04405 β’ Published Jul 5, 2024 β’ 1
Models for Open Hands (Open Devin) Collection Models for Open Hands(Open Devin) trained on the Devinator Dataset. β’ 3 items β’ Updated Oct 25, 2024 β’ 3
LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea Generation with Minimal Context Paper β’ 2412.17596 β’ Published Dec 23, 2024 β’ 6
YuLan-Mini Collection A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details. β’ 6 items β’ Updated Apr 14 β’ 16
view article Article πΊπ¦ββ¬ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs Dec 4, 2024 β’ 80