Nemotron-Flash: Towards Latency-Optimal Hybrid Small Language Models Paper • 2511.18890 • Published 15 days ago • 29
Ministral 3 Collection A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated 6 days ago • 116
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 8 days ago • 227
Block Cascading: Training Free Acceleration of Block-Causal Video Models Paper • 2511.20426 • Published 13 days ago • 9
Unveiling Intrinsic Dimension of Texts: from Academic Abstract to Creative Story Paper • 2511.15210 • Published 20 days ago • 86
view article Article ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases Nov 5 • 53
view article Article The Heterogeneous Feature of RoPE-based Attention in Long-Context LLMs 24 days ago • 11
OlmoEarth: Stable Latent Image Modeling for Multimodal Earth Observation Paper • 2511.13655 • Published 21 days ago • 9
SYNTH Collection Fully generalist synthetic dataset and SOTA small reasoners • 3 items • Updated 29 days ago • 11
GVE Collection Towards General Video Embeddings: Models and Benchmarks • 4 items • Updated Nov 3 • 19
Reasoning Language Model Inference Serving Unveiled: An Empirical Study Paper • 2510.18672 • Published Oct 21 • 7
LightOnOCR Collection The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR • 7 items • Updated 26 days ago • 14