view article Article 📌 Rethinking Multimodality from an Industry Perspective: Captioning Is Far More Important Than You Think 10 days ago • 3
CaptionQA: Is Your Caption as Useful as the Image Itself? Paper • 2511.21025 • Published 13 days ago • 25
AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders Paper • 2510.19779 • Published Oct 22 • 60
QuantV2X: A Fully Quantized Multi-Agent System for Cooperative Perception Paper • 2509.03704 • Published Sep 3 • 2
SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity Paper • 2506.16500 • Published Jun 19 • 17
Angles Don't Lie: Unlocking Training-Efficient RL Through the Model's Own Signals Paper • 2506.02281 • Published Jun 2 • 4
Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation Paper • 2505.18875 • Published May 24 • 42
Learning Adaptive Parallel Reasoning with Language Models Paper • 2504.15466 • Published Apr 21 • 44
Immiscible Diffusion: Accelerating Diffusion Training with Noise Assignment Paper • 2406.12303 • Published Jun 18, 2024 • 4
Looking Backward: Streaming Video-to-Video Translation with Feature Banks Paper • 2405.15757 • Published May 24, 2024 • 15
StreamDiffusion: A Pipeline-level Solution for Real-time Interactive Generation Paper • 2312.12491 • Published Dec 19, 2023 • 74
3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features Paper • 2311.04391 • Published Nov 7, 2023 • 14