LongVideoAgent: Multi-Agent Reasoning with Long Videos Paper • 2512.20618 • Published 5 days ago • 49
Robust-R1: Degradation-Aware Reasoning for Robust Visual Understanding Paper • 2512.17532 • Published 9 days ago • 63
GPT as Psychologist? Preliminary Evaluations for GPT-4V on Visual Affective Computing Paper • 2403.05916 • Published Mar 9, 2024
Activation-aware Probe-Query: Effective Key-Value Retrieval for Long-Context LLMs Inference Paper • 2502.13542 • Published Feb 19 • 1
LPO: Towards Accurate GUI Agent Interaction via Location Preference Optimization Paper • 2506.09373 • Published Jun 11
ACE: Attribution-Controlled Knowledge Editing for Multi-hop Factual Recall Paper • 2510.07896 • Published Oct 9 • 1
An Incremental Unified Framework for Small Defect Inspection Paper • 2312.08917 • Published Dec 14, 2023
Learning to Remove Wrinkled Transparent Film with Polarized Prior Paper • 2403.04368 • Published Mar 7, 2024
Hawk: Learning to Understand Open-World Video Anomalies Paper • 2405.16886 • Published May 27, 2024 • 1