Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning Paper • 2512.07461 • Published about 24 hours ago • 38
RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling Paper • 2506.08672 • Published Jun 10 • 30
Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space Paper • 2505.13308 • Published May 19 • 27
Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper • 2505.03335 • Published May 6 • 188
OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts Paper • 2503.22952 • Published Mar 29 • 17
Revisiting a Pain in the Neck: Semantic Phrase Processing Benchmark for Language Models Paper • 2405.02861 • Published May 5, 2024 • 1
MindMap: Knowledge Graph Prompting Sparks Graph of Thoughts in Large Language Models Paper • 2308.09729 • Published Aug 17, 2023 • 6