MIND-V: Hierarchical Video Generation for Long-Horizon Robotic Manipulation with RL-based Physical Alignment Paper • 2512.06628 • Published 4 days ago • 9
MIND-V: Hierarchical Video Generation for Long-Horizon Robotic Manipulation with RL-based Physical Alignment Paper • 2512.06628 • Published 4 days ago • 9
AnyTalker: Scaling Multi-Person Talking Video Generation with Interactivity Refinement Paper • 2511.23475 • Published 12 days ago • 41 • 4
AnyTalker: Scaling Multi-Person Talking Video Generation with Interactivity Refinement Paper • 2511.23475 • Published 12 days ago • 41
Controllable Layer Decomposition for Reversible Multi-Layer Image Generation Paper • 2511.16249 • Published 20 days ago • 8
Controllable Layer Decomposition for Reversible Multi-Layer Image Generation Paper • 2511.16249 • Published 20 days ago • 8 • 2
GIR-Bench: Versatile Benchmark for Generating Images with Reasoning Paper • 2510.11026 • Published Oct 13 • 17
Hyper-Bagel: A Unified Acceleration Framework for Multimodal Understanding and Generation Paper • 2509.18824 • Published Sep 23 • 22