MultiShotMaster: A Controllable Multi-Shot Video Generation Framework Paper • 2512.03041 • Published 9 days ago • 62
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models Paper • 2512.02556 • Published 9 days ago • 200
alimama-creative/FLUX.1-dev-Controlnet-Inpainting-Beta Image-to-Image • Updated Oct 12, 2024 • 12.2k • 419
DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning Paper • 2511.22570 • Published 14 days ago • 71
Running on Zero MCP Featured 1.58k Qwen Image Edit Camera Control 🎬 1.58k Fast 4 step inference with Qwen Image Edit 2509
Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds Paper • 2511.08892 • Published 30 days ago • 195
Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm Paper • 2511.04570 • Published Nov 6 • 208
The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain Paper • 2509.26507 • Published Sep 30 • 535
Latent Diffusion Model without Variational Autoencoder Paper • 2510.15301 • Published Oct 17 • 48
D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI Paper • 2510.05684 • Published Oct 7 • 141
Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation Paper • 2510.08673 • Published Oct 9 • 125