Rewiring Experts on the Fly:Continuous Rerouting for Better Online Adaptation in Mixture-of-Expert models Paper • 2510.14853 • Published Oct 16, 2025 • 4
Efficient Parallel Samplers for Recurrent-Depth Models and Their Connection to Diffusion Language Models Paper • 2510.14961 • Published Oct 16, 2025 • 7
MedSAMix: A Training-Free Model Merging Approach for Medical Image Segmentation Paper • 2508.11032 • Published Aug 14, 2025 • 2
GPTailor: Large Language Model Pruning Through Layer Cutting and Stitching Paper • 2506.20480 • Published Jun 25, 2025 • 7