LEGO-Eval: Towards Fine-Grained Evaluation on Synthesizing 3D Embodied Environments with Tool Augmentation Paper • 2511.03001 • Published Nov 4, 2025 • 46
BESPOKE: Benchmark for Search-Augmented Large Language Model Personalization via Diagnostic Feedback Paper • 2509.21106 • Published Sep 25, 2025 • 7
BESPOKE: Benchmark for Search-Augmented Large Language Model Personalization via Diagnostic Feedback Paper • 2509.21106 • Published Sep 25, 2025 • 7
THEANINE: Revisiting Memory Management in Long-term Conversations with Timeline-augmented Response Generation Paper • 2406.10996 • Published Jun 16, 2024 • 35
PRINCIPLES: Synthetic Strategy Memory for Proactive Dialogue Agents Paper • 2509.17459 • Published Sep 22, 2025