Towards Faithful and Controllable Personalization via Critique-Post-Edit Reinforcement Learning Paper • 2510.18849 • Published Oct 21, 2025 • 20
A^2FM: An Adaptive Agent Foundation Model for Tool-Aware Hybrid Reasoning Paper • 2510.12838 • Published Oct 13, 2025 • 24
ACADREASON: Exploring the Limits of Reasoning Models with Academic Research Problems Paper • 2510.11652 • Published Oct 13, 2025 • 29
Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL Paper • 2508.13167 • Published Aug 6, 2025 • 129