TongUI Collection Open source our work TongUI: Building Generalized GUI Agents by Learning from Multimodal Web Tutorials; https://github.com/TongUI-agent/TongUI-agent • 13 items • Updated Nov 3, 2025 • 3
TongUI Collection Open source our work TongUI: Building Generalized GUI Agents by Learning from Multimodal Web Tutorials; https://github.com/TongUI-agent/TongUI-agent • 13 items • Updated Nov 3, 2025 • 3
Iterative Tool Usage Exploration for Multimodal Agents via Step-wise Preference Tuning Paper • 2504.21561 • Published Apr 30, 2025 • 1
Chain-of-Focus: Adaptive Visual Search and Zooming for Multimodal Reasoning via RL Paper • 2505.15436 • Published May 21, 2025 • 2
Iterative Tool Usage Exploration for Multimodal Agents via Step-wise Preference Tuning Paper • 2504.21561 • Published Apr 30, 2025 • 1
Efficient Multi-turn RL for GUI Agents via Decoupled Training and Adaptive Data Curation Paper • 2509.23866 • Published Sep 28, 2025 • 14
Efficient Multi-turn RL for GUI Agents via Decoupled Training and Adaptive Data Curation Paper • 2509.23866 • Published Sep 28, 2025 • 14