ScreenAgent: A Vision Language Model-driven Computer Control Agent Paper • 2402.07945 • Published Feb 9, 2024
Towards Next-Generation LLM-based Recommender Systems: A Survey and Beyond Paper • 2410.19744 • Published Oct 10, 2024
ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World Paper • 2505.19095 • Published May 25 • 1