One Vision-Language-Action Model for GUI Agent
Qinghong (Kevin) Lin
KevinQHLin
AI & ML interests
Vision-Language Model, Video Understanding, Human-AI Interaction
Recent Activity
upvoted
a
collection
about 8 hours ago
OpenScholar_V1
authored
a paper
about 10 hours ago
Code2World: A GUI World Model via Renderable Code Generation
upvoted
a
paper
about 12 hours ago
Improving Data and Reward Design for Scientific Reasoning in Large Language Models