-
Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents
Paper • 2410.05243 • Published • 18 -
GPT-4V(ision) is a Generalist Web Agent, if Grounded
Paper • 2401.01614 • Published • 22 -
osunlp/UGround
Image-Text-to-Text • Updated • 1.69k • 21 -
osunlp/UGround-V1-2B
Image-Text-to-Text • Updated • 1.02k • 7
Boyu Gou
BoyuNLP
AI & ML interests
AI Agents, Foundation Models, GUI Agents
Recent Activity
liked
a dataset
1 day ago
osunlp/UGround-V1-Data
updated
a collection
1 day ago
My/Our Work
updated
a dataset
3 days ago
osunlp/UGround-V1-Data
Organizations
Collections
1
models
None public yet