MingComplex commited on
Commit
2eebca9
·
1 Parent(s): 4d50b8d

update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -0
README.md CHANGED
@@ -16,6 +16,9 @@ library_name: transformers
16
 
17
  UI-TARS is a next-generation native GUI agent model designed to interact seamlessly with graphical user interfaces (GUIs) using human-like perception, reasoning, and action capabilities. Unlike traditional modular frameworks, UI-TARS integrates all key components—perception, reasoning, grounding, and memory—within a single vision-language model (VLM), enabling end-to-end task automation without predefined workflows or manual rules.
18
  ![Local Image](figures/UI-TARS.png)
 
 
 
19
  ![Local Image](figures/UI-TARS-vs-Previous-SOTA.png)
20
 
21
  ## Core Features
 
16
 
17
  UI-TARS is a next-generation native GUI agent model designed to interact seamlessly with graphical user interfaces (GUIs) using human-like perception, reasoning, and action capabilities. Unlike traditional modular frameworks, UI-TARS integrates all key components—perception, reasoning, grounding, and memory—within a single vision-language model (VLM), enabling end-to-end task automation without predefined workflows or manual rules.
18
  ![Local Image](figures/UI-TARS.png)
19
+ <p align="center">
20
+ <img src="https://github.com/bytedance/UI-TARS/blob/main/figures/UI-TARS-vs-Previous-SOTA.png" width="80%"/>
21
+ <p>
22
  ![Local Image](figures/UI-TARS-vs-Previous-SOTA.png)
23
 
24
  ## Core Features