MingComplex commited on
Commit
083041d
·
1 Parent(s): ab8da76

update readme

Browse files
Files changed (1) hide show
  1. README.md +7 -1
README.md CHANGED
@@ -11,7 +11,13 @@ library_name: transformers
11
 
12
 
13
  # UI-TARS-2B-SFT
14
-
 
 
 
 
 
 
15
  ## Introduction
16
 
17
  UI-TARS is a next-generation native GUI agent model designed to interact seamlessly with graphical user interfaces (GUIs) using human-like perception, reasoning, and action capabilities. Unlike traditional modular frameworks, UI-TARS integrates all key components—perception, reasoning, grounding, and memory—within a single vision-language model (VLM), enabling end-to-end task automation without predefined workflows or manual rules.
 
11
 
12
 
13
  # UI-TARS-2B-SFT
14
+ [UI-TARS-2B-SFT](https://huggingface.co/bytedance-research/UI-TARS-2B-SFT)  | 
15
+ [**UI-TARS-2B-gguf**](https://huggingface.co/bytedance-research/UI-TARS-2B-gguf)  | 
16
+ [**UI-TARS-7B-SFT**](https://huggingface.co/bytedance-research/UI-TARS-7B-SFT)  | 
17
+ [**UI-TARS-7B-DPO**](https://huggingface.co/bytedance-research/UI-TARS-7B-DPO)  | 
18
+ [**UI-TARS-7B-gguf**](https://huggingface.co/bytedance-research/UI-TARS-7B-gguf)  | 
19
+ [**UI-TARS-72B-SFT**](https://huggingface.co/bytedance-research/UI-TARS-72B-SFT)  | 
20
+ [**UI-TARS-72B-DPO**](https://huggingface.co/bytedance-research/UI-TARS-72B-DPO)
21
  ## Introduction
22
 
23
  UI-TARS is a next-generation native GUI agent model designed to interact seamlessly with graphical user interfaces (GUIs) using human-like perception, reasoning, and action capabilities. Unlike traditional modular frameworks, UI-TARS integrates all key components—perception, reasoning, grounding, and memory—within a single vision-language model (VLM), enabling end-to-end task automation without predefined workflows or manual rules.