shenyunhang commited on
Commit
f5c166d
·
verified ·
1 Parent(s): 20495b2

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +29 -0
README.md CHANGED
@@ -4,6 +4,35 @@ datasets:
4
  - VITA-MLLM/Long-VITA-Training-Data
5
  ---
6
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
7
  ## ACCEPTABLE USE POLICY
8
 
9
  Any license on the model is subject to your compliance with the Acceptable Use Policy, and You must not violate (or encourage or permit anyone else to violate) any term of the Acceptable Use Policy. Tencent reserves the right to update this Acceptable Use Policy from time to time.
 
4
  - VITA-MLLM/Long-VITA-Training-Data
5
  ---
6
 
7
+
8
+ # Long-VITA-128K
9
+
10
+ Github: https://github.com/VITA-MLLM/Long-VITA
11
+
12
+ ## 👀 Overview
13
+ Long-VITA is a strong long-context visual language model and supports more than 1 million tokens.
14
+
15
+ - This weight is trained on Ascend NPU with MindSpeed.
16
+
17
+ - To infer and evaluate on Nvidia GPU, we also implement Long-VITA on Megatron with Transformer Engine.
18
+
19
+ - The converted weight is in https://huggingface.co/VITA-MLLM/Long-VITA-128K_MG.
20
+
21
+
22
+ ## 📈 Experimental Results
23
+ - **Comparison of image understanding**.
24
+
25
+ ![image](https://github.com/user-attachments/assets/30f62f51-675e-4dac-9f18-f743c311f9be)
26
+
27
+
28
+
29
+ - **Comparison of video understanding**.
30
+
31
+ ![image](https://github.com/user-attachments/assets/fee848d5-da20-4a30-9172-2ec9746ada25)
32
+
33
+
34
+
35
+
36
  ## ACCEPTABLE USE POLICY
37
 
38
  Any license on the model is subject to your compliance with the Acceptable Use Policy, and You must not violate (or encourage or permit anyone else to violate) any term of the Acceptable Use Policy. Tencent reserves the right to update this Acceptable Use Policy from time to time.