Update README.md
README.md
CHANGED
@@ -2,6 +2,8 @@
 license: apache-2.0
 datasets:
 - VITA-MLLM/Long-VITA-Training-Data
+
+
 ---


@@ -14,21 +16,26 @@ Long-VITA is a strong long-context visual language model and supports more than

 - This weight is trained on Ascend NPU with MindSpeed.

-- To infer and evaluate on Nvidia
-
-- The converted weight is in https://huggingface.co/VITA-MLLM/Long-VITA-128K_MG.


 ## 📈 Experimental Results
 - **Comparison of image understanding**.

-
+


 - **Comparison of video understanding**.

+
+
+
+
+
+- **Effectiveness of Logits-Masked LM Head**.
+
+


