Update README.md
README.md
CHANGED
@@ -4,6 +4,35 @@ datasets:
- VITA-MLLM/Long-VITA-Training-Data
---

# Long-VITA-128K

GitHub: https://github.com/VITA-MLLM/Long-VITA

## 👀 Overview

Long-VITA is a strong long-context visual language model that supports more than 1 million tokens.

- These weights were trained on Ascend NPUs with MindSpeed.
- For inference and evaluation on NVIDIA GPUs, we also implement Long-VITA on Megatron with Transformer Engine.
- The converted weights are available at https://huggingface.co/VITA-MLLM/Long-VITA-128K_MG.
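
As a rough illustration of fetching the converted Megatron weights mentioned above, the sketch below uses `snapshot_download` from the `huggingface_hub` package. The helper names `repo_id_from_url` and `download_converted_weights` are hypothetical and not part of Long-VITA, and the sketch only downloads the repository files; loading them for inference presumably requires the code in the GitHub repository, which this README does not describe.

```python
from urllib.parse import urlparse

# URL of the converted weights, taken from the list above.
MG_URL = "https://huggingface.co/VITA-MLLM/Long-VITA-128K_MG"


def repo_id_from_url(url: str) -> str:
    """Extract the `owner/name` repo ID from a huggingface.co model URL."""
    parts = [p for p in urlparse(url).path.split("/") if p]
    if len(parts) < 2:
        raise ValueError(f"not a model repo URL: {url!r}")
    return "/".join(parts[:2])


def download_converted_weights(url: str, local_dir: str) -> str:
    """Download every file in the repo and return the local folder path.

    Hypothetical helper: requires `pip install huggingface_hub` and
    network access when it is actually called.
    """
    # Imported lazily so the URL helper works without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(repo_id=repo_id_from_url(url), local_dir=local_dir)
```

Calling `download_converted_weights(MG_URL, "./Long-VITA-128K_MG")` would then place the files in a local directory; the directory name here is an arbitrary choice.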
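
## 📈 Experimental Results

- **Comparison of image understanding**.

![image](https://cdn-uploads.huggingface.co/production/uploads/6280cf8f8d05930a9c5c4dc9/_cT5j81Xj_isGBSYWQpcD.png)

- **Comparison of video understanding**.

![image](https://cdn-uploads.huggingface.co/production/uploads/6280cf8f8d05930a9c5c4dc9/zrHiQ5j3hWLpGTacqxGmb.png)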
## ACCEPTABLE USE POLICY
Any license on the model is subject to your compliance with the Acceptable Use Policy, and You must not violate (or encourage or permit anyone else to violate) any term of the Acceptable Use Policy. Tencent reserves the right to update this Acceptable Use Policy from time to time.