metadata
license: apache-2.0
Long-VITA-128K
GitHub: https://github.com/VITA-MLLM/Long-VITA
- These are the converted weights from https://huggingface.co/VITA-MLLM/Long-VITA-128K.
- The original weights were trained on Ascend NPUs with MindSpeed.
- These weights support inference and evaluation with Megatron and Transformer Engine on NVIDIA GPUs.