[email protected]
committed on
Commit · 8cf28e4
1 Parent(s): 00e9cf1
Add more details
README.md CHANGED
@@ -19,6 +19,8 @@ library_name: transformers
 
 # Model Overview
 
+*[NVLM 1.0 model](https://huggingface.co/nvidia/NVLM-D-72B) is trained with legacy [Megatron-LM](https://github.com/NVIDIA/Megatron-LM/tree/main/megatron/legacy). In this repo, we reproduce NVLM-1.0 results using the latest [Megatron-core training code](https://github.com/NVIDIA/Megatron-LM/tree/NVLM-1.0/examples/multimodal/nvlm) and share the Megatron-core model weights, training code, and evaluation scripts.*
+
 ## Description
 This family of models performs vision-language and text-only tasks including optical character recognition, multimodal reasoning, localization, common sense reasoning, world knowledge utilization, and coding.
 