laurentiubp
commited on
Commit
•
0f82ac1
1
Parent(s):
377f50a
Update README.md
Browse files
README.md
CHANGED
@@ -61,6 +61,8 @@ print(outputs[0]["generated_text"][len(prompt):])
|
|
61 |
|
62 |
The model was trained **without a prompt template**, only with raw text separated by BOS and EOS tokens.
|
63 |
|
|
|
|
|
64 |
Example:
|
65 |
|
66 |
```text
|
@@ -105,4 +107,4 @@ The following hyperparameters were used during training:
|
|
105 |
|
106 |
**Out-of-scope** Use in any manner that violates applicable laws or regulations (including trade compliance laws). Use in any other way that is prohibited by the Acceptable Use Policy and Llama 3 Community License. Use in languages other than English**.
|
107 |
|
108 |
-
**Note: Developers may fine-tune Llama 3 models for languages beyond English provided they comply with the Llama 3 Community License and the Acceptable Use Policy.
|
|
|
61 |
|
62 |
The model was trained **without a prompt template**, only with raw text separated by BOS and EOS tokens.
|
63 |
|
64 |
+
The model was trained for two epochs on **6x A100 80GB GPUs using DeepSpeed ZeRO** State-3 without CPU offloading.
|
65 |
+
|
66 |
Example:
|
67 |
|
68 |
```text
|
|
|
107 |
|
108 |
**Out-of-scope** Use in any manner that violates applicable laws or regulations (including trade compliance laws). Use in any other way that is prohibited by the Acceptable Use Policy and Llama 3 Community License. Use in languages other than English**.
|
109 |
|
110 |
+
**Note: Developers may fine-tune Llama 3 models for languages beyond English provided they comply with the Llama 3 Community License and the Acceptable Use Policy.
|