catallama
/

CataLlama-v0.1-Base

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

laurentiubp commited on May 26

Commit

0f82ac1

•

1 Parent(s): 377f50a

Update README.md

Files changed (1) hide show

README.md +3 -1

README.md CHANGED Viewed

@@ -61,6 +61,8 @@ print(outputs[0]["generated_text"][len(prompt):])
 The model was trained **without a prompt template**, only with raw text separated by BOS and EOS tokens.
 Example:
 ```text
@@ -105,4 +107,4 @@ The following hyperparameters were used during training:
 **Out-of-scope** Use in any manner that violates applicable laws or regulations (including trade compliance laws). Use in any other way that is prohibited by the Acceptable Use Policy and Llama 3 Community License. Use in languages other than English**.
-**Note: Developers may fine-tune Llama 3 models for languages beyond English provided they comply with the Llama 3 Community License and the Acceptable Use Policy.

 The model was trained **without a prompt template**, only with raw text separated by BOS and EOS tokens.
+The model was trained for two epochs on **6x A100 80GB GPUs using DeepSpeed ZeRO** State-3 without CPU offloading.
 Example:
 ```text
 **Out-of-scope** Use in any manner that violates applicable laws or regulations (including trade compliance laws). Use in any other way that is prohibited by the Acceptable Use Policy and Llama 3 Community License. Use in languages other than English**.
+**Note: Developers may fine-tune Llama 3 models for languages beyond English provided they comply with the Llama 3 Community License and the Acceptable Use Policy.