catallama/CataLlama-v0.2-Instruct-DPO

laurentiubp committed on Jul 15, 2024

Commit 7e7bcd5 · verified · 1 Parent(s): 399ba57

Update README.md

Files changed (1)

README.md +0 -2

@@ -98,8 +98,6 @@ The model was trained **with the same prompt template of Llama-3 Instruct**.
 The model was trained for two epochs on **8x A100 80GB GPUs using DeepSpeed ZeRO** State-3 without CPU offloading.
-Then training lasted approximately 3 hours for a total GPU cost of 45€.
 ### Training hyperparameters

