BramVanroy committed
Update README.md
README.md CHANGED
@@ -53,12 +53,11 @@ Here is a break down of the training set (some data pages might not be available
 - [BramVanroy/dolly-15k-dutch](https://huggingface.co/datasets/BramVanroy/dolly-15k-dutch) (gpt-3.5-turbo; translated): 1.39%
 
 
-
 ## Training procedure
 
 The great [alignment handbook](https://github.com/huggingface/alignment-handbook/) was used for training, with a custom slurm script for compatibility with our cluster. It was trained in full, without LoRA or other adapters.
 
-The model was trained in bfloat16 with flash attention 2 and a context length of 8192.
+The model was trained in bfloat16 with flash attention 2 and a context length of 8192. You can find the [wandb logs](https://wandb.ai/bramvanroy/sft-geitje-ultra) here.
 
 Recipe used with the handbook:
 
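For context, the settings mentioned in the edited line (bfloat16, flash attention 2) map directly onto Hugging Face transformers loading arguments. Below is a minimal sketch of loading the model with those settings; the model ID is an assumption inferred from the wandb project name and is not stated in this commit.

```python
# Minimal sketch: load a model in bfloat16 with flash attention 2,
# mirroring the training settings described in the diff above.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# NOTE: assumed model ID, inferred from the wandb project "sft-geitje-ultra".
model_id = "BramVanroy/GEITje-7B-ultra-sft"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,               # trained in bfloat16
    attn_implementation="flash_attention_2",  # requires the flash-attn package
    device_map="auto",
)

# The model was trained with a context length of 8192 tokens.
prompt = "Schrijf een korte alinea over taalmodellen."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

Using `attn_implementation="flash_attention_2"` requires a supported GPU and the `flash-attn` package; dropping the argument falls back to the default attention implementation.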