isaacus/open-australian-legal-gpt2



umarbutler committed on Nov 28, 2023

Commit 33e3c53 · 1 parent: 36ffb69

Update README.md

Files changed (1)

README.md +0 -1


@@ -55,7 +55,6 @@ The training dataset was subsequently fed to [GPT2](https://huggingface.co/gpt2)
 | Batch size per device | 4 |
 | Weight decay | 0.01 |
 | Warmup ratio | 0.06 |
-| Gradient accumulation steps | 1 |
 After training for 3 epochs, or 465,441 steps, over a period of ~25 hours on two GeForce RTX 4090s, the model achieved a loss of 0.61.
