BoghdadyJR commited on
Commit
cc0bcf1
·
verified ·
1 Parent(s): b989484

Model save

Browse files
README.md CHANGED
@@ -9,14 +9,14 @@ model-index:
9
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
10
  should probably proofread and complete it, then remove this comment. -->
11
 
12
- [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/boghdady95/huggingface/runs/uwzjiu3j)
13
- [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/boghdady95/huggingface/runs/uwzjiu3j)
14
- [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/boghdady95/huggingface/runs/uwzjiu3j)
15
  # gpt2
16
 
17
  This model was trained from scratch on an unknown dataset.
18
  It achieves the following results on the evaluation set:
19
- - Loss: 3.3085
20
 
21
  ## Model description
22
 
@@ -36,22 +36,19 @@ More information needed
36
 
37
  The following hyperparameters were used during training:
38
  - learning_rate: 2e-05
39
- - train_batch_size: 4
40
- - eval_batch_size: 8
41
  - seed: 42
42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
43
  - lr_scheduler_type: linear
44
- - num_epochs: 5
 
45
 
46
  ### Training results
47
 
48
  | Training Loss | Epoch | Step | Validation Loss |
49
  |:-------------:|:-----:|:-----:|:---------------:|
50
- | 3.2909 | 1.0 | 5000 | 3.3493 |
51
- | 3.2351 | 2.0 | 10000 | 3.3100 |
52
- | 3.2898 | 3.0 | 15000 | 3.3052 |
53
- | 3.1497 | 4.0 | 20000 | 3.3064 |
54
- | 3.1327 | 5.0 | 25000 | 3.3085 |
55
 
56
 
57
  ### Framework versions
 
9
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
10
  should probably proofread and complete it, then remove this comment. -->
11
 
12
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/boghdady95/huggingface/runs/lbif3rjw)
13
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/boghdady95/huggingface/runs/lbif3rjw)
14
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/boghdady95/huggingface/runs/lbif3rjw)
15
  # gpt2
16
 
17
  This model was trained from scratch on an unknown dataset.
18
  It achieves the following results on the evaluation set:
19
+ - Loss: 3.4161
20
 
21
  ## Model description
22
 
 
36
 
37
  The following hyperparameters were used during training:
38
  - learning_rate: 2e-05
39
+ - train_batch_size: 2
40
+ - eval_batch_size: 2
41
  - seed: 42
42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
43
  - lr_scheduler_type: linear
44
+ - num_epochs: 1
45
+ - mixed_precision_training: Native AMP
46
 
47
  ### Training results
48
 
49
  | Training Loss | Epoch | Step | Validation Loss |
50
  |:-------------:|:-----:|:-----:|:---------------:|
51
+ | 3.3059 | 1.0 | 10000 | 3.4161 |
 
 
 
 
52
 
53
 
54
  ### Framework versions
runs/Aug11_15-55-50_bf11bf8ef52d/events.out.tfevents.1723391750.bf11bf8ef52d.34.3 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:42cef9896e32b27e82012e06bbb49d8df4b1522a5ca507bee2657fa1ece2feda
3
- size 9264
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9ee56e4faf8a621047182570235c780e05fb6f3cd2fd72b4b445e519bd1f6a62
3
+ size 9889