Ashegh-Sad-Warrior commited on
Commit
54b6352
1 Parent(s): 1814350

End of training

Browse files
README.md CHANGED
@@ -1,10 +1,10 @@
1
  ---
2
- base_model: distilbert/distilgpt2
3
- datasets:
4
- - eli5_category
5
  license: apache-2.0
 
6
  tags:
7
  - generated_from_trainer
 
 
8
  model-index:
9
  - name: my_awesome_eli5_clm_model
10
  results: []
@@ -13,12 +13,14 @@ model-index:
13
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
14
  should probably proofread and complete it, then remove this comment. -->
15
 
 
 
16
  [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>]()
17
  # my_awesome_eli5_clm_model
18
 
19
  This model is a fine-tuned version of [distilbert/distilgpt2](https://huggingface.co/distilbert/distilgpt2) on the eli5_category dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 3.8074
22
 
23
  ## Model description
24
 
@@ -49,9 +51,9 @@ The following hyperparameters were used during training:
49
 
50
  | Training Loss | Epoch | Step | Validation Loss |
51
  |:-------------:|:-----:|:----:|:---------------:|
52
- | 3.8236 | 1.0 | 565 | 3.8062 |
53
- | 3.8259 | 2.0 | 1130 | 3.8072 |
54
- | 3.7824 | 3.0 | 1695 | 3.8074 |
55
 
56
 
57
  ### Framework versions
 
1
  ---
 
 
 
2
  license: apache-2.0
3
+ base_model: distilbert/distilgpt2
4
  tags:
5
  - generated_from_trainer
6
+ datasets:
7
+ - eli5_category
8
  model-index:
9
  - name: my_awesome_eli5_clm_model
10
  results: []
 
13
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
14
  should probably proofread and complete it, then remove this comment. -->
15
 
16
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>]()
17
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>]()
18
  [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>]()
19
  # my_awesome_eli5_clm_model
20
 
21
  This model is a fine-tuned version of [distilbert/distilgpt2](https://huggingface.co/distilbert/distilgpt2) on the eli5_category dataset.
22
  It achieves the following results on the evaluation set:
23
+ - Loss: 3.8084
24
 
25
  ## Model description
26
 
 
51
 
52
  | Training Loss | Epoch | Step | Validation Loss |
53
  |:-------------:|:-----:|:----:|:---------------:|
54
+ | 3.9747 | 1.0 | 565 | 3.8206 |
55
+ | 3.8978 | 2.0 | 1130 | 3.8102 |
56
+ | 3.8566 | 3.0 | 1695 | 3.8084 |
57
 
58
 
59
  ### Framework versions
runs/Aug04_09-01-54_e8185cfad283/events.out.tfevents.1722762344.e8185cfad283.34.3 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:72a95739ed57cd5e07763aa7f9a40afce9c4459d5c3719caf17883f3a255c5db
3
- size 6396
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e1c907e363eaa7128ab14c8be29910db29efd9a35592cfaf0bfeff08181777ac
3
+ size 7021