jordanfan committed on
Commit
422c3f8
1 Parent(s): 24617b0

training completed[dev]: 1024 128

README.md CHANGED
@@ -18,12 +18,12 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [facebook/bart-large](https://huggingface.co/facebook/bart-large) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.9368
- - Rouge1: 0.7111
- - Rouge2: 0.4588
- - Rougel: 0.6541
- - Rougelsum: 0.6542
- - Wer: 0.433
 
  ## Model description
 
@@ -48,20 +48,28 @@ The following hyperparameters were used during training:
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - num_epochs: 1
  - mixed_precision_training: Native AMP
 
  ### Training results
 
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Wer |
  |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:------:|
- | No log | 0.13 | 250 | 1.1442 | 0.6749 | 0.4062 | 0.6133 | 0.6132 | 0.4806 |
- | 2.053 | 0.27 | 500 | 1.0353 | 0.6859 | 0.4274 | 0.6269 | 0.6269 | 0.4586 |
- | 2.053 | 0.4 | 750 | 1.0013 | 0.6935 | 0.4384 | 0.6351 | 0.6352 | 0.4499 |
- | 1.1091 | 0.53 | 1000 | 0.9866 | 0.7003 | 0.4467 | 0.6416 | 0.6417 | 0.4425 |
- | 1.1091 | 0.66 | 1250 | 0.9591 | 0.7052 | 0.4512 | 0.6469 | 0.647 | 0.4386 |
- | 1.0491 | 0.8 | 1500 | 0.9502 | 0.7035 | 0.4517 | 0.6469 | 0.647 | 0.4366 |
- | 1.0491 | 0.93 | 1750 | 0.9368 | 0.7111 | 0.4588 | 0.6541 | 0.6542 | 0.433 |
 
 
  ### Framework versions
 
 
  This model is a fine-tuned version of [facebook/bart-large](https://huggingface.co/facebook/bart-large) on an unknown dataset.
  It achieves the following results on the evaluation set:
+ - Loss: 0.8876
+ - Rouge1: 0.7224
+ - Rouge2: 0.4761
+ - Rougel: 0.6677
+ - Rougelsum: 0.6675
+ - Wer: 0.4176
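
The card does not say how the `Wer` figure was computed. As a point of reference, word error rate is conventionally the word-level edit distance between a hypothesis and its reference, divided by the number of reference words. The following is a minimal pure-Python sketch under that assumption; the function name `word_error_rate` and the example sentences are illustrative and are not taken from the training code.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] is the edit distance between the reference words consumed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in the hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One dropped word out of six reference words.
    print(word_error_rate("the cat sat on the mat", "the cat sat on mat"))
```

Under this definition lower is better, which matches the direction of the change from 0.433 to 0.4176 reported in this commit.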
 
  ## Model description
 
 
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
+ - num_epochs: 2
  - mixed_precision_training: Native AMP
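
Because `lr_scheduler_type` is `linear`, changing `num_epochs` from 1 to 2 typically changes the decay horizon, so the learning rate at a given step differs between the two runs. The sketch below illustrates a linear warmup-then-decay schedule in plain Python. The learning rate, warmup length, and total step counts are not visible in this hunk, so `base_lr`, `warmup_steps`, and the step totals used here are hypothetical placeholders.

```python
def linear_lr(step: int, total_steps: int, base_lr: float, warmup_steps: int = 0) -> float:
    """Learning rate at `step` for linear warmup followed by linear decay to zero."""
    if step < warmup_steps:
        # Ramp up from 0 to base_lr over the warmup phase.
        return base_lr * step / max(1, warmup_steps)
    # Decay linearly so the rate reaches 0 at total_steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    base_lr = 5e-5          # hypothetical; not stated in the visible diff
    one_epoch_total = 1880  # hypothetical step count for a 1-epoch run
    two_epoch_total = 3760  # hypothetical step count for a 2-epoch run
    for step in (250, 1000, 1750):
        print(
            step,
            linear_lr(step, one_epoch_total, base_lr),
            linear_lr(step, two_epoch_total, base_lr),
        )
```

With these placeholder values, the two-epoch schedule holds a higher learning rate at every shared step, since it decays over twice as many steps.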
 
  ### Training results
 
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Wer |
  |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:------:|
+ | No log | 0.13 | 250 | 1.1438 | 0.6714 | 0.403 | 0.61 | 0.6098 | 0.4822 |
+ | 2.0429 | 0.27 | 500 | 1.0396 | 0.6869 | 0.4286 | 0.6276 | 0.6274 | 0.4574 |
+ | 2.0429 | 0.4 | 750 | 1.0071 | 0.6941 | 0.4396 | 0.636 | 0.6359 | 0.4501 |
+ | 1.1127 | 0.53 | 1000 | 0.9806 | 0.7006 | 0.445 | 0.6414 | 0.6413 | 0.444 |
+ | 1.1127 | 0.66 | 1250 | 0.9681 | 0.7001 | 0.4471 | 0.6423 | 0.6423 | 0.4404 |
+ | 1.0522 | 0.8 | 1500 | 0.9541 | 0.7026 | 0.4502 | 0.646 | 0.646 | 0.4375 |
+ | 1.0522 | 0.93 | 1750 | 0.9325 | 0.7125 | 0.461 | 0.6565 | 0.6564 | 0.431 |
+ | 1.0094 | 1.06 | 2000 | 0.9239 | 0.7069 | 0.4593 | 0.652 | 0.6519 | 0.429 |
+ | 1.0094 | 1.2 | 2250 | 0.9168 | 0.71 | 0.4631 | 0.6545 | 0.6544 | 0.4265 |
+ | 0.9166 | 1.33 | 2500 | 0.9095 | 0.7181 | 0.4701 | 0.6631 | 0.663 | 0.4238 |
+ | 0.9166 | 1.46 | 2750 | 0.9051 | 0.7147 | 0.4679 | 0.6595 | 0.6594 | 0.422 |
+ | 0.9135 | 1.6 | 3000 | 0.8989 | 0.7227 | 0.4747 | 0.6673 | 0.6672 | 0.4203 |
+ | 0.9135 | 1.73 | 3250 | 0.9006 | 0.7144 | 0.4696 | 0.6603 | 0.6603 | 0.4194 |
+ | 0.8846 | 1.86 | 3500 | 0.8868 | 0.7199 | 0.4746 | 0.6656 | 0.6655 | 0.4176 |
+ | 0.8846 | 1.99 | 3750 | 0.8876 | 0.7224 | 0.4761 | 0.6677 | 0.6675 | 0.4176 |
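
The headline metrics in the card match the final row of the two-epoch table above, which is not necessarily the best row on every metric. The sketch below shows one way to check that, using the step, validation loss, Rouge1, and Wer columns copied from that table. The helper `best_row` is a hypothetical name introduced only for this illustration.

```python
# (step, validation_loss, rouge1, wer) copied from the two-epoch table.
ROWS = [
    (250, 1.1438, 0.6714, 0.4822),
    (500, 1.0396, 0.6869, 0.4574),
    (750, 1.0071, 0.6941, 0.4501),
    (1000, 0.9806, 0.7006, 0.4440),
    (1250, 0.9681, 0.7001, 0.4404),
    (1500, 0.9541, 0.7026, 0.4375),
    (1750, 0.9325, 0.7125, 0.4310),
    (2000, 0.9239, 0.7069, 0.4290),
    (2250, 0.9168, 0.7100, 0.4265),
    (2500, 0.9095, 0.7181, 0.4238),
    (2750, 0.9051, 0.7147, 0.4220),
    (3000, 0.8989, 0.7227, 0.4203),
    (3250, 0.9006, 0.7144, 0.4194),
    (3500, 0.8868, 0.7199, 0.4176),
    (3750, 0.8876, 0.7224, 0.4176),
]


def best_row(rows, column, higher_is_better):
    """Return the first row with the best value in the given column."""
    pick = max if higher_is_better else min
    return pick(rows, key=lambda row: row[column])


best_loss = best_row(ROWS, column=1, higher_is_better=False)
best_rouge1 = best_row(ROWS, column=2, higher_is_better=True)
final = ROWS[-1]

if __name__ == "__main__":
    print("lowest validation loss:", best_loss)
    print("highest Rouge1:", best_rouge1)
    print("final checkpoint:", final)
```

On these numbers the lowest validation loss is at step 3500 (0.8868) and the highest Rouge1 is at step 3000 (0.7227), while the reported results correspond to step 3750.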
 
 
  ### Framework versions
runs/Mar16_04-30-14_4b9235023404/events.out.tfevents.1710563422.4b9235023404.4117.0 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:f441b1876f20c08fbe454ba8cde4686bbb861d9d0820e875e12c1bf6f3b2f676
- size 13312
+ oid sha256:83ba889dec2622cd8591745f635ec40db3fe8c775296e989703e4b5f00749e60
+ size 15440
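
The TensorBoard event file is stored through Git LFS, so the diff only shows its pointer file: a spec version, a SHA-256 object ID, and a byte size. As a rough sketch of how such a pointer could be read, the snippet below parses the three `key value` lines. The function name `parse_lfs_pointer` is hypothetical; the pointer contents are copied from this commit.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the `key value` lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algorithm, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algorithm": algorithm,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }


OLD_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:f441b1876f20c08fbe454ba8cde4686bbb861d9d0820e875e12c1bf6f3b2f676
size 13312
"""

NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:83ba889dec2622cd8591745f635ec40db3fe8c775296e989703e4b5f00749e60
size 15440
"""

old = parse_lfs_pointer(OLD_POINTER)
new = parse_lfs_pointer(NEW_POINTER)

if __name__ == "__main__":
    print("size change in bytes:", new["size"] - old["size"])
```

The event file grows by 2128 bytes in this commit, which is consistent with more evaluation steps being logged, although the pointer alone does not show what was added.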