ahmedmbutt
/

PTS-Bart-Large-CNN

@@ -17,12 +17,12 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [facebook/bart-large-cnn](https://huggingface.co/facebook/bart-large-cnn) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.0177
-- Rouge1: 0.6339
-- Rouge2: 0.4113
-- Rougel: 0.5344
-- Rougelsum: 0.5338
-- Gen Len: 76.1278
 ## Model description
@@ -47,17 +47,21 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 4
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
-| No log        | 1.0   | 180  | 0.9026          | 0.6109 | 0.3819 | 0.5098 | 0.5094    | 76.9722 |
-| No log        | 2.0   | 360  | 0.9012          | 0.6273 | 0.4054 | 0.5285 | 0.5284    | 76.3833 |
-| 0.6717        | 3.0   | 540  | 0.9357          | 0.6312 | 0.4071 | 0.5297 | 0.5295    | 76.25   |
-| 0.6717        | 4.0   | 720  | 1.0177          | 0.6339 | 0.4113 | 0.5344 | 0.5338    | 76.1278 |
 ### Framework versions

 This model is a fine-tuned version of [facebook/bart-large-cnn](https://huggingface.co/facebook/bart-large-cnn) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.2638
+- Rouge1: 0.6376
+- Rouge2: 0.4143
+- Rougel: 0.538
+- Rougelsum: 0.5387
+- Gen Len: 76.8417
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 8
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
+| No log        | 1.0   | 180  | 0.8748          | 0.6166 | 0.3827 | 0.5058 | 0.5055    | 77.6583 |
+| No log        | 2.0   | 360  | 0.8774          | 0.6307 | 0.4064 | 0.5302 | 0.531     | 77.5111 |
+| 0.6761        | 3.0   | 540  | 0.9064          | 0.635  | 0.4052 | 0.5309 | 0.5311    | 76.2833 |
+| 0.6761        | 4.0   | 720  | 1.0386          | 0.6329 | 0.4038 | 0.5261 | 0.5262    | 78.4889 |
+| 0.6761        | 5.0   | 900  | 1.0993          | 0.6285 | 0.4016 | 0.5239 | 0.5246    | 77.0083 |
+| 0.2016        | 6.0   | 1080 | 1.2025          | 0.6351 | 0.4126 | 0.5351 | 0.5356    | 76.0722 |
+| 0.2016        | 7.0   | 1260 | 1.2399          | 0.6356 | 0.4108 | 0.5362 | 0.5368    | 78.5361 |
+| 0.2016        | 8.0   | 1440 | 1.2638          | 0.6376 | 0.4143 | 0.538  | 0.5387    | 76.8417 |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:26069ed409332087a3b642616a8601415e1167961df45defab88435ba16e7d80
 size 1625422896

 version https://git-lfs.github.com/spec/v1
+oid sha256:b2ab14edcc212f1ec5f3027c42bc5f7b2b0d51f4d939f26334a4dd0c05c5b4c1
 size 1625422896

runs/Jun15_11-27-20_4441297fbb55/events.out.tfevents.1718450841.4441297fbb55.7613.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3e73d53a377d0ee13dfca05ad17b032dbb8e9899421ddcbdb62d2206cf25cba7
-size 8991

 version https://git-lfs.github.com/spec/v1
+oid sha256:695853e5180610e9e961e64d33e0e7f4f80f2d6e2ee21fc55d94c3929a4b76fe
+size 10920