Model save

Browse files

Files changed (4) hide show

README.md +83 -0
generation_config.json +12 -0
model.safetensors +1 -1
runs/Dec28_01-27-02_william-gpu-3090-10-rxldr/events.out.tfevents.1703726825.william-gpu-3090-10-rxldr.35178.0 +2 -2

README.md ADDED Viewed

	@@ -0,0 +1,83 @@

+---
+license: apache-2.0
+base_model: facebook/bart-base
+tags:
+- generated_from_trainer
+metrics:
+- rouge
+model-index:
+- name: fine-tuned-bart-20-epochs
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# fine-tuned-bart-20-epochs
+This model is a fine-tuned version of [facebook/bart-base](https://huggingface.co/facebook/bart-base) on the None dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.8098
+- Rouge1: 0.3246
+- Rouge2: 0.1287
+- Rougel: 0.2921
+- Rougelsum: 0.2912
+- Gen Len: 14.96
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 2e-05
+- train_batch_size: 4
+- eval_batch_size: 4
+- seed: 42
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- num_epochs: 20
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
+|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
+| No log        | 1.0   | 301  | 0.7895          | 0.2498 | 0.0834 | 0.2162 | 0.2159    | 14.58   |
+| 1.8122        | 2.0   | 602  | 0.7331          | 0.2226 | 0.0794 | 0.1943 | 0.1931    | 13.51   |
+| 1.8122        | 3.0   | 903  | 0.7235          | 0.2935 | 0.1013 | 0.2652 | 0.2647    | 14.69   |
+| 0.6848        | 4.0   | 1204 | 0.7225          | 0.322  | 0.1245 | 0.2867 | 0.2857    | 13.92   |
+| 0.5826        | 5.0   | 1505 | 0.7238          | 0.322  | 0.1149 | 0.2863 | 0.2854    | 14.81   |
+| 0.5826        | 6.0   | 1806 | 0.7204          | 0.3255 | 0.1212 | 0.2977 | 0.2963    | 14.98   |
+| 0.5013        | 7.0   | 2107 | 0.7377          | 0.3061 | 0.1104 | 0.2784 | 0.2767    | 14.84   |
+| 0.5013        | 8.0   | 2408 | 0.7396          | 0.3092 | 0.1227 | 0.275  | 0.2741    | 14.17   |
+| 0.4384        | 9.0   | 2709 | 0.7413          | 0.3224 | 0.1271 | 0.2935 | 0.2928    | 14.44   |
+| 0.3952        | 10.0  | 3010 | 0.7458          | 0.3288 | 0.1302 | 0.2925 | 0.2925    | 15.09   |
+| 0.3952        | 11.0  | 3311 | 0.7615          | 0.3496 | 0.139  | 0.3139 | 0.3137    | 15.13   |
+| 0.3626        | 12.0  | 3612 | 0.7733          | 0.3311 | 0.1264 | 0.3057 | 0.3049    | 14.84   |
+| 0.3626        | 13.0  | 3913 | 0.7779          | 0.3184 | 0.1226 | 0.286  | 0.2857    | 15.02   |
+| 0.3254        | 14.0  | 4214 | 0.7854          | 0.3258 | 0.1199 | 0.2911 | 0.2915    | 14.89   |
+| 0.2983        | 15.0  | 4515 | 0.7863          | 0.3346 | 0.1189 | 0.3027 | 0.3009    | 14.93   |
+| 0.2983        | 16.0  | 4816 | 0.7979          | 0.3201 | 0.117  | 0.2857 | 0.2843    | 15.05   |
+| 0.2807        | 17.0  | 5117 | 0.8037          | 0.3223 | 0.1216 | 0.291  | 0.2899    | 15.1    |
+| 0.2807        | 18.0  | 5418 | 0.8048          | 0.3313 | 0.1261 | 0.3003 | 0.2996    | 15.1    |
+| 0.2653        | 19.0  | 5719 | 0.8114          | 0.3285 | 0.1298 | 0.297  | 0.2963    | 15.01   |
+| 0.2562        | 20.0  | 6020 | 0.8098          | 0.3246 | 0.1287 | 0.2921 | 0.2912    | 14.96   |
+### Framework versions
+- Transformers 4.36.2
+- Pytorch 1.12.1+cu113
+- Datasets 2.15.0
+- Tokenizers 0.15.0

generation_config.json ADDED Viewed

	@@ -0,0 +1,12 @@

+{
+  "bos_token_id": 0,
+  "decoder_start_token_id": 2,
+  "early_stopping": true,
+  "eos_token_id": 2,
+  "forced_bos_token_id": 0,
+  "forced_eos_token_id": 2,
+  "no_repeat_ngram_size": 3,
+  "num_beams": 4,
+  "pad_token_id": 1,
+  "transformers_version": "4.36.2"
+}

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:cbfc89128e635dd12a99e483a8f660ec20e36844c48e92a8cb6c660f820c9347
 size 557912620

 version https://git-lfs.github.com/spec/v1
+oid sha256:518afc220febcc483bd05c196af9abf1f0894aa7dc5d9c9962b8ec18d01a8683
 size 557912620

runs/Dec28_01-27-02_william-gpu-3090-10-rxldr/events.out.tfevents.1703726825.william-gpu-3090-10-rxldr.35178.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5fd8bf05ce0ab5fee63f360f58659cd21a3715425d0a7be9479108d240bb63c6
-size 17491

 version https://git-lfs.github.com/spec/v1
+oid sha256:08d4565fc412c3f0597c166d7db59af0ebaa144565104ead16d97acb91c04e29
+size 18370