training completed: 256 128

Files changed (6)

README.md +67 -0
all_results.json +11 -0
generation_config.json +11 -0
model.safetensors +1 -1
runs/Mar05_15-07-27_8b73555077d4/events.out.tfevents.1709651249.8b73555077d4.537.0 +2 -2
test_results.json +11 -0

README.md ADDED

	@@ -0,0 +1,67 @@

+---
+tags:
+- generated_from_trainer
+metrics:
+- rouge
+model-index:
+- name: pegasus-legalease
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# pegasus-legalease
+This model was trained from scratch on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 4.2373
+- Rouge1: 0.4847
+- Rouge2: 0.3225
+- Rougel: 0.4194
+- Rougelsum: 0.4177
+- Gen Len: 43.02
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 2e-05
+- train_batch_size: 8
+- eval_batch_size: 8
+- seed: 42
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- num_epochs: 5
+- mixed_precision_training: Native AMP
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
+|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
+| 5.2738        | 1.0   | 125  | 4.6004          | 0.4363 | 0.2769 | 0.375  | 0.3743    | 37.24   |
+| 4.8164        | 2.0   | 250  | 4.4350          | 0.464  | 0.3085 | 0.405  | 0.4038    | 40.3    |
+| 4.8494        | 3.0   | 375  | 4.3372          | 0.473  | 0.3153 | 0.412  | 0.41      | 41.2    |
+| 4.6062        | 4.0   | 500  | 4.2669          | 0.4791 | 0.3196 | 0.4159 | 0.4141    | 43.03   |
+| 4.5682        | 5.0   | 625  | 4.2373          | 0.4847 | 0.3225 | 0.4194 | 0.4177    | 43.02   |
+### Framework versions
+- Transformers 4.38.1
+- Pytorch 2.1.0+cu121
+- Datasets 2.18.0
+- Tokenizers 0.15.2
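
The hyperparameters and results table in this model card imply a few derived quantities. The following is a minimal sketch of that arithmetic, assuming a single device, no gradient accumulation, and zero warmup steps; the card states none of these, so treat the outputs as estimates. The helper name `linear_lr` is hypothetical and is not part of any library.

```python
# Values copied from the model card above.
LEARNING_RATE = 2e-05
TRAIN_BATCH_SIZE = 8
NUM_EPOCHS = 5
STEPS_PER_EPOCH = 125  # step count reported at epoch 1.0 in the results table

# Total optimizer steps; this matches the step count in the last table row.
TOTAL_STEPS = STEPS_PER_EPOCH * NUM_EPOCHS


def linear_lr(step: int, total_steps: int = TOTAL_STEPS,
              base_lr: float = LEARNING_RATE) -> float:
    """Linear decay from base_lr to 0 (assumption: no warmup phase)."""
    return base_lr * max(0.0, (total_steps - step) / total_steps)


# Bounds on the training-set size (assumption: one device, no gradient
# accumulation). The final batch of an epoch may be partial.
max_train_examples = STEPS_PER_EPOCH * TRAIN_BATCH_SIZE
min_train_examples = (STEPS_PER_EPOCH - 1) * TRAIN_BATCH_SIZE + 1

if __name__ == "__main__":
    for step in (0, 125, 250, 375, 500, 625):
        print(f"step {step:3d}: lr = {linear_lr(step):.2e}")
    print(f"training examples: {min_train_examples} to {max_train_examples}")
```

Under these assumptions the learning rate has already dropped to one fifth of its initial value by the start of the final epoch, which is consistent with the shrinking per-epoch gains in validation loss shown in the table.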

all_results.json ADDED

	@@ -0,0 +1,11 @@

+{
+    "test_gen_len": 43.04,
+    "test_loss": 4.467371940612793,
+    "test_rouge1": 0.5024,
+    "test_rouge2": 0.3305,
+    "test_rougeL": 0.4305,
+    "test_rougeLsum": 0.4313,
+    "test_runtime": 29.5899,
+    "test_samples_per_second": 1.69,
+    "test_steps_per_second": 0.237
+}
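
The throughput fields in this file can be cross-checked against each other. The sketch below recovers the approximate test-set size and step count from the reported runtime, assuming evaluation ran on a single device with the `eval_batch_size` of 8 listed in the model card; the variable names are illustrative only.

```python
import math

# Values copied from all_results.json.
test_runtime = 29.5899            # seconds
test_samples_per_second = 1.69
test_steps_per_second = 0.237
eval_batch_size = 8               # from the model card hyperparameters

# Rates are rounded in the file, so round the products to whole counts.
approx_samples = round(test_runtime * test_samples_per_second)
approx_steps = round(test_runtime * test_steps_per_second)

# Steps expected for that many samples at the given batch size
# (assumption: single device, last batch may be partial).
expected_steps = math.ceil(approx_samples / eval_batch_size)

if __name__ == "__main__":
    print(f"approx. test samples: {approx_samples}")
    print(f"approx. test steps:   {approx_steps}")
    print(f"steps expected from batch size: {expected_steps}")
```

If the two step counts agree, the reported figures are internally consistent with the stated batch size.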

generation_config.json ADDED

	@@ -0,0 +1,11 @@

+{
+  "bos_token_id": 0,
+  "decoder_start_token_id": 0,
+  "eos_token_id": 1,
+  "forced_eos_token_id": 1,
+  "length_penalty": 0.6,
+  "max_length": 64,
+  "num_beams": 8,
+  "pad_token_id": 0,
+  "transformers_version": "4.38.1"
+}
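
To illustrate what `length_penalty: 0.6` does during beam search: per the Transformers documentation, the penalty is applied as an exponent to the sequence length, and the summed log-likelihood is divided by the result. The sketch below reproduces only that normalization in plain Python. The function name `beam_score` and the two candidate hypotheses are hypothetical and are not taken from this model's outputs.

```python
LENGTH_PENALTY = 0.6  # from generation_config.json


def beam_score(sum_logprobs: float, length: int,
               length_penalty: float = LENGTH_PENALTY) -> float:
    """Length-normalized beam score: sum of log-probs / length ** penalty."""
    return sum_logprobs / (length ** length_penalty)


# Two made-up finished hypotheses: (summed log-probability, token count).
short_hyp = (-12.0, 20)
long_hyp = (-16.0, 40)

if __name__ == "__main__":
    for penalty in (0.0, LENGTH_PENALTY):
        s = beam_score(*short_hyp, length_penalty=penalty)
        l = beam_score(*long_hyp, length_penalty=penalty)
        winner = "long" if l > s else "short"
        print(f"length_penalty={penalty}: short={s:.3f}, long={l:.3f} -> {winner}")
```

With no normalization the shorter hypothesis wins on raw log-likelihood; with a positive penalty the longer one can overtake it, which is why values above 0 favor longer summaries up to the `max_length` of 64.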

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2dd165bef58417daab01777c2d4ba1568548dd76349a87802f26053a3c8ca0b3
+oid sha256:cc9bc2483efd83c76cc6e28d15e244429b745f6a7d1de702064013a75e10849c
 size 2279458540

runs/Mar05_15-07-27_8b73555077d4/events.out.tfevents.1709651249.8b73555077d4.537.0 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:858ee452e7e76b76242532ad2016fe6452c9c7095d325f3837898b645519fdee
-size 9764
+oid sha256:5850d9cdcf393732598bbd6d2f28c72606434f5e671d101ef29d186ce4b8f631
+size 11065

test_results.json ADDED

	@@ -0,0 +1,11 @@

+{
+    "test_gen_len": 43.04,
+    "test_loss": 4.467371940612793,
+    "test_rouge1": 0.5024,
+    "test_rouge2": 0.3305,
+    "test_rougeL": 0.4305,
+    "test_rougeLsum": 0.4313,
+    "test_runtime": 29.5899,
+    "test_samples_per_second": 1.69,
+    "test_steps_per_second": 0.237
+}