Training complete!
- README.md +21 -26
- config.json +1 -1
- model.safetensors +1 -1
- training_args.bin +1 -1
README.md
CHANGED
@@ -1,14 +1,12 @@
 ---
 library_name: transformers
 license: apache-2.0
-base_model:
+base_model: shorecode/t5-efficient-tiny-nh8-summarizer
 tags:
 - generated_from_trainer
 model-index:
 - name: t5-efficient-tiny-nh8-summarizer
   results: []
-datasets:
-- shorecode/summary-collection-60k-rows
 ---
 
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -16,32 +14,30 @@ should probably proofread and complete it, then remove this comment. -->
 
 # t5-efficient-tiny-nh8-summarizer
 
-This model is a fine-tuned version of [
+This model is a fine-tuned version of [shorecode/t5-efficient-tiny-nh8-summarizer](https://huggingface.co/shorecode/t5-efficient-tiny-nh8-summarizer) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.
+- Loss: 0.6597
 
 ## Model description
 
-
+More information needed
 
 ## Intended uses & limitations
 
-
+More information needed
 
 ## Training and evaluation data
 
-
+More information needed
 
 ## Training procedure
 
-Trained using the Gradio SDK on Hugging Face Spaces using shared Zero GPU(s)
-
 ### Training hyperparameters
 
 The following hyperparameters were used during training:
-- learning_rate:
-- train_batch_size:
-- eval_batch_size:
+- learning_rate: 0.00015000000000000001
+- train_batch_size: 63
+- eval_batch_size: 63
 - seed: 42
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
@@ -52,18 +48,17 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 1.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.7781 | 2.7939 | 2400 | 0.7583 |
+| 1.0837        | 0.2663 | 200  | 0.9227          |
+| 0.9027        | 0.5326 | 400  | 0.8449          |
+| 0.842         | 0.7989 | 600  | 0.7949          |
+| 0.7971        | 1.0652 | 800  | 0.7585          |
+| 0.768         | 1.3316 | 1000 | 0.7288          |
+| 0.7359        | 1.5979 | 1200 | 0.7069          |
+| 0.7145        | 1.8642 | 1400 | 0.6898          |
+| 0.7047        | 2.1305 | 1600 | 0.6773          |
+| 0.6926        | 2.3968 | 1800 | 0.6678          |
+| 0.6855        | 2.6631 | 2000 | 0.6620          |
+| 0.68          | 2.9294 | 2200 | 0.6597          |
 
 
 ### Framework versions
@@ -71,4 +66,4 @@ The following hyperparameters were used during training:
 - Transformers 4.47.0
 - Pytorch 2.4.0+cu121
 - Datasets 3.0.0
-- Tokenizers 0.21.0
+- Tokenizers 0.21.0
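The hyperparameters in the updated card map directly onto `transformers` training arguments. A minimal sketch of the equivalent configuration, assuming `Seq2SeqTrainingArguments` and a steps-based eval cadence inferred from the 200-step spacing of the loss table (neither is confirmed by the card; `output_dir` is hypothetical):

```python
from transformers import Seq2SeqTrainingArguments

args = Seq2SeqTrainingArguments(
    output_dir="t5-efficient-tiny-nh8-summarizer",  # hypothetical local dir
    learning_rate=1.5e-4,  # the card prints 0.00015000000000000001, a float-repr artifact of 1.5e-4
    per_device_train_batch_size=63,
    per_device_eval_batch_size=63,
    seed=42,
    optim="adamw_torch",
    adam_beta1=0.9,
    adam_beta2=0.999,
    adam_epsilon=1e-8,
    lr_scheduler_type="linear",
    eval_strategy="steps",  # assumption: the table logs validation loss every 200 steps
    eval_steps=200,
)
```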
config.json
CHANGED
@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "
+  "_name_or_path": "shorecode/t5-efficient-tiny-nh8-summarizer",
   "architectures": [
     "T5ForConditionalGeneration"
   ],
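Since `config.json` pins the architecture to `T5ForConditionalGeneration`, the checkpoint loads through the standard seq2seq auto classes. A hedged inference sketch; the `summarize:` task prefix is a common T5 convention, not something the card documents:

```python
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

repo = "shorecode/t5-efficient-tiny-nh8-summarizer"
tokenizer = AutoTokenizer.from_pretrained(repo)
model = AutoModelForSeq2SeqLM.from_pretrained(repo)  # resolves to T5ForConditionalGeneration

text = "summarize: " + "Your long input document goes here."
inputs = tokenizer(text, return_tensors="pt", truncation=True)
summary_ids = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(summary_ids[0], skip_special_tokens=True))
```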
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:0a59a8c9c0ed288ff84d9af8d349bc7f8a93fef22d16d02f70e19f317c75f18e
 size 62293080
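`model.safetensors` is tracked with Git LFS, so the diff only touches the pointer file: `oid sha256:` is the digest of the actual weights blob and `size` its byte count. A small sketch for checking a downloaded copy against the pointer (the local path is hypothetical):

```python
import hashlib
import os

EXPECTED_OID = "0a59a8c9c0ed288ff84d9af8d349bc7f8a93fef22d16d02f70e19f317c75f18e"
EXPECTED_SIZE = 62293080

def sha256_of(path, chunk_size=1 << 20):
    # Stream in chunks so the ~62 MB weights file never has to fit in one read.
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()

path = "model.safetensors"  # the resolved file, not the LFS pointer text
assert os.path.getsize(path) == EXPECTED_SIZE, "size mismatch"
assert sha256_of(path) == EXPECTED_OID, "sha256 mismatch"
```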
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:639711c33405bbf09f602e52ebfd3058526167da57c50ed1314590513f1c12fe
 size 5304
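`training_args.bin` is the pickled `TrainingArguments` object the Trainer saved, so hyperparameters that the old card truncated can be read back from it. A sketch; note that unpickling executes arbitrary code, so only do this for repos you trust:

```python
import torch

# weights_only=False is required because this is a pickled Python object,
# not a tensor checkpoint; unpickle only files from trusted sources.
training_args = torch.load("training_args.bin", weights_only=False)
print(training_args.learning_rate)
print(training_args.per_device_train_batch_size)
print(training_args.lr_scheduler_type)
```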