tilyupo
/

t5-base-mmlu-qa2a

Text2Text Generation

generated_from_keras_callback

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

tilyupo commited on Aug 7, 2023

Commit

f4558db

·

1 Parent(s): 7a07d39

batch_size=128

Files changed (3) hide show

README.md +4 -4
config.json +1 -2
tf_model.h5 +1 -1

README.md CHANGED Viewed

@@ -15,8 +15,8 @@ probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Train Loss: 0.2609
-- Validation Loss: 0.1324
 - Epoch: 0
 ## Model description
@@ -43,12 +43,12 @@ The following hyperparameters were used during training:
 | Train Loss | Validation Loss | Epoch |
 |:----------:|:---------------:|:-----:|
-| 0.2609     | 0.1324          | 0     |
 ### Framework versions
 - Transformers 4.31.0
 - TensorFlow 2.12.0
-- Datasets 2.14.1
 - Tokenizers 0.13.3

 This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Train Loss: 0.4630
+- Validation Loss: 0.6959
 - Epoch: 0
 ## Model description
 | Train Loss | Validation Loss | Epoch |
 |:----------:|:---------------:|:-----:|
+| 0.4630     | 0.6959          | 0     |
 ### Framework versions
 - Transformers 4.31.0
 - TensorFlow 2.12.0
+- Datasets 2.14.3
 - Tokenizers 0.13.3

config.json CHANGED Viewed

@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "tilyupo/t5-base-mmlu-qa2a",
   "architectures": [
     "T5ForConditionalGeneration"
   ],
@@ -54,7 +54,6 @@
     }
   },
   "tie_word_embeddings": false,
-  "torch_dtype": "float32",
   "transformers_version": "4.31.0",
   "use_cache": true,
   "vocab_size": 32128

 {
+  "_name_or_path": "google/flan-t5-base",
   "architectures": [
     "T5ForConditionalGeneration"
   ],
     }
   },
   "tie_word_embeddings": false,
   "transformers_version": "4.31.0",
   "use_cache": true,
   "vocab_size": 32128

tf_model.h5 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2624b5e9c9428bf69fc362d07940cb9c8b845f1b6260bdbee2fca7df91e83016
 size 1188285040

 version https://git-lfs.github.com/spec/v1
+oid sha256:871821094888498cd5e70489f6be076125e9828f7962f02159eafecebd282e95
 size 1188285040