Add training results
README.md CHANGED
@@ -21,8 +21,16 @@ This is a T5v1.1 (small) trained on the concatenation of the Arabic Billion Word
 | learning rate | `1e-2` |
 | dtype         | `jnp.float32` |
 
+## Results
 
-
+|                     |               |
+| :-----------------: | :-----------: |
+| evaluation accuracy | `56.84%`      |
+| evaluation loss     | `2.423`       |
+| training loss       | `2.392`       |
+| training time       | `22h 23m 51s` |
+
+## Note for finetuning
 
 This model was pretrained with dropout turned off, so the default `dropout_rate` in the model config is `0`.
 To finetune the model, dropout should be turned back on, like this:
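(The snippet referenced by "like this:" sits outside this hunk and is not shown in the diff. Below is a minimal sketch of re-enabling dropout with the Flax T5 classes in 🤗 Transformers; the repository id and the `0.1` dropout rate are placeholders, not values taken from this commit.)

```python
from transformers import FlaxT5ForConditionalGeneration, T5Config

# Placeholder repository id -- substitute the actual model id for this README.
model_id = "<this-model-repo>"

# The pretrained config ships with dropout_rate=0, so override it when loading.
# 0.1 is T5's usual default and is used here only as an illustrative value.
config = T5Config.from_pretrained(model_id, dropout_rate=0.1)
model = FlaxT5ForConditionalGeneration.from_pretrained(model_id, config=config)
```

Passing `dropout_rate=0.1` directly to `from_pretrained` has the same effect, since keyword arguments not consumed by the model loader are forwarded to the config.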