license: apache-2.0
base_model: bert-base-uncased
tags:
  - generated_from_keras_callback
model-index:
  - name: node-py/my_awesome_eli5_clm-model
    results: []

node-py/my_awesome_eli5_clm-model

This model is a fine-tuned version of bert-base-uncased on an unknown dataset. At the end of training (epoch 35), it reports the following training and validation results:

  • Train Loss: 2.2501
  • Validation Loss: 4.2690
  • Epoch: 35
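
The losses above can be easier to interpret as perplexities. The snippet below is a minimal sketch that assumes each reported loss is a mean per-token cross-entropy in nats, which is typical for Keras language-model training but is not stated in this card.

```python
import math

# Final-epoch losses as reported in this card.
train_loss = 2.2501
val_loss = 4.2690

# Assumption: each loss is a mean per-token cross-entropy in nats,
# so perplexity is simply exp(loss).
train_perplexity = math.exp(train_loss)
val_perplexity = math.exp(val_loss)

print(f"train perplexity: {train_perplexity:.2f}")
print(f"validation perplexity: {val_perplexity:.2f}")
```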

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • optimizer: {'name': 'AdamWeightDecay', 'learning_rate': 2e-05, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False, 'weight_decay_rate': 0.01}
  • training_precision: float32
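
To reuse these settings, the logged optimizer dictionary can be turned into keyword arguments. This is a sketch, not the original training script: `build_optimizer` is a hypothetical helper, and dropping the `name` and `decay` entries is an assumption (`decay` is 0.0 here and is a legacy Keras argument). The `AdamWeightDecay` class itself is the TensorFlow optimizer shipped with Transformers.

```python
import ast

# Optimizer settings copied verbatim from the hyperparameter list above.
OPTIMIZER_CONFIG = (
    "{'name': 'AdamWeightDecay', 'learning_rate': 2e-05, 'decay': 0.0, "
    "'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False, "
    "'weight_decay_rate': 0.01}"
)

# literal_eval safely parses the Python-literal dict without executing code.
config = ast.literal_eval(OPTIMIZER_CONFIG)

# Assumption: 'name' only identifies the class, and 'decay' (0.0) is a
# legacy argument that can be omitted without changing behavior.
optimizer_kwargs = {k: v for k, v in config.items() if k not in ("name", "decay")}


def build_optimizer(kwargs):
    """Hypothetical helper: construct the optimizer from the parsed settings."""
    # Imported lazily so the parsing above runs without TensorFlow installed.
    from transformers import AdamWeightDecay

    return AdamWeightDecay(**kwargs)


print(optimizer_kwargs)
```

Calling `build_optimizer(optimizer_kwargs)` requires TensorFlow and Transformers to be installed; the result can then be passed to `model.compile(optimizer=...)`.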

Training results

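| Train Loss | Validation Loss | Epoch |
|------------|-----------------|-------|
| 7.3929     | 6.0819          | 0     |
| 5.9912     | 5.8303          | 1     |
| 5.7553     | 5.5854          | 2     |
| 5.5082     | 5.3645          | 3     |
| 5.2836     | 5.1815          | 4     |
| 5.0867     | 5.0252          | 5     |
| 4.9075     | 4.8834          | 6     |
| 4.7424     | 4.7747          | 7     |
| 4.5947     | 4.6684          | 8     |
| 4.4570     | 4.5836          | 9     |
| 4.3290     | 4.5194          | 10    |
| 4.2123     | 4.4408          | 11    |
| 4.1037     | 4.3965          | 12    |
| 3.9979     | 4.3630          | 13    |
| 3.8983     | 4.3101          | 14    |
| 3.8011     | 4.2792          | 15    |
| 3.7097     | 4.2592          | 16    |
| 3.6182     | 4.2285          | 17    |
| 3.5337     | 4.2061          | 18    |
| 3.4483     | 4.1943          | 19    |
| 3.3589     | 4.1787          | 20    |
| 3.2776     | 4.1684          | 21    |
| 3.1959     | 4.1764          | 22    |
| 3.1161     | 4.1673          | 23    |
| 3.0389     | 4.1643          | 24    |
| 2.9631     | 4.1647          | 25    |
| 2.8859     | 4.1639          | 26    |
| 2.8110     | 4.1737          | 27    |
| 2.7362     | 4.1677          | 28    |
| 2.6631     | 4.1952          | 29    |
| 2.5915     | 4.2045          | 30    |
| 2.5227     | 4.1999          | 31    |
| 2.4573     | 4.2159          | 32    |
| 2.3862     | 4.2372          | 33    |
| 2.3205     | 4.2358          | 34    |
| 2.2501     | 4.2690          | 35    |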
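
In the results above, the training loss falls at every epoch, while the validation loss reaches its minimum of 4.1639 at epoch 26 and then drifts up to 4.2690 by epoch 35, a pattern that usually indicates overfitting. The sketch below locates the best epoch from the logged values; the variable names are illustrative and not part of the original training code.

```python
# Losses copied from the training results above, indexed by epoch (0-35).
train_losses = [
    7.3929, 5.9912, 5.7553, 5.5082, 5.2836, 5.0867, 4.9075, 4.7424, 4.5947,
    4.4570, 4.3290, 4.2123, 4.1037, 3.9979, 3.8983, 3.8011, 3.7097, 3.6182,
    3.5337, 3.4483, 3.3589, 3.2776, 3.1959, 3.1161, 3.0389, 2.9631, 2.8859,
    2.8110, 2.7362, 2.6631, 2.5915, 2.5227, 2.4573, 2.3862, 2.3205, 2.2501,
]
val_losses = [
    6.0819, 5.8303, 5.5854, 5.3645, 5.1815, 5.0252, 4.8834, 4.7747, 4.6684,
    4.5836, 4.5194, 4.4408, 4.3965, 4.3630, 4.3101, 4.2792, 4.2592, 4.2285,
    4.2061, 4.1943, 4.1787, 4.1684, 4.1764, 4.1673, 4.1643, 4.1647, 4.1639,
    4.1737, 4.1677, 4.1952, 4.2045, 4.1999, 4.2159, 4.2372, 4.2358, 4.2690,
]

# Epoch with the lowest validation loss.
best_epoch = min(range(len(val_losses)), key=val_losses.__getitem__)
best_val_loss = val_losses[best_epoch]

# How much validation loss has degraded since the best epoch, and how far
# apart training and validation loss are at the end of training.
final_epoch = len(val_losses) - 1
regression = val_losses[final_epoch] - best_val_loss
final_gap = val_losses[final_epoch] - train_losses[final_epoch]

print(f"best epoch: {best_epoch} (validation loss {best_val_loss:.4f})")
print(f"validation loss increase since best epoch: {regression:.4f}")
print(f"train/validation gap at epoch {final_epoch}: {final_gap:.4f}")
```

If the run were repeated, a callback such as Keras `EarlyStopping(monitor="val_loss", restore_best_weights=True)` is one common way to keep the weights from the best epoch rather than the last one.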

Framework versions

  • Transformers 4.44.0
  • TensorFlow 2.16.1
  • Datasets 2.21.0
  • Tokenizers 0.19.1
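
To check whether a local environment matches these versions, the standard-library `importlib.metadata` module can be used. This is a sketch under the assumption that the PyPI distribution names are the lowercase package names listed above; `check_versions` is a hypothetical helper.

```python
from importlib.metadata import PackageNotFoundError, version

# Versions listed in this card; distribution names are assumed to be lowercase.
EXPECTED = {
    "transformers": "4.44.0",
    "tensorflow": "2.16.1",
    "datasets": "2.21.0",
    "tokenizers": "0.19.1",
}


def check_versions(expected):
    """Return {package: installed_version_or_None} for every mismatch."""
    mismatches = {}
    for package, wanted in expected.items():
        try:
            installed = version(package)
        except PackageNotFoundError:
            installed = None  # Package is not installed at all.
        if installed != wanted:
            mismatches[package] = installed
    return mismatches


for package, installed in check_versions(EXPECTED).items():
    found = installed if installed is not None else "not installed"
    print(f"{package}: expected {EXPECTED[package]}, found {found}")
```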