ymoslem's picture
End of training
f249ee2 verified
|
raw
history blame
4.44 kB
metadata
language:
  - ga
  - en
license: apache-2.0
base_model: openai/whisper-small
tags:
  - generated_from_trainer
datasets:
  - ymoslem/IWSLT2023-GA-EN
  - ymoslem/FLEURS-GA-EN
  - ymoslem/BitesizeIrish-GA-EN
  - ymoslem/SpokenWords-GA-EN-MTed
  - ymoslem/Tatoeba-Speech-Irish
  - ymoslem/Wikimedia-Speech-Irish
metrics:
  - bleu
  - wer
model-index:
  - name: Whisper Small GA-EN Speech Translation
    results:
      - task:
          name: Automatic Speech Recognition
          type: automatic-speech-recognition
        dataset:
          name: IWSLT-2023, FLEURS, BiteSize, SpokenWords, Tatoeba, and Wikimedia
          type: ymoslem/IWSLT2023-GA-EN
        metrics:
          - name: Bleu
            type: bleu
            value: 27.57
          - name: Wer
            type: wer
            value: 70.64385411976588

Whisper Small GA-EN Speech Translation

This model is a fine-tuned version of openai/whisper-small on the IWSLT-2023, FLEURS, BiteSize, SpokenWords, Tatoeba, and Wikimedia dataset. It achieves the following results on the evaluation set:

  • Loss: 1.1870
  • Bleu: 27.57
  • Chrf: 46.72
  • Wer: 70.6439

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 0.0001
  • train_batch_size: 32
  • eval_batch_size: 32
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • lr_scheduler_warmup_steps: 0.03
  • training_steps: 3000
  • mixed_precision_training: Native AMP

Training results

Training Loss Epoch Step Bleu Chrf Validation Loss Wer
2.6685 0.07 100 5.05 20.18 2.0544 139.8919
2.4028 0.13 200 12.29 29.72 1.7367 95.5425
2.1231 0.2 300 14.33 30.77 1.6141 101.3958
1.9192 0.26 400 16.86 35.65 1.4778 91.0851
1.7129 0.33 500 16.77 37.53 1.3811 93.8766
1.5398 0.39 600 18.85 39.0 1.3427 90.2296
1.4257 0.46 700 25.73 43.3 1.2784 70.3287
1.3044 0.53 800 25.43 44.33 1.2274 72.3548
1.2626 0.59 900 25.09 44.62 1.1875 72.6249
1.2801 0.66 1000 25.68 45.53 1.1571 71.0491
1.2876 0.72 1100 20.62 41.49 1.2193 85.8622
1.2609 0.79 1200 29.47 45.04 1.2079 65.2859
1.187 0.85 1300 24.65 43.73 1.2086 72.9851
1.0342 0.92 1400 30.34 47.62 1.1766 64.3854
1.0519 0.98 1500 29.39 47.69 1.1425 64.9707
0.5473 1.05 1600 28.02 46.27 1.1842 67.6722
0.4886 1.12 1700 26.62 46.37 1.1845 76.4971
0.4354 1.18 1800 23.63 45.16 1.1621 86.1324
0.4709 1.25 1900 27.86 47.3 1.1544 73.7506
0.4802 1.31 2000 30.25 48.12 1.1571 64.9707
0.4565 1.38 2100 1.2095 24.75 44.7 77.4426
0.4797 1.44 2200 1.2051 28.46 46.03 67.1769
0.423 1.51 2300 1.2079 28.34 47.65 68.6177
0.4254 1.58 2400 1.2251 27.78 46.01 67.8523
0.4493 1.64 2500 1.1898 26.61 47.8 71.1391
0.3614 1.71 2600 1.2079 30.08 47.25 64.2954
0.4052 1.77 2700 1.1975 30.88 47.44 64.2053
0.3541 1.84 2800 1.2006 28.4 46.02 70.2837
0.3736 1.9 2900 1.1906 30.82 47.52 64.1153
0.3326 1.97 3000 1.1870 27.57 46.72 70.6439

Framework versions

  • Transformers 4.39.3
  • Pytorch 2.2.1+cu121
  • Datasets 2.18.0
  • Tokenizers 0.15.2