---
language:
  - 'no'
license: apache-2.0
tags:
  - audio
  - asr
  - automatic-speech-recognition
  - hf-asr-leaderboard
model-index:
  - name: scream_non_large_timestamp_test
    results: []
---

scream_non_large_timestamp_test

This model is a fine-tuned version of NbAiLab/scream_non_large_1e06_beams5_constantlr_long on the NbAiLab/NCC_whisper_both_timestamp dataset.
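
For a quick test of the checkpoint, the sketch below runs inference through the Transformers automatic-speech-recognition pipeline. The repository id NbAiLab/scream_non_large_timestamp_test and the audio file name are assumptions; point them at wherever the checkpoint and your audio actually live.

```python
# Minimal inference sketch. The repo id below is an assumption based on the model
# name in this card; replace it (and sample.wav) with your actual paths.
from transformers import pipeline

asr = pipeline(
    "automatic-speech-recognition",
    model="NbAiLab/scream_non_large_timestamp_test",
)

# Whisper-style checkpoints can return segment timestamps along with the text.
result = asr("sample.wav", return_timestamps=True)
print(result["text"])
for chunk in result.get("chunks", []):
    print(chunk["timestamp"], chunk["text"])
```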

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training (an illustrative configuration sketch follows the list):

  • learning_rate: 1e-06
  • lr_scheduler_type: linear
  • per_device_train_batch_size: 16
  • total_train_batch_size_per_node: 64
  • total_train_batch_size: 64
  • total_optimization_steps: 5,000
  • starting_optimization_step: None
  • finishing_optimization_step: 5,000
  • num_train_dataset_workers: 32
  • num_hosts: 1
  • total_num_training_examples: 320,000
  • steps_per_epoch: 475
  • num_beams: 5
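
The card does not include the training script itself, so the sketch below only illustrates how these values could be expressed as Transformers Seq2SeqTrainingArguments. The output directory and the gradient-accumulation setting are assumptions; the jump from a per-device batch size of 16 to a total of 64 may equally come from training on four devices.

```python
# Illustrative mapping of the listed hyperparameters onto Seq2SeqTrainingArguments.
# This is NOT the original training setup (which reports per-node/per-host batch
# sizes and likely ran on multiple accelerators); it is a single-device sketch.
from transformers import Seq2SeqTrainingArguments

training_args = Seq2SeqTrainingArguments(
    output_dir="./scream_non_large_timestamp_test",  # hypothetical output path
    learning_rate=1e-6,
    lr_scheduler_type="linear",
    per_device_train_batch_size=16,
    gradient_accumulation_steps=4,   # assumption: 16 x 4 = 64 total batch size
    max_steps=5_000,
    dataloader_num_workers=32,
    predict_with_generate=True,
    generation_num_beams=5,
)
```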

Training results

| Step | Eval loss | Train loss | Eval WER | Eval CER |
|-----:|----------:|-----------:|---------:|---------:|
| 0    | 0.5499    | 0.6409     | 10.5055  | 5.6538   |
| 500  | 0.4301    | 0.1060     | 10.1401  | 4.9232   |
| 1000 | 0.4168    | 0.0776     | 10.3837  | 4.8123   |
| 1500 | 0.4132    | 0.0689     | 10.5968  | 4.8123   |
| 2000 | 0.4176    | 0.0511     | 10.5968  | 4.6712   |
| 2500 | 0.4261    | 0.0454     | 11.0536  | 4.7871   |
| 3000 | 0.4385    | 0.0441     | 11.1449  | 4.7770   |
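
The WER and CER values above appear to be percentages. As a reference, word- and character-error rates like these are typically computed with the evaluate library, as in the sketch below; the example strings are placeholders, not the actual evaluation data.

```python
# Sketch of WER/CER computation with the `evaluate` library. The example strings
# are placeholders, not samples from NbAiLab/NCC_whisper_both_timestamp.
import evaluate

wer_metric = evaluate.load("wer")
cer_metric = evaluate.load("cer")

predictions = ["dette er en test"]       # model transcriptions (placeholder)
references = ["dette er en liten test"]  # reference transcripts (placeholder)

# evaluate returns error rates as fractions; multiply by 100 to match the
# percentage-style numbers in the table above.
wer = 100 * wer_metric.compute(predictions=predictions, references=references)
cer = 100 * cer_metric.compute(predictions=predictions, references=references)
print(f"WER: {wer:.4f}  CER: {cer:.4f}")
```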

Framework versions

  • Transformers 4.29.0.dev0
  • Datasets 2.12.0
  • Tokenizers 0.13.3