---
library_name: transformers
license: mit
datasets:
- AlienKevin/mixed_cantonese_and_english_speech
- mozilla-foundation/common_voice_17_0
- mozilla-foundation/common_voice_11_0
metrics:
- cer
base_model:
- openai/whisper-large-v3-turbo
---

CER: 13.7% <br/>
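
For readers unfamiliar with the metric: character error rate is usually computed as the character-level edit distance between the reference and the predicted transcript, divided by the number of reference characters. The sketch below is a minimal pure-Python illustration of that definition. It is not the evaluation script used for this model, which is not given here, and the names `edit_distance` and `cer` are hypothetical helpers.

```python
def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance between two strings, counted in characters."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # delete a reference character
                curr[j - 1] + 1,     # insert a hypothesis character
                prev[j - 1] + cost,  # substitute, or match at no cost
            ))
        prev = curr
    return prev[-1]


def cer(references, hypotheses):
    """Corpus-level CER: total edits divided by total reference characters."""
    edits = sum(edit_distance(r, h) for r, h in zip(references, hypotheses))
    chars = sum(len(r) for r in references)
    return edits / chars


if __name__ == "__main__":
    # One dropped character out of three reference characters.
    print(cer(["你好嗎"], ["你好"]))
```

Text normalization (punctuation, casing, spacing) before scoring can shift the result noticeably, so numbers are only comparable when the same normalization is applied.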

transformers version: 4.46.3<br/>

Training arguments:<br/>
per_device_train_batch_size=16,<br/>
gradient_accumulation_steps=1,<br/>
learning_rate=1e-5,<br/>
gradient_checkpointing=True,<br/>
per_device_eval_batch_size=16,<br/>
generation_max_length=225,<br/>

Hardware:<br/>
4 × NVIDIA Tesla V100 16GB<br/>
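
Taken together, the training arguments and hardware imply an effective (global) batch size. The short sketch below works it out, assuming all four GPUs were used for data-parallel training; that setup is an assumption, and `effective_batch_size` is a hypothetical helper.

```python
# Values copied from the training arguments listed above.
per_device_train_batch_size = 16
gradient_accumulation_steps = 1

# Assumption: all four V100s ran data-parallel replicas of the model.
num_gpus = 4


def effective_batch_size(per_device: int, accumulation: int, gpus: int) -> int:
    """Samples contributing to each optimizer step."""
    return per_device * accumulation * gpus


if __name__ == "__main__":
    print(effective_batch_size(
        per_device_train_batch_size,
        gradient_accumulation_steps,
        num_gpus,
    ))  # 64
```

When reproducing the run on fewer GPUs, raising `gradient_accumulation_steps` keeps this product the same.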


FAQ:
1. If you run into tokenizer issues during inference, upgrade transformers to version 4.46.3 or later:

```bash
pip install --upgrade "transformers>=4.46.3"
```
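
To check whether an environment already meets that minimum before running inference, something like the following could be used. It is a minimal sketch that relies only on the standard library; `parse_release`, `meets_minimum`, and `installed_transformers_ok` are hypothetical helper names, and pre-release suffixes such as `.dev0` or `rc1` are ignored for simplicity.

```python
from importlib.metadata import PackageNotFoundError, version

MIN_VERSION = (4, 46, 3)  # minimum transformers version named in the FAQ


def parse_release(v: str) -> tuple:
    """Turn '4.46.3' or '4.47.0.dev0' into a (major, minor, patch) tuple."""
    parts = []
    for piece in v.split(".")[:3]:
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break  # stop at suffixes such as 'rc1'
            digits += ch
        parts.append(int(digits or 0))
    while len(parts) < 3:
        parts.append(0)  # '5.0' is treated as 5.0.0
    return tuple(parts)


def meets_minimum(v: str, minimum: tuple = MIN_VERSION) -> bool:
    """True if version string v is at least the minimum release."""
    return parse_release(v) >= minimum


def installed_transformers_ok() -> bool:
    """Check the installed transformers package; False if it is missing."""
    try:
        return meets_minimum(version("transformers"))
    except PackageNotFoundError:
        return False


if __name__ == "__main__":
    print("transformers OK:", installed_transformers_ok())
```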