Malaysian Finetune Whisper Large V3 Turbo
Whisper Large V3 Turbo finetuned on Malaysian context.
Improvement
- Distilled from Whisper Large V3 on Malaysian and Science context.
- Better translation for Malay, Manglish, Mandarin, Tamil and Science context.
- Word-level timestamps, via the newly introduced <|transcribeprecise|> token, a new task!
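A minimal usage sketch follows. It assumes the repository loads with the standard transformers automatic-speech-recognition pipeline, and that <|transcribeprecise|> sits in the decoder prompt where <|transcribe|> normally goes. Neither is confirmed above. The helper names `build_decoder_prompt` and `transcribe` are hypothetical.

```python
MODEL_ID = "malaysia-ai/malaysian-whisper-large-v3-turbo"

# Task tokens. The first two are standard Whisper tokens; the third is the
# new task token named in this card.
TASK_TOKENS = {
    "transcribe": "<|transcribe|>",
    "translate": "<|translate|>",
    "transcribeprecise": "<|transcribeprecise|>",
}


def build_decoder_prompt(language: str, task: str) -> str:
    """Return a Whisper-style decoder prompt string.

    Assumption: the new task token replaces <|transcribe|> in the usual
    <|startoftranscript|><|lang|><|task|> layout.
    """
    if task not in TASK_TOKENS:
        raise ValueError(f"unknown task: {task!r}")
    return f"<|startoftranscript|><|{language}|>{TASK_TOKENS[task]}"


def transcribe(audio_path: str, language: str = "ms") -> str:
    """Transcribe a short clip (under 30 s) with the standard transcribe task."""
    # Imported lazily so the prompt helper works without transformers installed.
    from transformers import pipeline

    asr = pipeline("automatic-speech-recognition", model=MODEL_ID)
    result = asr(
        audio_path,
        generate_kwargs={"language": language, "task": "transcribe"},
    )
    return result["text"]
```

Calling `build_decoder_prompt("ms", "transcribeprecise")` gives the prompt string one would tokenize and pass as decoder input to request word-level timestamps, if the layout assumption holds.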
How did we finetune it?
We finetuned in 2 phases:
- Finetune on mesolitica/Malaysian-STT-Whisper
  - WandB at https://wandb.ai/huseinzol05/malaysian-whisper-large-v3-turbo-v3?nw=nwuserhuseinzol05, training still in progress
- Annealing on 5% of mesolitica/Malaysian-STT-Whisper and 100% of malaysia-ai/STT-Whisper, training still in progress
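The annealing mixture can be pictured with a small sketch. It assumes the 5% is a uniform random subsample and that the two sources are simply concatenated and shuffled. How the subset was actually chosen is not stated here. The function name `build_annealing_mix` and the in-memory lists are hypothetical stand-ins for the real datasets.

```python
import random


def build_annealing_mix(phase1_rows, phase2_rows, fraction=0.05, seed=0):
    """Combine a random `fraction` of phase1_rows with all of phase2_rows."""
    rng = random.Random(seed)  # fixed seed so the mixture is reproducible
    k = round(len(phase1_rows) * fraction)
    subset = rng.sample(list(phase1_rows), k)  # sample without replacement
    mix = subset + list(phase2_rows)
    rng.shuffle(mix)  # interleave the two sources
    return mix


# Toy stand-ins for the two datasets.
phase1 = [f"stt-{i}" for i in range(100)]
phase2 = [f"anneal-{i}" for i in range(20)]
mix = build_annealing_mix(phase1, phase2)
```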
Model tree for malaysia-ai/malaysian-whisper-large-v3-turbo
Base model
openai/whisper-large-v3
Finetuned
openai/whisper-large-v3-turbo