Japanese ASR Evaluation Dataset
Japanese ASR
non-profit
AI & ML interests
This repo contains models and datasets for Japanese ASR. See our main model https://huggingface.co./kotoba-tech/kotoba-whisper-v1.0.
Organization Card
Japanese ASR
This repository contains all the models and datasets for train/evaluate the Japanese ASR dataset generated through the process of achieving kotoba-whisper models.
Following table shows CER comparison with different data size of ReazonSpeech used to distill openai/whisper-large-v3. The model names follows
japanese-asr/distil-whisper-large-v3-ja-reazonspeech-{size of reazonspeech}
.
CER
model | CommonVoice 8.0 | JSUT basic5000 | ReazonSpeech Test |
---|---|---|---|
japanese-asr/distil-whisper-large-v3-ja-reazonspeech-all | 9.20 | 8.40 | 11.63 |
japanese-asr/distil-whisper-large-v3-ja-reazonspeech-large | 9.44 | 8.48 | 12.60 |
japanese-asr/distil-whisper-large-v3-ja-reazonspeech-medium | 10.89 | 11.25 | 16.37 |
japanese-asr/distil-whisper-large-v3-ja-reazonspeech-small | 30.48 | 38.96 | 42.29 |
japanese-asr/distil-whisper-large-v3-ja-reazonspeech-tiny | 94.69 | 95.32 | 95.82 |
openai/whisper-large-v3 | 8.52 | 7.18 | 15.18 |
openai/whisper-large-v2 | 9.70 | 8.20 | 28.50 |
openai/whisper-large | 10.00 | 8.90 | 34.40 |
openai/whisper-base | 28.20 | 25.00 | 69.40 |
openai/whisper-medium | 11.34 | 9.87 | 29.56 |
openai/whisper-small | 15.26 | 14.22 | 34.29 |
openai/whisper-tiny | 46.86 | 35.69 | 96.69 |
reazon-research/reazonspeech-nemo-v2 | 9.07 | 7.43 | 11.17 |
Please find more detailed results at kotoba-whisper codebase.
Collections
6
Japanese ASR Models
-
japanese-asr/distil-whisper-large-v3-ja-reazonspeech-all
Updated • 116 • 2 -
japanese-asr/distil-whisper-large-v3-ja-reazonspeech-large
Automatic Speech Recognition • Updated • 54 • 5 -
japanese-asr/distil-whisper-large-v3-ja-reazonspeech-medium
Automatic Speech Recognition • Updated • 24 • 2 -
japanese-asr/distil-whisper-large-v3-ja-reazonspeech-tiny
Automatic Speech Recognition • Updated • 15 • 1
spaces
1
models
5
japanese-asr/distil-whisper-large-v3-ja-reazonspeech-all
Updated
•
116
•
2
japanese-asr/distil-whisper-large-v3-ja-reazonspeech-large
Automatic Speech Recognition
•
Updated
•
54
•
5
japanese-asr/distil-whisper-large-v3-ja-reazonspeech-medium
Automatic Speech Recognition
•
Updated
•
24
•
2
japanese-asr/distil-whisper-large-v3-ja-reazonspeech-small
Automatic Speech Recognition
•
Updated
•
53
•
3
japanese-asr/distil-whisper-large-v3-ja-reazonspeech-tiny
Automatic Speech Recognition
•
Updated
•
15
•
1
datasets
27
japanese-asr/en_asr.esb_eval
Viewer
•
Updated
•
23.9k
japanese-asr/whisper_transcriptions.reazon_speech_all.wer_10.0.vectorized
Viewer
•
Updated
•
5.51M
•
2.18k
japanese-asr/whisper_transcriptions.mls.wer_10.0.vectorized
Viewer
•
Updated
•
7.44M
•
1.28k
•
1
japanese-asr/whisper_transcriptions.reazon_speech_all.wer_10.0
Viewer
•
Updated
•
6.07M
•
277
japanese-asr/whisper_transcriptions.reazon_speech_all
Viewer
•
Updated
•
17.3M
•
458
•
1
japanese-asr/whisper_transcriptions.mls.wer_10.0
Viewer
•
Updated
•
9.33M
•
168
•
1
japanese-asr/whisper_transcriptions.mls
Viewer
•
Updated
•
10.4M
•
293
•
1
japanese-asr/en_asr.mls
Viewer
•
Updated
•
10.4M
•
255
•
2
japanese-asr/whisper_transcriptions.reazonspeech.all.wer_10.0.vectorized
Viewer
•
Updated
•
6.49M
•
880
japanese-asr/whisper_transcriptions.reazonspeech.all
Viewer
•
Updated
•
21.9M
•
10
•
2