MERaLiON
/

MERaLiON-AudioLLM-Whisper-SEA-LION

Automatic Speech Recognition

Model card Files Files and versions Community

YingxuHe commited on Dec 19, 2024

Commit

9be92e3

·

verified ·

1 Parent(s): d641734

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -69,7 +69,7 @@ against three well-known AudioLLMs: `Qwen2-Audio 7B`, `WavLLM`, and `SALMONN`. W
 which feeds the transcriptions recognized by Whisper-large-v2 and the instruction prompts to a Gemma2 9B CPT SEA-LIONv3 Instruct model to
 get the responses. We tuned its hyperparameters and prompt template to optimise performance across
 various speech-to-text tasks. As is shown in the following table, MERaLiON-AudioLLM performs better in the Singapore local context,
-as evidenced by evaluation results on Singapore's [Multitask National Speech Corpus](MERaLiON/MNSC) (MNSC) datasets.
 > [!NOTE]
 > MNSC is a multitask speech understanding dataset derived and further annotated from [IMDA NSC Corpus](https://www.imda.gov.sg/how-we-can-help/national-speech-corpus).

 which feeds the transcriptions recognized by Whisper-large-v2 and the instruction prompts to a Gemma2 9B CPT SEA-LIONv3 Instruct model to
 get the responses. We tuned its hyperparameters and prompt template to optimise performance across
 various speech-to-text tasks. As is shown in the following table, MERaLiON-AudioLLM performs better in the Singapore local context,
+as evidenced by evaluation results on Singapore's [Multitask National Speech Corpus](https://huggingface.co/datasets/MERaLiON/Multitask-National-Speech-Corpus-v1) (MNSC) datasets.
 > [!NOTE]
 > MNSC is a multitask speech understanding dataset derived and further annotated from [IMDA NSC Corpus](https://www.imda.gov.sg/how-we-can-help/national-speech-corpus).