Thaweewat/whisper-th-large-ct2

Thaweewat committed on Dec 26, 2023

Commit 4470859 • 1 Parent(s): 4a86c11

Update README.md

Files changed (1)

README.md +1 -1

@@ -12,7 +12,7 @@ whisper-th-large-ct2 is the CTranslate2 format of [biodatlab/whisper-th-large-co
 - ⚡️ Batched inference for **70x** real-time transcription using Whisper large-v2.
 - 🪶 A faster-whisper backend, requiring **<8GB GPU memory** for large-v2 with beam_size=5.
 - 🎯 Accurate word-level timestamps using wav2vec2 alignment.
-- 👯‍♂️ Multispeaker ASR using speaker diarization from pyannote-audio (includes speaker ID labels).
+- 👯‍♂️ Multispeaker ASR using speaker diarization(includes speaker ID labels).
 - 🗣️ VAD preprocessing, reducing hallucinations and allowing batching with no WER degradation.
 ### Usage