Update README.md
README.md
CHANGED
@@ -12,7 +12,7 @@ whisper-th-large-ct2 is the CTranslate2 format of [biodatlab/whisper-th-large-co
 - ⚡️ Batched inference for **70x** real-time transcription using Whisper large-v2.
 - 🪶 A faster-whisper backend, requiring **<8GB GPU memory** for large-v2 with beam_size=5.
 - 🎯 Accurate word-level timestamps using wav2vec2 alignment.
-- 👯‍♂️ Multispeaker ASR using speaker diarization
+- 👯‍♂️ Multispeaker ASR using speaker diarization(includes speaker ID labels).
 - 🗣️ VAD preprocessing, reducing hallucinations and allowing batching with no WER degradation.
 
 ### Usage