waveletdeboshir
/

whisper-base-ru-pruned

Automatic Speech Recognition

Inference Endpoints

Model card Files Files and versions Community

waveletdeboshir commited on Aug 15

Commit

261e132

•

1 Parent(s): 0a192f7

Update README.md

Files changed (1) hide show

README.md +25 -0

README.md CHANGED Viewed

@@ -29,6 +29,31 @@ Model size is 30%  less then original whisper-base:
 | model file size | 290 Mb | 203 Mb |
 | vocab_size | 51865 | 4705 |
 ## Other pruned whisper models
 * [waveletdeboshir/whisper-tiny-ru-pruned](https://huggingface.co/waveletdeboshir/whisper-tiny-ru-pruned)
 * [waveletdeboshir/whisper-small-ru-pruned](https://huggingface.co/waveletdeboshir/whisper-small-ru-pruned)

 | model file size | 290 Mb | 203 Mb |
 | vocab_size | 51865 | 4705 |
+## Usage
+Model can be used as an original whisper:
+```python
+>>> from transformers import WhisperProcessor, WhisperForConditionalGeneration
+>>> import torchaudio
+>>> # load audio
+>>> wav, sr = torchaudio.load("audio.wav")
+>>> # load model and processor
+>>> processor = WhisperProcessor.from_pretrained("waveletdeboshir/whisper-base-ru-pruned")
+>>> model = WhisperForConditionalGeneration.from_pretrained("waveletdeboshir/whisper-base-ru-pruned")
+>>> input_features = processor(wav[0], sampling_rate=sr, return_tensors="pt").input_features
+>>> # generate token ids
+>>> predicted_ids = model.generate(input_features)
+>>> # decode token ids to text
+>>> transcription = processor.batch_decode(predicted_ids, skip_special_tokens=False)
+['<|startoftranscript|><|ru|><|transcribe|><|notimestamps|> Начинаем работу.<|endoftext|>']
+```
+The context tokens can be removed from the start of the transcription by setting `skip_special_tokens=True`.
 ## Other pruned whisper models
 * [waveletdeboshir/whisper-tiny-ru-pruned](https://huggingface.co/waveletdeboshir/whisper-tiny-ru-pruned)
 * [waveletdeboshir/whisper-small-ru-pruned](https://huggingface.co/waveletdeboshir/whisper-small-ru-pruned)