Macedonian-ASR
/

whisper-large-v3-macedonian-asr

Model card Files Files and versions Community

Porjaz commited on Sep 30

Commit

6b06f03

•

1 Parent(s): fb5a42b

Update README.md

Files changed (1) hide show

README.md +31 -3

README.md CHANGED Viewed

@@ -1,3 +1,31 @@
----
-license: cc-by-4.0
----

+---
+license: cc-by-4.0
+language:
+- mk
+base_model:
+- openai/whisper-large-v3
+---
+# Fine-tuned whisper-large-v3 model for speech recognition in Macedonian
+Authors:
+1. Dejan Porjazovski
+2. Ilina Jakimovska
+3. Ordan Chukaliev
+4. Nikola Stikov
+This collaboration is part of the activities of the Center for Advanced Interdisciplinary Research (CAIR) at UKIM.
+## Data used for training
+In training of the model, we used the following data sources:
+1. Digital Archive for Ethnological and Anthropological Resources (DAEAR) at the Institutе of Ethnology and Anthropology, PMF, UKIM.
+2. Audio version of the international journal "EthnoAnthropoZoom" at the Institutе of Ethnology and Anthropology, PMF, UKIM.
+3. The podcast "Обични луѓе" by Ilina Jakimovska.
+4. The scientific videos from the series "Наука за деца", foundation KANTAROT.
+5. Macedonian version of the Mozilla Common Voice (version 18).
+## Usage
+When using this model, make sure that your speech input is sampled at 16kHz.