Porjaz commited on
Commit
6b06f03
1 Parent(s): fb5a42b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +31 -3
README.md CHANGED
@@ -1,3 +1,31 @@
1
- ---
2
- license: cc-by-4.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: cc-by-4.0
3
+ language:
4
+ - mk
5
+ base_model:
6
+ - openai/whisper-large-v3
7
+ ---
8
+
9
+ # Fine-tuned whisper-large-v3 model for speech recognition in Macedonian
10
+
11
+ Authors:
12
+ 1. Dejan Porjazovski
13
+ 2. Ilina Jakimovska
14
+ 3. Ordan Chukaliev
15
+ 4. Nikola Stikov
16
+
17
+ This collaboration is part of the activities of the Center for Advanced Interdisciplinary Research (CAIR) at UKIM.
18
+
19
+ ## Data used for training
20
+
21
+ In training of the model, we used the following data sources:
22
+ 1. Digital Archive for Ethnological and Anthropological Resources (DAEAR) at the Institutе of Ethnology and Anthropology, PMF, UKIM.
23
+ 2. Audio version of the international journal "EthnoAnthropoZoom" at the Institutе of Ethnology and Anthropology, PMF, UKIM.
24
+ 3. The podcast "Обични луѓе" by Ilina Jakimovska.
25
+ 4. The scientific videos from the series "Наука за деца", foundation KANTAROT.
26
+ 5. Macedonian version of the Mozilla Common Voice (version 18).
27
+
28
+
29
+ ## Usage
30
+
31
+ When using this model, make sure that your speech input is sampled at 16kHz.