Speech + text dataset collection based on the ParlaMint data. Paper describing the construction process: https://www.arxiv.org/abs/2409.15397.
CLASSLA - CLARIN Knowledge Centre for South Slavic Languages
university
AI & ML interests
NLP for South Slavic (and other under-resourced) languages
Recent Activity
Organization Card
The CLARIN Knowledge Centre for South Slavic languages (CLASSLA) offers expertise on language resources and technologies for South Slavic languages.
Its basic activities are:
- giving researchers, students, citizen scientists and other interested parties information on the available resources and technologies via its documentation
- supporting them in producing, modifying or publishing resources and technologies via its helpdesk
- organizing training activities
Collections
1
models
25
classla/wav2vecbert2-filledPause
Audio Classification
•
Updated
•
17
classla/multilingual-IPTC-news-topic-classifier
Text Classification
•
Updated
•
41.6k
•
9
classla/xlm-roberta-base-multilingual-text-genre-classifier
Text Classification
•
Updated
•
366
•
28
classla/wav2vec2-large-slavic-parlaspeech-hr-lm
Automatic Speech Recognition
•
Updated
•
66
•
3
classla/xlm-r-bertic
Fill-Mask
•
Updated
•
40
•
3
classla/xlm-r-slobertic
Fill-Mask
•
Updated
•
37
classla/whisper-large-v3-mici-princ
Automatic Speech Recognition
•
Updated
•
14
•
1
classla/xlm-r-parlasent
Text Classification
•
Updated
•
597
classla/xlm-r-parla
Fill-Mask
•
Updated
•
23
classla/wav2vec2-large-slavic-parlaspeech-hr
Automatic Speech Recognition
•
Updated
•
61
•
2
datasets
21
classla/ParlaSpeech-PL
Viewer
•
Updated
•
531k
•
268
•
1
classla/ParlaSpeech-HR
Viewer
•
Updated
•
868k
•
579
•
1
classla/ParlaSpeech-RS
Viewer
•
Updated
•
278k
•
2.16k
classla/mak_na_konac
Viewer
•
Updated
•
8.46k
•
39
•
1
classla/Mici_Princ
Viewer
•
Updated
•
372
•
29
•
1
classla/ParlaSpeech-CZ
Viewer
•
Updated
•
711k
•
763
•
1
classla/xlm-r-bertic-data
Updated
•
68
•
2
classla/COPA-MK
Viewer
•
Updated
•
1k
•
35
classla/COPA-SR_lat
Viewer
•
Updated
•
1k
•
31
classla/COPA-SR
Viewer
•
Updated
•
1k
•
38