metadata
license: cc-by-nc-sa-4.0
language:
- lb
- de
- fr
- en
- pt
tags:
- STT
- ASR
- audio
- speech recognition
- coqui.ai
datasets:
- mbarnig/lb-STT-CORPUS
The Luxembourgish part of my multilingual automatic speech recognition (ASR) model is the second machine-learning (ML) speech-to-text (STT) model for Luxembourgish. The very first model was published in May 2022 by Prof. Peter Gilles of the University of Luxembourg.
My model was trained from scratch on my customized dataset mbarnig/lb-STT_CORPUS with the deep-learning toolkit 🐸 Coqui-STT (version 1.3.0). It was trained without punctuation, using the following alphabet:
# Each line in this file represents the Unicode codepoint (UTF-8 encoded)
# associated with a numeric index.
# A line that starts with # is a comment. You can escape it with \# if you wish
# to use '#' in the Alphabet.
'abcdefghijklmnopqrstuvwxyz àáâäçèéëîôöûü
# The last (non-comment) line needs to end with a newline.
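
Because the model was trained without punctuation and only on the characters listed above, text compared against its output (for example reference transcripts for evaluation) probably needs to be reduced to the same alphabet first. The following is a minimal Python sketch of such a normalization, not the preprocessing actually used for training. The function name `normalize_transcript`, the mapping of typographic apostrophes, and the choice to replace unsupported characters with a space are all assumptions of this example.

```python
import unicodedata

# Characters copied from the alphabet listed above:
# apostrophe, a-z, space, and the accented letters.
ALPHABET = set("'abcdefghijklmnopqrstuvwxyz àáâäçèéëîôöûü")


def normalize_transcript(text: str) -> str:
    """Lowercase the text and drop every character outside ALPHABET.

    Hypothetical helper for illustration only.
    """
    # NFC composes sequences such as 'e' + combining acute into one codepoint.
    text = unicodedata.normalize("NFC", text).lower()
    # Assumption: typographic apostrophes are mapped to the plain apostrophe.
    text = text.replace("\u2019", "'")
    # Replace unsupported characters with a space so words stay separated.
    kept = "".join(ch if ch in ALPHABET else " " for ch in text)
    # Collapse the runs of spaces left behind by removed punctuation.
    return " ".join(kept.split())


if __name__ == "__main__":
    print(normalize_transcript("Moien, wéi geet et?"))  # moien wéi geet et
```

Digits and other symbols are simply discarded here; spelling numbers out as words would be a separate step that this sketch does not attempt.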