---
license: apache-2.0
tags:
- Tensorflow
- Spoken Language Identification
language:
- en
- zh
metrics:
- accuracy
pipeline_tag: audio-classification
---

# Spoken_language_identification

## Model description

This is a spoken language recognition model trained on a private dataset using TensorFlow.
The model uses the CRNN-Attention architecture, which has previously been used to extract utterance-level feature representations.

The system is trained on recordings sampled at 16 kHz, single channel, with 16-bit signed integer PCM encoding.

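Since the model was trained on 16 kHz, single-channel, 16-bit PCM audio, it may help to confirm that an input file matches that format before running inference. The following is a minimal sketch using only Python's standard-library `wave` module. The helper name `check_wav_format` and the constants are hypothetical and not part of this model's code, and how the model behaves on other formats is not stated in this card.

```python
import wave

# Expected input format, taken from the model description.
EXPECTED_SAMPLE_RATE = 16000  # Hz
EXPECTED_CHANNELS = 1         # single channel (mono)
EXPECTED_SAMPLE_WIDTH = 2     # bytes per sample, i.e. 16-bit PCM


def check_wav_format(path):
    """Return a list of format mismatches for a WAV file.

    Hypothetical helper for illustration only. An empty list means the
    file matches the 16 kHz / mono / 16-bit format described above.
    """
    problems = []
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        channels = wav.getnchannels()
        width = wav.getsampwidth()

    if rate != EXPECTED_SAMPLE_RATE:
        problems.append(f"sample rate is {rate} Hz, expected {EXPECTED_SAMPLE_RATE} Hz")
    if channels != EXPECTED_CHANNELS:
        problems.append(f"{channels} channels, expected {EXPECTED_CHANNELS}")
    if width != EXPECTED_SAMPLE_WIDTH:
        problems.append(f"{8 * width}-bit samples, expected {8 * EXPECTED_SAMPLE_WIDTH}-bit")
    return problems
```

A file that reports mismatches would likely need to be resampled or downmixed with an audio tool of your choice before being passed to the model.
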
The model can classify a speech utterance according to the language spoken.
It covers 13 languages: Chinese, English, French, German, Indonesian, Italian, Japanese, Korean, Portuguese, Russian, Spanish, Turkish, and Vietnamese.

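To turn a classifier's output into a language name, one option is to take the index of the highest score and look it up in a label list. The sketch below assumes the output is a vector of 13 scores whose order matches the alphabetical language list in this card. That ordering, the `LANGUAGES` constant, and the `top_language` helper are assumptions for illustration and should be checked against the model's actual label mapping.

```python
# Assumed label order (alphabetical, as listed in this card). This is NOT
# confirmed by the model card; verify it against the real label mapping.
LANGUAGES = [
    "chinese", "english", "french", "german", "indonesian", "italian",
    "japanese", "korean", "portuguese", "russian", "spanish", "turkish",
    "vietnamese",
]


def top_language(scores):
    """Return (label, score) for the highest-scoring class.

    Hypothetical helper: `scores` is any sequence of 13 numbers, for
    example one row of model output converted to a Python list.
    """
    if len(scores) != len(LANGUAGES):
        raise ValueError(
            f"expected {len(LANGUAGES)} scores, got {len(scores)}"
        )
    # Index of the largest score (argmax) without external dependencies.
    best = max(range(len(scores)), key=lambda i: scores[i])
    return LANGUAGES[best], scores[best]
```

For example, a score vector whose largest entry sits at index 6 would map to `"japanese"` under the assumed ordering.
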
## Intended uses & Limitations

#### How to use

```python
from huggingface_hub import from_pretrained_keras

# Download the model from the Hugging Face Hub and load it as a Keras model.
model = from_pretrained_keras("SpeechFlow/spoken_language_identification")
```