---
license: apache-2.0
tags:
- Tensorflow
- Spoken Language Identification
language:
- en
- zh
metrics:
- accuracy
pipeline_tag: audio-classification
---
# Spoken_language_identification
## Model description
This is a spoken language identification model trained on a private dataset using TensorFlow.
The model uses the CRNN-Attention architecture, which has previously been used to extract utterance-level feature representations.
The system is trained on single-channel recordings sampled at 16 kHz with 16-bit signed integer PCM encoding.
The model can classify a speech utterance according to the language spoken.
It covers 13 languages: Chinese, English, French, German, Indonesian, Italian, Japanese, Korean, Portuguese, Russian, Spanish, Turkish, and Vietnamese.
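
Because the model was trained on 16 kHz, single-channel, 16-bit PCM recordings, it may help to confirm that an input file matches that format before classifying it. The following is a minimal sketch using only Python's standard `wave` module; the helper name `check_wav_format` is hypothetical and not part of this repository, and whether the model tolerates other formats is not stated here.

```python
import wave

# Training format described in this model card.
EXPECTED_RATE = 16000       # 16 kHz sampling rate
EXPECTED_CHANNELS = 1       # single channel (mono)
EXPECTED_SAMPLE_WIDTH = 2   # 16-bit PCM = 2 bytes per sample


def check_wav_format(path):
    """Return a list of mismatches between a WAV file and the training format.

    Hypothetical helper for illustration; an empty list means the file matches.
    """
    problems = []
    with wave.open(path, "rb") as wav:
        if wav.getframerate() != EXPECTED_RATE:
            problems.append(f"sample rate is {wav.getframerate()} Hz")
        if wav.getnchannels() != EXPECTED_CHANNELS:
            problems.append(f"file has {wav.getnchannels()} channels")
        if wav.getsampwidth() != EXPECTED_SAMPLE_WIDTH:
            problems.append(f"sample width is {wav.getsampwidth() * 8} bits")
    return problems
```

A file that reports mismatches would need to be resampled or downmixed with an audio tool of your choice before use.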
## Intended uses & Limitations
#### How to use
```python
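# Requires the `huggingface_hub` package and TensorFlow to be installed.
from huggingface_hub import from_pretrained_keras

# Download the model from the Hugging Face Hub and load it as a Keras model.
model = from_pretrained_keras("SpeechFlow/spoken_language_identification")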
```
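
Once the model returns a prediction, the scores need to be mapped back to a language name. The sketch below assumes the model outputs one score per language and that the scores follow the order of the language list in the model description; neither the output shape nor the label order is documented here, so verify both against the actual model before relying on this. The names `LANGUAGES` and `top_language` are hypothetical.

```python
# ASSUMPTION: label order matches the list in the model description.
# This order is not confirmed by the model card.
LANGUAGES = [
    "chinese", "english", "french", "german", "indonesian", "italian",
    "japanese", "korean", "portuguese", "russian", "spanish", "turkish",
    "vietnamese",
]


def top_language(scores):
    """Return (language, score) for the highest-scoring entry.

    `scores` is a sequence of 13 numbers, one per language.
    """
    if len(scores) != len(LANGUAGES):
        raise ValueError(
            f"expected {len(LANGUAGES)} scores, got {len(scores)}"
        )
    # Index of the largest score (first one wins on ties).
    best = max(range(len(scores)), key=lambda i: scores[i])
    return LANGUAGES[best], scores[best]
```

For a batch of predictions, the same function can be applied to each row of the output in turn.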