emmazhong's picture
Update README.md
a72b78f
|
raw
history blame
963 Bytes
metadata
license: apache-2.0
tags:
  - Tensorflow
  - Spoken Language Identification
language:
  - en
  - zh
metrics:
  - accuracy
pipeline_tag: audio-classification

Spoken_language_identification

Model description

This is a spoken language recognition model trained on private dataset using Tensorflow. the model uses the CRNN-Attention architecture that has previously been used for extracting utterance-level feature representations.

The system is trained with recordings sampled at 16kHz, single channel, and 16-bit Signed Integer PCM encoding.

The model can classify a speech utterance according to the language spoken. It covers 13 different languages( chinese english french german indonesian italian japanese korean portuguese russian spanish turkish vietnamese )

Intended uses & Limitations

How to use


from huggingface_hub import from_pretrained_keras

model = from_pretrained_keras("SpeechFlow/spoken_language_identification")