Neurai
/

NeuraSpeech_900h

Automatic Speech Recognition

Model card Files Files and versions Community

NeuraSpeech_900h / README.md

Neura's picture

Update README.md

052eb30 verified 6 months ago

|

1.37 kB

	---
	library_name: Nvidia Nemo
	license: apache-2.0
	language:
	- fa
	pipeline_tag: automatic-speech-recognition
	tags:
	- Persian
	- Neura
	- PersianASR
	datasets:
	- common_voice_17_0
	---
	# Neura Speech Nemo

	<p align="center">
	<img src="neura_speech.png" width=512 height=256 />
	</p>

	<!-- Provide a quick summary of what the model is/does. -->

	## Model Description

	<!-- Provide a longer summary of what this model is. -->

	- Developed by: Neura company
	- Funded by: Neura
	- Model type: fa_FastConformers_Transducer
	- Language(s) (NLP): Persian

	## Model Architecture

	The model features an improved Conformer architecture from
	[Fast Conformer with Linearly Scalable Attention for Efficient
	Speech Recognition](https://arxiv.org/abs/2305.05084).

	## Uses

	make sure these packages are installed:
	```
	!pip install nemo_toolkit['all']
	```
	```python
	import nemo
	print('nemo', nemo.__version__)
	import numpy as np
	import nemo.collections.asr as nemo_asr

	asr_model = nemo_asr.models.EncDecRNNTBPEModel.from_pretrained(model_name="Neurai/NeuraSpeech_900h")
	asr_model.transcribe(paths2audio_files=['persian_audio.wav', ], batch_size=1)[0]

	```
	trascribed text :
	```
	او خواهان آزاد کردن بردگان بود
	```


	## More Information
	https://neura.info

	## Model Card Authors
	Esmaeil Zahedi, Mohsen Yazdinejad

	## Model Card Contact
	[email protected]