---
license: apache-2.0
datasets:
- librispeech_asr
language:
- en
metrics:
- wer
pipeline_tag: automatic-speech-recognition
tags:
- asr
- rescoring
- rnn-t
- gpt2
- nemo
- lstm
- kenlm
---

This data is used in the project https://github.com/Alexander92-cpu/LanguageModel_Fusion

Data description:

- 'asr/stt_en_conformer_transducer_small.nemo' - pre-trained NeMo ASR RNN-T model (https://catalog.ngc.nvidia.com/orgs/nvidia/teams/nemo/models/stt_en_conformer_transducer_small);

- 'gpt2' - GPT-2 language model fine-tuned for rescoring (https://huggingface.co/docs/transformers/model_doc/gpt2#transformers.GPT2LMHeadModel);

- 'kenlm/4_ngram_output.bin' - 4-gram language model;

- 'lstm' - word-level LSTM language model trained from scratch, with the corresponding tokenizer;

- 'text' - text data used for training, validation, and testing;

- 'optimize' - data and results of optimization experiments.
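
The layout listed above can be checked locally before running any experiments. The following is a minimal sketch, assuming the data has already been downloaded into a single root directory. The relative paths come from the list above; the names `ARTIFACTS`, `resolve_artifacts`, and `missing_artifacts`, and the choice of root directory, are hypothetical helpers for illustration and are not part of the project.

```python
from pathlib import Path

# Relative paths copied from the data description above.
# The dictionary keys are illustrative labels, not names used by the project.
ARTIFACTS = {
    "asr": "asr/stt_en_conformer_transducer_small.nemo",
    "gpt2": "gpt2",
    "kenlm": "kenlm/4_ngram_output.bin",
    "lstm": "lstm",
    "text": "text",
    "optimize": "optimize",
}


def resolve_artifacts(root):
    """Map each artifact label to its absolute path under `root`."""
    base = Path(root).resolve()
    return {name: base / rel for name, rel in ARTIFACTS.items()}


def missing_artifacts(root):
    """Return the sorted labels of artifacts that do not exist under `root`."""
    paths = resolve_artifacts(root)
    return sorted(name for name, path in paths.items() if not path.exists())


if __name__ == "__main__":
    # "." is a placeholder (assumption); point it at the downloaded data.
    for name in missing_artifacts("."):
        print(f"missing: {name}")
```

Once the paths resolve, the individual artifacts would typically be loaded with their own libraries, for example `kenlm.Model(path)` for the 4-gram model or `GPT2LMHeadModel.from_pretrained(directory)` for the GPT-2 directory. How the project itself loads them is not stated here, so treat those calls as one plausible option.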