Wav2Vec 2.0
A collection for the first release of Wav2Vec 2.0, a speech encoder that learns powerful representations from unlabelled audio data.
Automatic Speech Recognition • Updated • 1.21M • 139Note The Wav2Vec 2.0 "large" model pre-trained on 53k hours of un-labelled audio data from the LibriSpeech and LibriVox (LV) corpora, and fine-tuned on 960 hours of LibriSpeech ASR data. This is the most performant Wav2Vec 2.0 checkpoint from the initial release, obtaining 1.9/3.9% WER on the LibriSpeech test clean/other subsets respectively.
facebook/wav2vec2-large-960h
Automatic Speech Recognition • Updated • 76.8k • 27Note The Wav2Vec 2.0 "large" model pre-trained and fine-tuned on 960 hours of LibriSpeech ASR data.
facebook/wav2vec2-base-960h
Automatic Speech Recognition • Updated • 1.7M • 309Note The Wav2Vec 2.0 "base" model pre-trained and fine-tuned on 960 hours of LibriSpeech ASR data.
facebook/wav2vec2-base-100h
Automatic Speech Recognition • Updated • 3.01k • 6Note The Wav2Vec 2.0 "base" model pre-trained on 960 hours of un-labelled LibriSpeech ASR data, and fine-tuned on 100 hours of labelled LibriSpeech ASR data.
facebook/wav2vec2-large-lv60
Updated • 13.9k • 6Note The Wav2Vec 2.0 "large" model pre-trained on 53k hours of un-labelled data from the LibriSpeech and LibriVox (LV) corpora.
facebook/wav2vec2-large
Updated • 5.8k • 4Note The Wav2Vec 2.0 "large" model pre-trained on 960 hours of un-labelled LibriSpeech ASR data.
facebook/wav2vec2-base
Updated • 1.56M • 88Note The Wav2Vec 2.0 "base" model pre-trained on 960 hours of un-labelled LibriSpeech ASR data.
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Paper • 2006.11477 • Published • 5Note The wav2vec 2.0 paper, accepted to NeurIPS 2020.