---
license: cc-by-4.0
---
Download page: https://research.google.com/ava/download.html

# AVA Speech Dataset

The AVA-Speech dataset annotates speech activity for the movie clips in the AVA v1.0 dataset.
It explicitly labels three background noise conditions (Clean Speech, Speech with background Music, and Speech with background Noise), resulting in roughly 40K labeled segments spanning 40 hours of data. Please visit the project page for more details on the dataset.


This dataset contains the videos' audio tracks and their Speech/Noise labels.
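
As a rough illustration of how segment labels like these might be summarized, here is a minimal sketch. It assumes each label row has the form `video_id,start_seconds,end_seconds,label` and that the label strings are `NO_SPEECH`, `CLEAN_SPEECH`, `SPEECH_WITH_MUSIC`, and `SPEECH_WITH_NOISE`. Neither the layout nor the strings are confirmed by this card, and the sample rows are invented, so check the actual files before relying on it.

```python
import csv
import io
from collections import defaultdict

# Invented sample rows for illustration only. The column layout
# (video_id, start, end, label) and the label strings are assumptions.
SAMPLE_CSV = """\
vid_a,0.0,2.5,NO_SPEECH
vid_a,2.5,6.0,CLEAN_SPEECH
vid_a,6.0,9.0,SPEECH_WITH_MUSIC
vid_b,0.0,4.0,SPEECH_WITH_NOISE
"""


def is_speech(label):
    """Collapse the assumed fine-grained labels into speech / no speech."""
    return label != "NO_SPEECH"


def total_duration_per_label(csv_text):
    """Sum segment durations in seconds for each label string."""
    totals = defaultdict(float)
    for row in csv.reader(io.StringIO(csv_text)):
        if not row:  # skip blank lines
            continue
        _video_id, start, end, label = row
        totals[label] += float(end) - float(start)
    return dict(totals)


totals = total_duration_per_label(SAMPLE_CSV)
speech_seconds = sum(d for label, d in totals.items() if is_speech(label))
```

Here `totals` maps each label to its accumulated duration, and `speech_seconds` adds up every segment that contains speech regardless of background condition.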