---
license: cc-by-4.0
---

https://research.google.com/ava/download.html

# AVA Speech Dataset

The AVA-Speech dataset annotates speech activity for the movie clips in the AVA v1.0 dataset. It explicitly labels three background noise conditions (Clean Speech, Speech with background Music, and Speech with background Noise), resulting in ~40K labeled segments spanning 40 hours of data. Please visit the project page for more details on the dataset.

This dataset contains the videos, the audio, and their Speech/Noise labels.
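
As a rough illustration of how segment-level labels like these might be summarized, here is a minimal sketch that totals the labeled duration per condition. It assumes each label row has the form `clip_id,start_seconds,end_seconds,label`; that layout, the label strings (including `NO_SPEECH`), and the sample rows are all hypothetical and are not taken from this card, so check the project page for the actual format before relying on it.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample rows. The column layout and label strings are
# assumptions for illustration, not the dataset's documented format.
SAMPLE_LABELS = """\
clip_a,0.00,4.50,NO_SPEECH
clip_a,4.50,10.00,CLEAN_SPEECH
clip_a,10.00,12.25,SPEECH_WITH_MUSIC
clip_b,0.00,3.00,SPEECH_WITH_NOISE
clip_b,3.00,8.00,CLEAN_SPEECH
"""


def seconds_per_label(csv_text):
    """Sum segment durations (end - start) for each label."""
    totals = defaultdict(float)
    for _clip_id, start, end, label in csv.reader(io.StringIO(csv_text)):
        totals[label] += float(end) - float(start)
    return dict(totals)


totals = seconds_per_label(SAMPLE_LABELS)
for label, seconds in sorted(totals.items()):
    print(f"{label}: {seconds:.2f} s")
```

To run the same tally on a real label file, read the file's contents and pass the text to `seconds_per_label`, adjusting the column order if the actual layout differs.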