Improved Long-Form Speech Recognition by Jointly Modeling the Primary and Non-primary Speakers Paper • 2312.11123 • Published Dec 18, 2023
CVSS Corpus and Massively Multilingual Speech-to-Speech Translation Paper • 2201.03713 • Published Jan 11, 2022
SpeakerStew: Scaling to Many Languages with a Triaged Multilingual Text-Dependent and Text-Independent Speaker Verification System Paper • 2104.02125 • Published Apr 5, 2021
Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech Paper • 2202.12163 • Published Feb 24, 2022
DiarizationLM: Speaker Diarization Post-Processing with Large Language Models Paper • 2401.03506 • Published Jan 7, 2024 • 13
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech Paper • 1911.01601 • Published Nov 5, 2019
Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis Paper • 1806.04558 • Published Jun 12, 2018