Masked Generative Video-to-Audio Transformers with Enhanced Synchronicity Paper • 2407.10387 • Published Jul 15, 2024
Pushing the Limits of Zero-shot End-to-End Speech Translation Paper • 2402.10422 • Published Feb 16, 2024
Speech Translation with Foundation Models and Optimal Transport: UPC at IWSLT23 Paper • 2306.01327 • Published Jun 2, 2023
Explaining How Transformers Use Context to Build Predictions Paper • 2305.12535 • Published May 21, 2023
SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations Paper • 2212.09699 • Published Dec 19, 2022
Efficient Speech Translation with Dynamic Latent Perceivers Paper • 2210.16264 • Published Oct 28, 2022
SHAS: Approaching optimal Segmentation for End-to-End Speech Translation Paper • 2202.04774 • Published Feb 9, 2022
End-to-End Speech Translation with Pre-trained Models and Adapters: UPC at IWSLT 2021 Paper • 2105.04512 • Published May 10, 2021