MinMo: A Multimodal Large Language Model for Seamless Voice Interaction Paper • 2501.06282 • Published about 1 month ago
CosyVoice 2: Scalable Streaming Speech Synthesis with Large Language Models Paper • 2412.10117 • Published Dec 13, 2024