Safeguard Fine-Tuned LLMs Through Pre- and Post-Tuning Model Merging Paper • 2412.19512 • Published Dec 27, 2024 • 8
The CHiME-7 Challenge: System Description and Performance of NeMo Team's DASR System Paper • 2310.12378 • Published Oct 18, 2023
Unified model for code-switching speech recognition and language identification based on a concatenated tokenizer Paper • 2306.08753 • Published Jun 14, 2023 • 1
Enhancing Speaker Diarization with Large Language Models: A Contextual Beam Search Approach Paper • 2309.05248 • Published Sep 11, 2023
ESPnet-EZ: Python-only ESPnet for Easy Fine-tuning and Integration Paper • 2409.09506 • Published Sep 14, 2024 • 4
DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging Paper • 2407.01470 • Published Jul 1, 2024 • 5
DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging Paper • 2407.01470 • Published Jul 1, 2024 • 5
ML-SUPERB: Multilingual Speech Universal PERformance Benchmark Paper • 2305.10615 • Published May 18, 2023 • 1
Joint Prediction and Denoising for Large-scale Multilingual Self-supervised Learning Paper • 2309.15317 • Published Sep 26, 2023
Reproducing Whisper-Style Training Using an Open-Source Toolkit and Publicly Available Data Paper • 2309.13876 • Published Sep 25, 2023 • 1
Improving Massively Multilingual ASR With Auxiliary CTC Objectives Paper • 2302.12829 • Published Feb 24, 2023
OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer Paper • 2401.16658 • Published Jan 30, 2024 • 14
OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer Paper • 2401.16658 • Published Jan 30, 2024 • 14