Dynamic-SUPERB

community

Recent Activity

Rezzsl updated a dataset about 1 month ago

DynamicSuperb/SuperbOODAsrSpon_CHIME6-Test

Rezzsl updated a dataset about 1 month ago

DynamicSuperb/SuperbER_RAVDESS

DynamicSuperb's activity

Rezzsl

updated a dataset 13 days ago

DynamicSuperb/HEARMusicGenreClassification_ISMIR04

Viewer • Updated 13 days ago • 200 • 28

hungyilee

authored a paper about 1 month ago

Safeguard Fine-Tuned LLMs Through Pre- and Post-Tuning Model Merging

Paper • 2412.19512 • Published Dec 27, 2024 • 8

kunaldhawan

authored 3 papers 3 months ago

The CHiME-7 Challenge: System Description and Performance of NeMo Team's DASR System

Paper • 2310.12378 • Published Oct 18, 2023

Unified model for code-switching speech recognition and language identification based on a concatenated tokenizer

Paper • 2306.08753 • Published Jun 14, 2023 • 1

Enhancing Speaker Diarization with Large Language Models: A Contextual Beam Search Approach

Paper • 2309.05248 • Published Sep 11, 2023

jhansss

authored a paper 5 months ago

ESPnet-EZ: Python-only ESPnet for Easy Fine-tuning and Integration

Paper • 2409.09506 • Published Sep 14, 2024 • 4

lca0503

authored a paper 7 months ago

DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging

Paper • 2407.01470 • Published Jul 1, 2024 • 5

hungyilee

authored a paper 7 months ago

DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging

Paper • 2407.01470 • Published Jul 1, 2024 • 5

wanchichen

authored 6 papers 7 months ago

ML-SUPERB: Multilingual Speech Universal PERformance Benchmark

Paper • 2305.10615 • Published May 18, 2023 • 1

Joint Prediction and Denoising for Large-scale Multilingual Self-supervised Learning

Paper • 2309.15317 • Published Sep 26, 2023

Reproducing Whisper-Style Training Using an Open-Source Toolkit and Publicly Available Data

Paper • 2309.13876 • Published Sep 25, 2023 • 1

Improving Massively Multilingual ASR With Auxiliary CTC Objectives

Paper • 2302.12829 • Published Feb 24, 2023

OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer

Paper • 2401.16658 • Published Jan 30, 2024 • 14

YODAS: Youtube-Oriented Dataset for Audio and Speech

Paper • 2406.00899 • Published Jun 2, 2024 • 2

juice500

authored a paper about 1 year ago

OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer

Paper • 2401.16658 • Published Jan 30, 2024 • 14

NTUVictor

updated a dataset about 1 year ago

DynamicSuperb/SourceSeparation_libri2Mix_test

Viewer • Updated Dec 29, 2023 • 2k • 48 • 1

kuanhuggingface

updated 4 datasets about 1 year ago

DynamicSuperb/VoiceConversion_VCTK

Viewer • Updated Nov 30, 2023 • 2k • 39

DynamicSuperb/Text2Speech_LibriTTS-TestOther

Viewer • Updated Nov 30, 2023 • 4.89k • 29

DynamicSuperb/Text2Speech_LibriTTS-TestClean

Viewer • Updated Nov 30, 2023 • 4.92k • 36

DynamicSuperb/Text2Speech_LJSpeech

Viewer • Updated Nov 30, 2023 • 13.1k • 66