microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition β’ Updated about 16 hours ago β’ 7.35k β’ 501
OWLS: Scaling Laws for Speech Recognition and Translation Collection π¦ A suite of Whisper-style models from 250M to 18B parameters. Trained on up to 360K hours of data. β’ 6 items β’ Updated 3 days ago β’ 3
Open Whisper-style Speech Models (OWSM) Collection Fully open Whisper-style speech foundation models developed by CMU WAVLab: https://www.wavlab.org/activities/2024/owsm/ β’ 15 items β’ Updated 22 days ago β’ 5
Slamming: Training a Speech Language Model on One GPU in a Day Paper β’ 2502.15814 β’ Published 9 days ago β’ 56
Running 1.78k 1.78k The Ultra-Scale Playbook π The ultimate guide to training LLM on large GPU Clusters
Presumed Cultural Identity: How Names Shape LLM Responses Paper β’ 2502.11995 β’ Published 11 days ago β’ 10