Optimize seamlessM4T medium model for faster performance
#7
by
sanjitaa
- opened
I am trying to use seamlessm4t medium model on my project for speech to text translation. But I want the model to respond/predict faster. It is taking too much long time for it. What can be the best ideas for it?
Can someone help me with it ?
@sanjitaa what task are you working on, is it speech-to-speech? if so, I recommend using the v2 model here (it's 3x faster than large-v1)
The v2 model is better (more accurate in terms of ASR-BLEU and fatser) see this table from the paper:
These are averages across directions, but if you have a particular translation direction in mind, check the Tables 69-71 / pages 120-122 in the appendix here