Spaces:

openai
/

whisper

Running on L4

App Files Files Community

125

Try to do code-switching. It seems perfect.

#45

by ryL - opened Nov 6, 2022

Discussion

ryL

Nov 6, 2022

Input audio:

Transcription:

I think I am trying to speak a three kind of language simultaneously.我又講中文、又講英文。Sometimes I can speak 日本語。日本語は話せできます。おはようございます。さようなら。

madrage57

Mar 13, 2023

•

edited Mar 13, 2023

How did you bypass the singular language detection? It seems it is selecting a single language for each audio clip when I use it.

sanchit-gandhi

Mar 17, 2023

It seems it is selecting a single language for each audio clip when I use it

This is the expected behaviour! Not sure how @ryL got multiple language outputs?

soheilasoheilasoheila

Apr 11, 2023

could you please share your code? @ryL

HOw could you get multiple languages in your transcriptions?

remS

Apr 24, 2023

I would also be very interested in knowing how @ryL got it to work for a single audio clip!

odusseys

Aug 3, 2023

For anyone stumbling here - I made it work using an additional prompt:

You are a professional transcriber, fluent in language1 and language2.
You are listening to a recording in which a person is potentially speaking both language1 and language2, and no other languages.
They may be speaking only one of these languages. They may have a strong accent.
You are to transcribe utterances of each language accordingly.

yashsandansing

Jan 15

@odusseys It seems like you used Whisper in GPT, did you?
@ryL could you share the code that got you code-switching output in Whisper? Or maybe what approach you took for the same?

syq163

Mar 1

Using whisperx directly could get the the same transcription, almost perfect.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment