Spaces:

distil-whisper
/

hallucination-analysis

Running

sanchit-gandhi commited on Oct 24, 2023

Commit

d515eda

1 Parent(s): 941081c

update description

Files changed (1) hide show

app.py CHANGED Viewed

@@ -176,11 +176,18 @@ if __name__ == "__main__":
             """
         )
         gr.Markdown(
-            "Analyse the transcriptions generated by the Whisper and Distil-Whisper models on the TEDLIUM dev set. "
-            "Analysis is performed on the overall level, where statistics are computed over the entire dev set, and also a per-sample level. "
-            "The transcriptions for both models are shown at the bottom of the demo. The text diff for each is computed "
-            "relative to the target transcriptions, where insertions are displayed in <span style='background-color:Lightgreen'>green</span>, and "
-            "deletions in <span style='background-color:#FFCCCB'><s>red</s></span>."
         )
         gr.Markdown("**Overall statistics:**")
         table = gr.Dataframe(

             """
         )
         gr.Markdown(
+            """
+            Analyse the transcriptions generated by the Whisper and Distil-Whisper models on the TED-LIUM dev set.
+            Analysis is performed on the overall level, where statistics are computed over the entire dev set, and also a per-sample level.
+            The transcriptions for both models are shown at the bottom of the demo. The text diff for each is computed
+            relative to the target transcriptions, where insertions are displayed in <span style='background-color:Lightgreen'>green</span>, and
+            deletions in <span style='background-color:#FFCCCB'><s>red</s></span>.
+            To quantify the amount of repetition and hallucination in the predicted transcriptions, we measure the number
+            of repeated 5-gram word duplicates (5-Dup.) and the insertion error rate (IER). Overall, Distil-Whisper has
+            roughly half the number of 5-Dup. and IER. This indicates that it has a lower propensity to hallucinate
+            compared to the Whisper model.
+            """
         )
         gr.Markdown("**Overall statistics:**")
         table = gr.Dataframe(