chatbot-blog / app.py
merve's picture
merve HF staff
Update app.py
5adb4c6
raw
history blame
3.49 kB
import streamlit as st
from transformers import pipeline
st.header("Ways to Improve Your Conversational Agents using πŸ€— Hugging Face")
st.write("There are many ways to improve your conversational agents using language models. In this blog post, I want to go through couple of know-hows I've learnt during my time as a machine learning engineer making chatbots for living.")
st.subheader("Data Augmentation with Generative Models ✨")
st.write("There are cases where you will not be allowed to keep data, you will have to start from scratch or you will have very little amount of data. We'll go over two use cases and see how to tackle them.")
st.write("Imagine you're making a chatbot that will answer very general questions about emergency situations at home.")
st.write("If you have very little amount of data, you could actually augment it through language models. There are regex based tools you can use but they tend to create bias due to repetitive patterns, so it's better to use language models for this case. A good model to use is a generative model fine-tuned on Quora Question Pairs dataset. This dataset consists of question pairs that are paraphrase of one another, and T5 can generate a paraphrased question given a source question.")
st.write("Try it yourself here πŸ‘‡πŸ»")
generator = pipeline("text2text-generation", model = "mrm8488/t5-small-finetuned-quora-for-paraphrasing")
default_value = "How can I put out grease fire?"
sent = st.text_area("Input", default_value, height = 10)
outputs = generator(sent)
st.write("Paraphrased Example:")
st.write(outputs[0]["generated_text"])
st.subheader("Add Personas to Your Conversational Agent using GPT-2")
st.subheader("Multilingual Models using Translation Models")
st.write("Scaling your chatbot across different languages is expensive and cumbersome. There are couple of ways on how to make your chatbot speak a different language. You can either translate the intent classification data and responses and train another model and deploy it,, or you can put translation models at two ends. There are advantages and disadvantages in both approaches. For the first one, you can assess the performance of the model and hand your responses to a native speaker to have more control over what your bot says, but it requires more resources compared to second one. For the second one, assume that you're making a chatbot that is in English and want to have another language, say, German. You need two models, from German to English and from English to German.")
st.image("./chatbot.png")
st.write("Your English intent classification model will be between these two models, your German to English model will translate the input to English and the output will go through the intent classification model, which will classify intent and select appropriate response (which is currently in English). The response will be translated back to German, which you can do in advance and do proofreading with a native speaker or directly pass it to a from English to German language model. For this use case, I highly recommend specific translation models instead of using sequence-to-sequence multilingual models like T5. ")
model_checkpoint = "Helsinki-NLP/opus-mt-en-fr"
translator = pipeline("translation", model=model_checkpoint)
sent = st.text_area("Input", default_value, height = 10)
outputs = translator(sent)
st.write("Translated Example:")
translated_text = translator("How are you?")
st.write(outputs[0]["translation_text"])