Locutusque
/

gpt2-medium-conversational

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Locutusque commited on May 20, 2023

Commit

fa7bc74

•

1 Parent(s): 2bfe150

Update README.md

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -32,9 +32,9 @@ The model architecture used in this model is GPT-2, a transformer-based language
 ## Evaluation Metrics
 The model is evaluated based on several metrics, including loss, reward, penalty, BLEU score, and perplexity. The loss metric is calculated during training and reflects the difference between the predicted output and the actual output. The reward metric is based on the number of correct words generated by the model, while the penalty metric penalizes the model for repeating words consecutively. The BLEU score measures the similarity between the generated text and the ground truth text, while the perplexity metric measures how well the model is able to predict the next word in a sequence. During validation, the model achieved the following metrics:
-- BLEU score: 11.4
-- perplexity: 125
-- loss: 3.8
 ## Limitations and Bias
 This model is not suitable for all use cases due to its limited training time on a weak computer. As a result, it may produce irrelevant or nonsensical responses. Additionally, it has not been fine-tuned to remember the chat history, is unable to provide follow-up responses, and it does not know the answer to many questions (it was only fine-tuned to respond in a conversational way). For optimal performance, we recommend using a GPU with at least 10GB of VRAM and downloading the model manually instead of using the Transformers library. Here's how you should deploy the model:

 ## Evaluation Metrics
 The model is evaluated based on several metrics, including loss, reward, penalty, BLEU score, and perplexity. The loss metric is calculated during training and reflects the difference between the predicted output and the actual output. The reward metric is based on the number of correct words generated by the model, while the penalty metric penalizes the model for repeating words consecutively. The BLEU score measures the similarity between the generated text and the ground truth text, while the perplexity metric measures how well the model is able to predict the next word in a sequence. During validation, the model achieved the following metrics:
+- BLEU score: 9.7
+- perplexity: 5
+- loss: 1.2
 ## Limitations and Bias
 This model is not suitable for all use cases due to its limited training time on a weak computer. As a result, it may produce irrelevant or nonsensical responses. Additionally, it has not been fine-tuned to remember the chat history, is unable to provide follow-up responses, and it does not know the answer to many questions (it was only fine-tuned to respond in a conversational way). For optimal performance, we recommend using a GPU with at least 10GB of VRAM and downloading the model manually instead of using the Transformers library. Here's how you should deploy the model: