Update README.md
README.md CHANGED
@@ -35,7 +35,7 @@ A 12-layer, 768-hidden-size transformer-based language model.
 # Training
 The model was trained on the Vietnamese Oscar dataset (32 GB) to optimize a traditional language modelling objective on a v3-8 TPU for around 6 days. It reaches around 13.4 perplexity on a chosen validation set from Oscar.
 
-### GPT-2
+### GPT-2 Finetuning
 
 The following example fine-tunes GPT-2 on WikiText-2. We're using the raw WikiText-2 (no tokens were replaced before
 the tokenization). The loss here is that of causal language modeling.
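
The example the renamed heading points to (fine-tuning GPT-2 on raw WikiText-2 with a causal language-modelling loss) is not included in this hunk. As a rough, illustrative sketch only, the block below sets up that kind of run with the Hugging Face `Trainer`; the block size, batch size, epoch count, and output path are assumptions rather than values from the model card, and the last line just shows that perplexity (the ~13.4 figure quoted above) is the exponential of the evaluation loss.

```python
import math

from datasets import load_dataset
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

# Load GPT-2 and the raw WikiText-2 split (no token replacement, as in the README).
tokenizer = AutoTokenizer.from_pretrained("gpt2")
tokenizer.pad_token = tokenizer.eos_token  # GPT-2 has no pad token by default
model = AutoModelForCausalLM.from_pretrained("gpt2")
raw = load_dataset("wikitext", "wikitext-2-raw-v1")

block_size = 512  # assumed context length for grouping; GPT-2 accepts up to 1024


def tokenize(batch):
    return tokenizer(batch["text"])


def group_texts(examples):
    # Concatenate everything, then cut into fixed-size blocks so each block is a
    # complete training example for the causal LM objective.
    concatenated = {k: sum(examples[k], []) for k in examples}
    total_len = (len(concatenated["input_ids"]) // block_size) * block_size
    return {
        k: [v[i : i + block_size] for i in range(0, total_len, block_size)]
        for k, v in concatenated.items()
    }


tokenized = raw.map(tokenize, batched=True, remove_columns=["text"])
lm_datasets = tokenized.map(group_texts, batched=True)

trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="gpt2-wikitext2",    # assumed output path
        per_device_train_batch_size=8,  # assumed; adjust to your hardware
        num_train_epochs=3,             # assumed
    ),
    train_dataset=lm_datasets["train"],
    eval_dataset=lm_datasets["validation"],
    # mlm=False gives the causal language-modelling loss described above.
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)

trainer.train()
eval_metrics = trainer.evaluate()
# Perplexity is the exponential of the evaluation loss.
print(f"perplexity = {math.exp(eval_metrics['eval_loss']):.2f}")
```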