dom-tokenizer-10k / special_tokens_map.json
gbenson
Configure [BOS] and [EOS] properly
7bc4f29
310 Bytes
{
  "additional_special_tokens": [
    "[BOS]",
    "[EOS]",
    "[TAG]",
    "[ATTR]",
    "[COMMENT]",
    "[BASE64]",
    "[LONG]"
  ],
  "bos_token": "[BOS]",
  "cls_token": "[CLS]",
  "eos_token": "[EOS]",
  "mask_token": "[MASK]",
  "pad_token": "[PAD]",
  "sep_token": "[SEP]",
  "unk_token": "[UNK]"
}
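
The map above can be sanity-checked with a few lines of Python. This is a minimal sketch that assumes the file is read as plain JSON. `collect_special_tokens` is a hypothetical helper written for illustration, not part of any tokenizer library. It lists every distinct special token in file order, which makes it easy to see that "[BOS]" and "[EOS]" appear both under "additional_special_tokens" and as the "bos_token" and "eos_token" values.

```python
import json

# Contents of special_tokens_map.json, copied from the file above.
SPECIAL_TOKENS_MAP = """
{
  "additional_special_tokens": [
    "[BOS]", "[EOS]", "[TAG]", "[ATTR]", "[COMMENT]", "[BASE64]", "[LONG]"
  ],
  "bos_token": "[BOS]",
  "cls_token": "[CLS]",
  "eos_token": "[EOS]",
  "mask_token": "[MASK]",
  "pad_token": "[PAD]",
  "sep_token": "[SEP]",
  "unk_token": "[UNK]"
}
"""


def collect_special_tokens(mapping):
    """Hypothetical helper: return each special token once, in file order."""
    seen = []
    for value in mapping.values():
        # A value is either a single token string or a list of token strings.
        tokens = value if isinstance(value, list) else [value]
        for token in tokens:
            if token not in seen:
                seen.append(token)
    return seen


mapping = json.loads(SPECIAL_TOKENS_MAP)
tokens = collect_special_tokens(mapping)

print(len(tokens), "distinct special tokens")
for token in tokens:
    print(token)
```

Deduplicating matters here because the two sentence-boundary tokens are declared twice. A consumer that simply concatenated every value would count them double.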