dom-tokenizer-100k / special_tokens_map.json
gbenson's picture
dom-tokenizer-10k's big sister
80bd862
raw
history blame contribute delete
310 Bytes
{
"additional_special_tokens": [
"[BOS]",
"[EOS]",
"[TAG]",
"[ATTR]",
"[COMMENT]",
"[BASE64]",
"[LONG]"
],
"bos_token": "[BOS]",
"cls_token": "[CLS]",
"eos_token": "[EOS]",
"mask_token": "[MASK]",
"pad_token": "[PAD]",
"sep_token": "[SEP]",
"unk_token": "[UNK]"
}