HamzaSidhu786 commited on
Commit
cd87244
1 Parent(s): 4bb2f4f

Upload tokenizer

Browse files
added_tokens.json CHANGED
@@ -1,4 +1,6 @@
1
  {
2
- "</s>": 22,
3
- "<s>": 21
 
 
4
  }
 
1
  {
2
+ "</s>": 19383,
3
+ "<pad>": 19385,
4
+ "<s>": 19382,
5
+ "<unk>": 19384
6
  }
special_tokens_map.json CHANGED
@@ -1,6 +1,6 @@
1
  {
2
  "bos_token": "<s>",
3
  "eos_token": "</s>",
4
- "pad_token": "[PAD]",
5
- "unk_token": "[UNK]"
6
  }
 
1
  {
2
  "bos_token": "<s>",
3
  "eos_token": "</s>",
4
+ "pad_token": "<pad>",
5
+ "unk_token": "<unk>"
6
  }
tokenizer_config.json CHANGED
The diff for this file is too large to render. See raw diff
 
vocab.json CHANGED
The diff for this file is too large to render. See raw diff