Xenova HF staff commited on
Commit
e455a4b
·
verified ·
1 Parent(s): 1bb3101

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +29 -3
README.md CHANGED
@@ -1,3 +1,29 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ library_name: transformers
4
+ tags:
5
+ - transformers.js
6
+ - tokenizers
7
+ ---
8
+
9
+ # GPT-4 Tokenizer
10
+
11
+ A 🤗-compatible version of the **GPT-4o tokenizer** (adapted from [openai/tiktoken](https://github.com/openai/tiktoken)). This means it can be used with Hugging Face libraries including [Transformers](https://github.com/huggingface/transformers), [Tokenizers](https://github.com/huggingface/tokenizers), and [Transformers.js](https://github.com/xenova/transformers.js).
12
+
13
+ ## Example usage:
14
+
15
+ ### Transformers/Tokenizers
16
+ ```py
17
+ from transformers import GPT2TokenizerFast
18
+
19
+ tokenizer = GPT2TokenizerFast.from_pretrained('Xenova/gpt-4o')
20
+ assert tokenizer.encode('hello world') == [24912, 2375]
21
+ ```
22
+
23
+ ### Transformers.js
24
+ ```js
25
+ import { AutoTokenizer } from '@xenova/transformers';
26
+
27
+ const tokenizer = await AutoTokenizer.from_pretrained('Xenova/gpt-4');
28
+ const tokens = tokenizer.encode('hello world'); // [24912, 2375]
29
+ ```