cmcmaster commited on
Commit
002888e
·
verified ·
1 Parent(s): 092391e

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +42 -0
README.md ADDED
@@ -0,0 +1,42 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: unsloth/Llama-3.2-3B
3
+ language:
4
+ - en
5
+ library_name: transformers
6
+ license: llama3.2
7
+ tags:
8
+ - llama-3
9
+ - llama
10
+ - meta
11
+ - facebook
12
+ - unsloth
13
+ - transformers
14
+ - mlx
15
+ - mlx-my-repo
16
+ ---
17
+
18
+ # cmcmaster/Llama-3.2-3B-Q4-mlx
19
+
20
+ The Model [cmcmaster/Llama-3.2-3B-Q4-mlx](https://huggingface.co/cmcmaster/Llama-3.2-3B-Q4-mlx) was converted to MLX format from [unsloth/Llama-3.2-3B](https://huggingface.co/unsloth/Llama-3.2-3B) using mlx-lm version **0.20.5**.
21
+
22
+ ## Use with mlx
23
+
24
+ ```bash
25
+ pip install mlx-lm
26
+ ```
27
+
28
+ ```python
29
+ from mlx_lm import load, generate
30
+
31
+ model, tokenizer = load("cmcmaster/Llama-3.2-3B-Q4-mlx")
32
+
33
+ prompt="hello"
34
+
35
+ if hasattr(tokenizer, "apply_chat_template") and tokenizer.chat_template is not None:
36
+ messages = [{"role": "user", "content": prompt}]
37
+ prompt = tokenizer.apply_chat_template(
38
+ messages, tokenize=False, add_generation_prompt=True
39
+ )
40
+
41
+ response = generate(model, tokenizer, prompt=prompt, verbose=True)
42
+ ```