---
base_model: unsloth/Llama-3.2-1B-Instruct
language:
- en
library_name: transformers
license: llama3.2
tags:
- llama-3
- llama
- meta
- facebook
- unsloth
- transformers
- mlx
---
# hensam92/wk-llama3.2-1b
The model [hensam92/wk-llama3.2-1b](https://huggingface.co/hensam92/wk-llama3.2-1b) was converted to MLX format from [unsloth/Llama-3.2-1B-Instruct](https://huggingface.co/unsloth/Llama-3.2-1B-Instruct) using mlx-lm version **0.18.2**.
## Use with mlx
```bash
pip install mlx-lm
```
```python
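from mlx_lm import load, generate

# Download (if needed) and load the MLX weights and tokenizer from the Hub.
model, tokenizer = load("hensam92/wk-llama3.2-1b")

prompt = "hello"

# Wrap the prompt in the model's chat template when the tokenizer provides one.
if hasattr(tokenizer, "apply_chat_template") and tokenizer.chat_template is not None:
    messages = [{"role": "user", "content": prompt}]
    prompt = tokenizer.apply_chat_template(
        messages, tokenize=False, add_generation_prompt=True
    )

# verbose=True prints the generated text as it is produced.
response = generate(model, tokenizer, prompt=prompt, verbose=True)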
```
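The chat-template step in the snippet above takes a list of `role`/`content` messages. As a minimal sketch, assuming you want to add a system prompt or earlier turns, the list could be built like this. `build_messages` is a hypothetical helper and not part of mlx-lm, and whether the `system` role is honored depends on the chat template shipped with the tokenizer.

```python
from typing import Dict, List, Optional, Tuple


def build_messages(
    user_prompt: str,
    system_prompt: Optional[str] = None,
    history: Optional[List[Tuple[str, str]]] = None,
) -> List[Dict[str, str]]:
    """Build a role/content message list for tokenizer.apply_chat_template.

    Hypothetical helper for illustration only; it is not an mlx-lm API.
    """
    messages: List[Dict[str, str]] = []
    # Optional system prompt goes first (assumes the template accepts it).
    if system_prompt:
        messages.append({"role": "system", "content": system_prompt})
    # Earlier turns, given as (user_text, assistant_text) pairs.
    for user_text, assistant_text in history or []:
        messages.append({"role": "user", "content": user_text})
        messages.append({"role": "assistant", "content": assistant_text})
    # The new user turn goes last so the model responds to it.
    messages.append({"role": "user", "content": user_prompt})
    return messages


messages = build_messages("hello", system_prompt="You are a concise assistant.")
print(messages)
```

The resulting `messages` list can then be passed to `tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)` in place of the single-message list used above.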