metadata
license: other
license_name: qwen-research
license_link: https://huggingface.co/Qwen/Qwen2.5-Coder-3B-Instruct/blob/main/LICENSE
language:
- en
base_model: Qwen/Qwen2.5-Coder-3B-Instruct
pipeline_tag: text-generation
library_name: transformers
tags:
- code
- codeqwen
- chat
- qwen
- qwen-coder
- mlx
moot20/Qwen2.5-Coder-3B-Instruct-MLX-4bits
The model moot20/Qwen2.5-Coder-3B-Instruct-MLX-4bits was converted to MLX format from Qwen/Qwen2.5-Coder-3B-Instruct using mlx-lm version 0.21.1.
Use with mlx
pip install mlx-lm
from mlx_lm import load, generate

# Download (if needed) and load the converted weights and tokenizer
model, tokenizer = load("moot20/Qwen2.5-Coder-3B-Instruct-MLX-4bits")

prompt = "hello"

# Wrap the prompt in the model's chat template when one is defined
if tokenizer.chat_template is not None:
    messages = [{"role": "user", "content": prompt}]
    prompt = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True
    )

response = generate(model, tokenizer, prompt=prompt, verbose=True)
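
The snippet above sends a single user message. As a minimal sketch of one way to extend it, the helpers below add an optional system message and a token limit. The names build_messages and ask are hypothetical (they are not part of mlx-lm), the max_tokens value is only an illustrative choice, and the sketch assumes the tokenizer defines a chat template.

```python
from typing import Dict, List, Optional


def build_messages(
    user_prompt: str, system_prompt: Optional[str] = None
) -> List[Dict[str, str]]:
    """Return a chat message list in the role/content format used above."""
    messages = []
    if system_prompt:
        # Optional system message placed before the user turn
        messages.append({"role": "system", "content": system_prompt})
    messages.append({"role": "user", "content": user_prompt})
    return messages


def ask(
    user_prompt: str,
    system_prompt: Optional[str] = None,
    max_tokens: int = 256,  # illustrative value, not a recommendation
) -> str:
    """Load the model and generate one reply (hypothetical helper)."""
    # Imported lazily so build_messages works without mlx-lm installed
    from mlx_lm import load, generate

    model, tokenizer = load("moot20/Qwen2.5-Coder-3B-Instruct-MLX-4bits")
    # Assumes tokenizer.chat_template is defined for this model
    prompt = tokenizer.apply_chat_template(
        build_messages(user_prompt, system_prompt),
        add_generation_prompt=True,
    )
    return generate(model, tokenizer, prompt=prompt, max_tokens=max_tokens)
```

With mlx-lm installed, a call such as ask("Write a function that reverses a string.", system_prompt="You are a coding assistant.") would return the generated text as a string. Loading the model inside ask keeps the sketch short; for repeated calls it would be more efficient to load once and reuse the model and tokenizer.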