Commit 400e233
Parent(s): 66378ee

Update README.md

README.md CHANGED
@@ -1,3 +1,59 @@
 ---
+pipeline_tag: text-generation
+inference: false
+tags:
+- notus
+- mlx
+language:
+- en
 license: mit
+library_name: mlx
 ---
+
+# Notus 7B v1
+
+Notus is a collection of fine-tuned models using Direct Preference Optimization (DPO) and related RLHF techniques following a data-first approach. This model is the first version, fine-tuned with DPO on top of [`alignment-handbook/zephyr-7b-sft-full`](https://huggingface.co/alignment-handbook/zephyr-7b-sft-full), which is the SFT model produced to create [`HuggingFaceH4/zephyr-7b-beta`](https://huggingface.co/HuggingFaceH4/zephyr-7b-beta).
+
+More information at [argilla/notus-7b-v1](https://huggingface.co/argilla/notus-7b-v1)
+
+This repository contains the `notus-7b-v1` weights in `npz` format suitable for use with Apple's MLX framework.
+
+## Use with MLX
+
+```bash
+pip install mlx
+pip install huggingface_hub hf_transfer
+git clone https://github.com/ml-explore/mlx-examples.git
+cd mlx-examples
+
+# Download model
+export HF_HUB_ENABLE_HF_TRANSFER=1
+huggingface-cli download --local-dir-use-symlinks False --local-dir notus-7b-v1 mlx-community/notus-7b-v1
+
+# Run example
+python llms/mistral/mistral.py --model-path notus-7b-v1 --prompt "My name is"
+```
+
+Please, refer to the [original model card](https://huggingface.co/argilla/notus-7b-v1) for more details on Notus 7B v1.
+
+## Prompt Format
+
+Please note that this model expects a specific prompt structure.
+
+```
+<|system|>
+{system_prompt}</s>
+<|user|>
+{user_prompt}</s>
+<|assistant|>
+```
+
+Here is an example:
+
+```
+<|system|>
+You are a pirate chatbot who always responds with Arr!</s>
+<|user|>
+There's a llama on my lawn, how can I get rid of him?</s>
+<|assistant|>
+```
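
The prompt structure added to the README in this commit can be filled in programmatically. The following is a minimal sketch, not code from the commit or from MLX: `build_prompt` is a hypothetical helper name, and it assumes each template line, including the final `<|assistant|>` line, ends with a single newline.

```python
def build_prompt(system_prompt: str, user_prompt: str) -> str:
    """Fill the system/user/assistant template shown in the README.

    Hypothetical helper for illustration; the newline after each
    template line is an assumption about the expected layout.
    """
    return (
        "<|system|>\n"
        f"{system_prompt}</s>\n"
        "<|user|>\n"
        f"{user_prompt}</s>\n"
        "<|assistant|>\n"
    )


if __name__ == "__main__":
    # Reproduce the example from the README's "Prompt Format" section.
    prompt = build_prompt(
        "You are a pirate chatbot who always responds with Arr!",
        "There's a llama on my lawn, how can I get rid of him?",
    )
    print(prompt)
```

The resulting string could then be passed as the `--prompt` argument of the example script; how that script tokenizes the `</s>` markers is not stated in the README, so treat this as a starting point rather than a verified recipe.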