Update README.md
README.md
CHANGED
@@ -15,7 +15,7 @@ model-index:
 
 This model is a fine-tuned version of [HuggingFaceH4/zephyr-7b-gemma-sft-v0.1](https://huggingface.co/HuggingFaceH4/zephyr-7b-gemma-sft-v0.1) on the argilla/dpo-mix-7k dataset.
 
-This model is from the paper ["Discovering Preference Optimization Algorithms with and for Large Language Models"](https://
+This model is from the paper ["Discovering Preference Optimization Algorithms with and for Large Language Models"](https://arxiv.org/abs/2406.08414)
 
 Read the [blog post on it here!](https://sakana.ai/)
 