delayedkarma committed
Commit 44177ad
1 Parent(s): 874e8ca
Update README.md
README.md CHANGED
@@ -13,10 +13,8 @@ tags:
 - dpo
 - rlhf
 datasets:
-- mlabonne/chatml_dpo_pairs
 - Intel/orca_dpo_pairs
 base_model: teknium/OpenHermes-2.5-Mistral-7B
-
 ---
 ### Credits: Maxime Labonne https://towardsdatascience.com/fine-tune-a-mistral-7b-model-with-direct-preference-optimization-708042745aac
 