Update README.md
README.md
CHANGED
@@ -11,10 +11,9 @@ license: cc-by-nc-4.0
 # kuno-kunoichi-v1-DPO-v2-SLERP-7B
 
 kuno-kunoichi-v1-DPO-v2-SLERP-7B is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
-I'm hoping that the result is more robust against errors, as the two models likely implement comparable reasoning at least somewhat differently.
-
-I've performed some testing with ChatML format prompting using temperature=1.1 and minP=0.03.
+I'm hoping that the result is more robust against errors or when merging due to "denseness", as the two models likely implement comparable reasoning at least somewhat differently.
 
+I've performed some testing with ChatML format prompting using temperature=1.1 and minP=0.03. The model also supports Alpaca-format prompts.
 ## Merge Details
 ### Merge Method
 
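The added line mentions testing with ChatML-format prompting. As a reference, here is a minimal sketch of how a ChatML prompt string is typically assembled (the `chatml_prompt` helper and its message texts are illustrative; sampling parameters like temperature=1.1 and minP=0.03 are set in the inference backend, not in the prompt itself):

```python
def chatml_prompt(system: str, user: str) -> str:
    """Build a single-turn ChatML-format prompt string.

    ChatML wraps each message in <|im_start|>{role} ... <|im_end|>
    markers; the trailing open assistant turn cues the model to
    generate its reply.
    """
    return (
        f"<|im_start|>system\n{system}<|im_end|>\n"
        f"<|im_start|>user\n{user}<|im_end|>\n"
        "<|im_start|>assistant\n"
    )

prompt = chatml_prompt(
    "You are a helpful assistant.",
    "Summarize SLERP model merging in one sentence.",
)
```

The same helper pattern could be adapted for the Alpaca format by swapping the role markers for `### Instruction:` / `### Response:` sections.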
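The merge method named in the model name is SLERP (spherical linear interpolation). A rough sketch of the underlying operation on a pair of weight tensors, under the simplifying assumption that each tensor is flattened to a vector and interpolated along the great-circle arc between them (this is an illustration of the math, not mergekit's actual implementation):

```python
import numpy as np

def slerp(t: float, a: np.ndarray, b: np.ndarray, eps: float = 1e-8) -> np.ndarray:
    """Spherical linear interpolation between two same-shaped tensors.

    t=0 returns a, t=1 returns b; intermediate t blends along the arc
    between the two (flattened) direction vectors. Falls back to plain
    linear interpolation when the vectors are nearly parallel.
    """
    a_flat, b_flat = a.ravel(), b.ravel()
    a_dir = a_flat / (np.linalg.norm(a_flat) + eps)
    b_dir = b_flat / (np.linalg.norm(b_flat) + eps)
    dot = np.clip(np.dot(a_dir, b_dir), -1.0, 1.0)
    omega = np.arccos(dot)  # angle between the two directions
    if np.abs(np.sin(omega)) < eps:
        # Nearly parallel: SLERP degenerates to LERP
        return (1.0 - t) * a + t * b
    so = np.sin(omega)
    out = (np.sin((1.0 - t) * omega) / so) * a_flat + (np.sin(t * omega) / so) * b_flat
    return out.reshape(a.shape)

# Blend two toy "weight matrices" halfway between the endpoints
w1 = np.array([[1.0, 0.0], [0.0, 1.0]])
w2 = np.array([[0.0, 1.0], [1.0, 0.0]])
merged = slerp(0.5, w1, w2)
```

In a real mergekit run this interpolation is driven by a YAML config that selects the `slerp` merge method and per-layer `t` values for the two source models.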