johannhartmann
commited on
Commit
•
1cea6b4
1
Parent(s):
1dfc973
Update README.md
Browse files
README.md
CHANGED
@@ -21,8 +21,8 @@ base_model:
|
|
21 |
Some of the best german models with 7b parameters as lasered dpo-trained dare_ties merge.
|
22 |
|
23 |
Since the original models based on mistral - three of them on the brilliant german LeoLM/leo-mistral-hessianai-7b - they are reunited in this merged model.
|
24 |
-
Hence the name. To improve result quality they are dpo-trained with a german translation of
|
25 |
-
After that this model got a [laserRMT](https://github.com/cognitivecomputations/laserRMT) treatment.
|
26 |
|
27 |
Wiedervereinigung-7b itself is a [LazyMergekit](https://colab.research.google.com/drive/1obulZ1ROXHjYLn6PPZJwRR6GzgQogxxb?usp=sharing) merge of:
|
28 |
* [DiscoResearch/DiscoLM_German_7b_v1](https://huggingface.co/DiscoResearch/DiscoLM_German_7b_v1)
|
@@ -63,7 +63,7 @@ dtype: bfloat16
|
|
63 |
|
64 |
## mt-bench-de
|
65 |
|
66 |
-
Using laser and dpo results
|
67 |
|
68 |
```json
|
69 |
{
|
|
|
21 |
Some of the best german models with 7b parameters as lasered dpo-trained dare_ties merge.
|
22 |
|
23 |
Since the original models based on mistral - three of them on the brilliant german LeoLM/leo-mistral-hessianai-7b - they are reunited in this merged model.
|
24 |
+
Hence the name, no right wing or nationalistic ideas involved :-). To improve the result quality they are dpo-trained with a german translation of intel-orca-dpo using our german fork of [LLaMA-Factory](https://github.com/mayflower/LLaMA-Factory).
|
25 |
+
After that this model got a [laserRMT](https://github.com/cognitivecomputations/laserRMT) treatment with german datasets.
|
26 |
|
27 |
Wiedervereinigung-7b itself is a [LazyMergekit](https://colab.research.google.com/drive/1obulZ1ROXHjYLn6PPZJwRR6GzgQogxxb?usp=sharing) merge of:
|
28 |
* [DiscoResearch/DiscoLM_German_7b_v1](https://huggingface.co/DiscoResearch/DiscoLM_German_7b_v1)
|
|
|
63 |
|
64 |
## mt-bench-de
|
65 |
|
66 |
+
Using laser and dpo results seems to help.
|
67 |
|
68 |
```json
|
69 |
{
|