maciek-pioro committed e94af70 (1 parent: c4eb3b4)

Update README.md

README.md CHANGED
@@ -17,7 +17,7 @@ language:
 <!-- Provide a quick summary of what the model is/does. -->
 
 Mixtral-8x7B-v0.1-pl is a [Mixtral 8x7b](https://huggingface.co/mistralai/Mixtral-8x7B-v0.1) model fine-tuned using 2.2B Polish
-tokens selected from the [SpeakLeash](https://speakleash.org/).
+tokens selected from the [SpeakLeash](https://speakleash.org/) dataset.
 This is, to our knowledge, the first open-weights MoE model fine-tuned on Polish data.
 In order to preserve English capabilities, we include about 600M tokens from the [RedPajama dataset](https://huggingface.co/datasets/togethercomputer/RedPajama-Data-1T).
 
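Since the README summary describes a causal LM hosted on the Hub, a minimal loading sketch with Hugging Face `transformers` may help; the repo id below is an assumption (the commit page does not state where the model is published), and half precision plus `device_map="auto"` are practical defaults for a large MoE checkpoint, not settings taken from this commit.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumed repo id; replace with the actual Hub path of the model.
model_id = "maciek-pioro/Mixtral-8x7B-v0.1-pl"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # MoE weights are large; half precision reduces memory
    device_map="auto",           # shard across available GPUs (requires accelerate)
)

# Polish prompt, since the model was fine-tuned on Polish data.
prompt = "Polska to kraj, w którym"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=50)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```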