Update README.md
inference: false
---

# Model Card for TinyMixtral-x8-Clonebase-7b

This model is based on [TinyLlama-1.1B](https://huggingface.co/TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T), converted to a Mistral model, with clones of it then placed into a Mixtral model.

**This model was created experimentally for training a small Mixtral.**

**Without training, the performance of this model is the same as TinyLlama's.**

This model was created with experts=8, but since it is a clone, you can create as many experts as you need.

[tinyllama_to_mixtral_clonebase.ipynb](https://huggingface.co/mmnga/TinyMixtral-x8-Clonebase-7b/blob/main/notebook/tinyllama_to_mixtral_clonebase.ipynb)
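The cloning step performed by the notebook above can be illustrated as a state-dict key remapping: each dense Mistral FFN tensor is duplicated into every Mixtral expert. This is only a sketch — the `w1`/`w2`/`w3` key names are an assumption based on the Hugging Face transformers naming for Mistral/Mixtral, and the MoE router (`block_sparse_moe.gate`), which has no Mistral counterpart, is not covered; the linked notebook is authoritative.

```python
# Sketch of the "clonebase" idea: every Mixtral expert receives a copy of
# the dense Mistral FFN weights. Assumed transformers naming:
#   mlp.gate_proj -> block_sparse_moe.experts.{e}.w1
#   mlp.down_proj -> block_sparse_moe.experts.{e}.w2
#   mlp.up_proj   -> block_sparse_moe.experts.{e}.w3
FFN_MAP = {"gate_proj": "w1", "down_proj": "w2", "up_proj": "w3"}

def clone_ffn_key(mistral_key: str, num_experts: int = 8) -> list:
    """Return the Mixtral state-dict keys that receive a copy of this tensor."""
    for src, dst in FFN_MAP.items():
        token = f".mlp.{src}."
        if token in mistral_key:
            prefix, suffix = mistral_key.split(token)
            return [
                f"{prefix}.block_sparse_moe.experts.{e}.{dst}.{suffix}"
                for e in range(num_experts)
            ]
    # non-FFN tensors (attention, norms, embeddings) are copied unchanged
    return [mistral_key]
```

Because the experts are exact clones, the same mapping works for any `num_experts`, which is why the card notes you can create as many experts as you need.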

# History revision

[main TinyLlama-1.1B-intermediate-step-1431k-3T](https://huggingface.co/mmnga/TinyMixtral-x8-Clonebase-7b)

[old TinyLlama-1.1B-intermediate-step-1195k-token-2.5T](https://huggingface.co/mmnga/TinyMixtral-x8-Clonebase-7b/tree/TinyLlama-1.1B-intermediate-step-1195k-token-2.5T)

# Usage

~~~sh
pip install transformers --upgrade
~~~
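Once transformers is installed, inference can be run with the standard `AutoModelForCausalLM`/`AutoTokenizer` API. The following is a hedged sketch, not the card's own recipe — only the model id comes from this page; the generation parameters are illustrative, and running it downloads the checkpoint:

```python
# Illustrative inference sketch for this checkpoint (assumption: the standard
# transformers causal-LM API; the model id is taken from this model card).
MODEL_ID = "mmnga/TinyMixtral-x8-Clonebase-7b"

def generate(prompt: str, max_new_tokens: int = 64) -> str:
    # imported lazily so the module can be inspected without transformers
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID)
    inputs = tokenizer(prompt, return_tensors="pt")
    outputs = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(outputs[0], skip_special_tokens=True)

if __name__ == "__main__":
    print(generate("TinyMixtral is"))
```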