Update README.md
README.md
CHANGED
@@ -104,7 +104,7 @@ special_tokens:
 
 Tess-3-7B is a finetuned version of the Mistral-7B-v0.3 base model. This version is the first phase of the final Tess-3 model, and have been trained with supervised fine-tuning (SFT) on a curated dataset of ~500K samples. The total SFT dataset contains about 1B tokens.
 
-
+This model has 32K context length.
 
 
 # Sample code to run inference