BSC-LT
/

salamandra-7b-instruct

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

mapama247 commited on Sep 30, 2024

Commit

e57d827

·

verified ·

1 Parent(s): 5137c6e

Update README.md

Files changed (1) hide show

README.md +0 -4

README.md CHANGED Viewed

@@ -189,8 +189,6 @@ Using this template, each turn is preceded by a `<|im_start|>` delimiter and the
 ## Data
-## Data
 ### Pretraining Data
 The training corpus consists of 2.4 trillion tokens, including 35 European languages and 92 programming languages. It amounts to a total of 33TB of pre-processed text.
@@ -591,8 +589,6 @@ The dataset does not allow for external contributions.
 </details>
----
 ### Finetuning Data
 This instruction-tuned variant has been trained with a mixture of 276k English, Spanish, and Catalan multi-turn instructions gathered from open datasets:

 ## Data
 ### Pretraining Data
 The training corpus consists of 2.4 trillion tokens, including 35 European languages and 92 programming languages. It amounts to a total of 33TB of pre-processed text.
 </details>
 ### Finetuning Data
 This instruction-tuned variant has been trained with a mixture of 276k English, Spanish, and Catalan multi-turn instructions gathered from open datasets: