Update README.md
README.md
@@ -35,8 +35,8 @@ base_model:
 # Model Card for Teuken-7B-instruct-v0.4
 
 
-Teuken-7B-base-v0.4 is a 7B parameter multilingual large language model (LLM) pre-trained with 4T tokens within the research project OpenGPT-X.
-Teuken-7B-instruct-v0.4 is an instruction-tuned version of Teuken-7B-base-v0.4.
+[Teuken-7B-base-v0.4](https://huggingface.co/openGPT-X/Teuken-7B-base-v0.4) is a 7B parameter multilingual large language model (LLM) pre-trained with 4T tokens within the research project OpenGPT-X.
+Teuken-7B-instruct-v0.4 is an instruction-tuned version of [Teuken-7B-base-v0.4](https://huggingface.co/openGPT-X/Teuken-7B-base-v0.4).
 
 
 ### Model Description
@@ -69,7 +69,7 @@ The model is not intended for use in math and coding tasks.
 
 <!-- This section is meant to convey both technical and sociotechnical limitations. -->
 
-Teuken-7B-instruct-v0.4 is an instruction-tuned version of Teuken-7B-base-v0.4 that is not completely free from biases and hallucinations.
+Teuken-7B-instruct-v0.4 is an instruction-tuned version of [Teuken-7B-base-v0.4](https://huggingface.co/openGPT-X/Teuken-7B-base-v0.4) that is not completely free from biases and hallucinations.
 
 ## How to Get Started with the Model
 
@@ -135,7 +135,7 @@ This example demonstrates how to load the model and tokenizer, prepare input, ge
 
 <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
 
-Teuken-7B-base-v0.4 was pre-trained on 4 trillion tokens of data from publicly available sources.
+[Teuken-7B-base-v0.4](https://huggingface.co/openGPT-X/Teuken-7B-base-v0.4) was pre-trained on 4 trillion tokens of data from publicly available sources.
 The pretraining data has a cutoff of September 2023.
 More information is available in our [preprint](http://arxiv.org/abs/2410.08800).
 
@@ -177,7 +177,7 @@ More information is available in our [preprint](http://arxiv.org/abs/2410.08800
 ### Training Procedure
 
 <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
-Instruction fined tuned version of Teuken-7B-base-v0.4.
+Instruction fine-tuned version of [Teuken-7B-base-v0.4](https://huggingface.co/openGPT-X/Teuken-7B-base-v0.4).
 
 
 #### Training Hyperparameters
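The `@@ -135,7 +135,7 @@` hunk header quotes the card's "How to Get Started with the Model" example, which loads the model and tokenizer, prepares input, and generates text. That example is not part of this diff, so the sketch below only illustrates what that flow typically looks like with the standard `transformers` API. The repository id, the `User` role name, and the `"EN"` chat-template key are assumptions for illustration, not values confirmed by this commit.

```python
# Minimal sketch of the usage flow the hunk header refers to.
# The repository id, role name, and "EN" chat-template key are assumptions;
# check the model card on Hugging Face for the exact, supported values.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "openGPT-X/Teuken-7B-instruct-v0.4"  # assumed repository id

tokenizer = AutoTokenizer.from_pretrained(
    model_id,
    use_fast=False,
    trust_remote_code=True,  # typically needed if the checkpoint ships custom code
)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    trust_remote_code=True,
).eval()

# Prepare a chat-style prompt and generate a completion.
messages = [{"role": "User", "content": "What is the OpenGPT-X project?"}]
input_ids = tokenizer.apply_chat_template(
    messages,
    chat_template="EN",  # assumed per-language template name
    add_generation_prompt=True,
    return_tensors="pt",
)
output = model.generate(input_ids, max_new_tokens=256, do_sample=True, top_p=0.95)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```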