ferran-espuna committed
Commit 82e48cd
1 Parent(s): fe619a0
Update README.md
README.md CHANGED
@@ -63,7 +63,23 @@ This model card corresponds to the fp8-quantized version of Salamandra-2b.
 The entire Salamandra family is released under a permissive [Apache 2.0 license]((https://www.apache.org/licenses/LICENSE-2.0)).
 
 
-
+The following example code works under ``Python 3.9.16``, ``vllm==0.6.3.post1``, ``torch==2.4.0`` and ``torchvision==0.19.0``, though it should run on
+any current version of the libraries. This is an example of how to create a text completion using the model:
+
+```
+from vllm import LLM, SamplingParams
+
+model_name = "BSC-LT/salamandra-2b-base-fp8"
+llm = LLM(model=model_name)
+
+outputs = llm.generate("El mercat del barri ",
+                       sampling_params=SamplingParams(
+                           temperature=0.5,
+                           max_tokens=200)
+                       )
+print(outputs[0].outputs[0].text)
+
+```
 
 ### Author
 International Business Machines (IBM).
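As a usage note on the snippet added in this commit: vLLM's `LLM.generate` also accepts a list of prompts, so several completions can be produced in one call. The sketch below is an illustrative extension of the README example, not part of the commit itself; the extra prompt and the loop over the outputs are assumptions added only for demonstration.

```
# Illustrative sketch (not part of the commit): batched completions with the
# same vLLM API used in the README example above.
from vllm import LLM, SamplingParams

model_name = "BSC-LT/salamandra-2b-base-fp8"
llm = LLM(model=model_name)

# A list of prompts is batched internally; vLLM returns one RequestOutput per prompt.
prompts = [
    "El mercat del barri ",
    "La ciutat de Barcelona ",  # extra prompt added only for illustration
]

sampling_params = SamplingParams(temperature=0.5, max_tokens=200)

outputs = llm.generate(prompts, sampling_params=sampling_params)
for output in outputs:
    print(output.outputs[0].text)
```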