catallama
/

CataLlama-v0.2-Instruct-SFT

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

laurentiubp commited on Jul 13

Commit

bb7ac06

•

1 Parent(s): 9aca0ab

Update README.md

Files changed (1) hide show

README.md +6 -1

README.md CHANGED Viewed

@@ -41,13 +41,18 @@ The model shows improved proficiency with the Catalan language while performing
 - *Summarization - both short form and long form*
 - *Sentiment analysis*
 **Model developers** [Laurentiu Petrea](https://www.linkedin.com/in/laurentiupetrea/) based on Llama-3 from Meta.
 **Model Architecture** CataLlama is an auto-regressive language model that uses an optimized transformer architecture. The tuned versions use supervised fine-tuning (SFT) and direct preference optimisation (DPO) to align with human preferences for helpfulness and safety.
 **License** The model uses the llama-3 license available at: [https://llama.meta.com/llama3/license](https://llama.meta.com/llama3/license)
 ### Use with transformers

 - *Summarization - both short form and long form*
 - *Sentiment analysis*
 **Model developers** [Laurentiu Petrea](https://www.linkedin.com/in/laurentiupetrea/) based on Llama-3 from Meta.
 **Model Architecture** CataLlama is an auto-regressive language model that uses an optimized transformer architecture. The tuned versions use supervised fine-tuning (SFT) and direct preference optimisation (DPO) to align with human preferences for helpfulness and safety.
 **License** The model uses the llama-3 license available at: [https://llama.meta.com/llama3/license](https://llama.meta.com/llama3/license)
+## Benchmarks
+| Model              | CataLlama-v0.1-Instruct-SFT | CataLlama-v0.2-Instruct-SFT     |
+| ------------------ | --------------------------- | ------------------------------- |
+| MMLU 5 shot        | 55.28                       | **59.35**                       |
+| GSM8K cot 8 shot   | 51.63                       | **76.04**                       |
 ### Use with transformers