AINovice2005 committed
Commit
1bee25d
1 Parent(s): 6ae5393

Update README.md

Files changed (1): README.md (+3 -3)
README.md CHANGED
@@ -12,7 +12,7 @@ tags:
 
 ---
 
-<h1 style="font-size: 2em;">Presenting ElEmperador.</h1>
+<h1 style="font-size: 2em;">ElEmperador.</h1>
 
 
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/64e8ea3892d9db9a93580fe3/gkDcpIxRCjBlmknN_jzWN.png)
@@ -29,11 +29,11 @@ The argilla/ultrafeedback-binarized-preferences-cleaned dataset was used, albeit
 
 
 # Evals:
-BLEU: 0.0209
+BLEU: 0.209
 
 # Conclusion and Model Recipe.
-ORPO is a viable RLHF algorithm to improve the performance of your models than SFT finetuning. It also helps in aligning the model’s outputs more closely with human preferences,
 
+ORPO is a viable RLHF algorithm that can improve model performance beyond SFT fine-tuning alone, and it helps align the model’s outputs more closely with human preferences,
 leading to more user-friendly and acceptable results.
 
 The model recipe: [https://github.com/ParagEkbote/El-Emperador_ModelRecipe](https://github.com/ParagEkbote/El-Emperador_ModelRecipe)
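The conclusion above leans on ORPO's core idea: add an odds-ratio preference penalty on top of the usual SFT loss, so a single training pass both fits the chosen responses and pushes their odds above the rejected ones. A minimal pure-Python sketch of that objective (the function names and the λ = 0.1 weight are illustrative assumptions, not taken from the model recipe):

```python
import math

def odds(p):
    # Odds of a response probability p in (0, 1): p / (1 - p).
    return p / (1.0 - p)

def orpo_penalty(p_chosen, p_rejected):
    # Odds-ratio term: -log sigmoid(log(odds(chosen) / odds(rejected))).
    # Small when the chosen response is much more likely than the rejected one.
    log_odds_ratio = math.log(odds(p_chosen)) - math.log(odds(p_rejected))
    return -math.log(1.0 / (1.0 + math.exp(-log_odds_ratio)))

def orpo_loss(nll_chosen, p_chosen, p_rejected, lam=0.1):
    # Total loss = SFT negative log-likelihood on the chosen response
    # plus a lambda-weighted odds-ratio penalty (lam is a hypothetical value).
    return nll_chosen + lam * orpo_penalty(p_chosen, p_rejected)

# When the model prefers neither response, the penalty is -log(0.5) ≈ 0.693;
# it shrinks as the chosen response becomes more likely than the rejected one.
print(orpo_penalty(0.5, 0.5), orpo_penalty(0.9, 0.1))
```

In practice, libraries such as TRL provide an `ORPOTrainer` that computes these quantities from per-token log-probabilities; the sketch above only illustrates the shape of the loss.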