QuantFactory
/

Mistral-7B-Instruct-RDPO-GGUF

Text Generation

Inference Endpoints

Model card Files Files and versions Community

munish0838 commited on May 28

Commit

e751659

•

1 Parent(s): 0c6922a

Create README.md

Files changed (1) hide show

README.md +11 -0

README.md ADDED Viewed

	@@ -0,0 +1,11 @@

+---
+library_name: transformers
+pipeline_tag: text-generation
+base_model: princeton-nlp/Mistral-7B-Instruct-RDPO
+---
+# QuantFactory/Mistral-7B-Instruct-RDPO-GGUF
+This is quantized version of [princeton-nlp/Mistral-7B-Instruct-RDPO](https://huggingface.co/princeton-nlp/Mistral-7B-Instruct-RDPO) created using llama.cpp
+# Model Description
+This is a model released from the preprint: *[SimPO: Simple Preference Optimization with a Reference-Free Reward](https://arxiv.org/abs/2405.14734)*  Please refer to our [repository](https://github.com/princeton-nlp/SimPO) for more details.