QuantFactory/Mistral-7B-Instruct-RDPO-GGUF

This is quantized version of princeton-nlp/Mistral-7B-Instruct-RDPO created using llama.cpp

Model Description

This is a model released from the preprint: SimPO: Simple Preference Optimization with a Reference-Free Reward Please refer to our repository for more details.

Downloads last month: 198

GGUF

Model size

7.24B params

Architecture

llama

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

Inference Examples

Text Generation

This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for QuantFactory/Mistral-7B-Instruct-RDPO-GGUF

Base model

princeton-nlp/Mistral-7B-Instruct-RDPO

Quantized

(2)

this model

Collection including QuantFactory/Mistral-7B-Instruct-RDPO-GGUF

Mistral-AI

Collection

Quantized versions of models by mistralai • 19 items • Updated Oct 3 • 5