Triangle104
/

Mistral-Small-Drummer-22B-Q5_K_M-GGUF

Inference Endpoints

Model card Files Files and versions Community

Triangle104 commited on 4 days ago

Commit

2286514

•

1 Parent(s): 9ded2b5

Update README.md

Files changed (1) hide show

README.md +33 -0

README.md CHANGED Viewed

@@ -111,6 +111,39 @@ model-index:
 This model was converted to GGUF format from [`nbeerbower/Mistral-Small-Drummer-22B`](https://huggingface.co/nbeerbower/Mistral-Small-Drummer-22B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
 Refer to the [original model card](https://huggingface.co/nbeerbower/Mistral-Small-Drummer-22B) for more details on the model.
 ## Use with llama.cpp
 Install llama.cpp through brew (works on Mac and Linux)

 This model was converted to GGUF format from [`nbeerbower/Mistral-Small-Drummer-22B`](https://huggingface.co/nbeerbower/Mistral-Small-Drummer-22B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
 Refer to the [original model card](https://huggingface.co/nbeerbower/Mistral-Small-Drummer-22B) for more details on the model.
+---
+Model details:
+-
+mistralai/Mistral-Small-Instruct-2409 finetuned on jondurbin/gutenberg-dpo-v0.1 and nbeerbower/gutenberg2-dpo.
+Method
+ORPO tuned with 2xA40 on RunPod for 1 epoch.
+learning_rate=4e-6,
+lr_scheduler_type="linear",
+beta=0.1,
+per_device_train_batch_size=4,
+per_device_eval_batch_size=4,
+gradient_accumulation_steps=8,
+optim="paged_adamw_8bit",
+num_train_epochs=1,
+Dataset was prepared using Mistral-Small Instruct format.
+Fine-tune Llama 3 with ORPO
+Open LLM Leaderboard Evaluation Results
+Detailed results can be found here
+Metric 	Value
+Avg. 	29.45
+IFEval (0-Shot) 	63.31
+BBH (3-Shot) 	40.12
+MATH Lvl 5 (4-Shot) 	16.69
+GPQA (0-shot) 	12.42
+MuSR (0-shot) 	9.80
+MMLU-PRO (5-shot) 	34.39
+---
 ## Use with llama.cpp
 Install llama.cpp through brew (works on Mac and Linux)