Update README.md
README.md
@@ -20,7 +20,7 @@ An initial foray into the world of fine-tuning. The goal of this release was to
 ### Notes & Methodology
 * [Excalibur-7b](https://huggingface.co/InferenceIllusionist/Excalibur-7b) fine-tuned with Direct Preference Optimization (DPO) using Intel/orca_dpo_pairs
 * This is a quick experiment to determine the impact of DPO finetuning on the original base model
-*
+* Ran for a little over an hour on a single A100
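As a rough illustration of the DPO objective referenced in the notes (not the actual training code used for this release), DPO reduces to a logistic loss on an implicit reward margin between the chosen and rejected responses. The sketch below assumes summed per-response log-probabilities as inputs and a hypothetical `beta=0.1`; in practice a library such as TRL computes this per batch from model logits.

```python
import math

def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Per-example DPO loss from the summed log-probabilities of the
    chosen and rejected responses under the policy and reference models."""
    # Implicit reward margin: how much more the policy prefers the chosen
    # response over the rejected one, relative to the frozen reference.
    margin = (policy_chosen_logp - ref_chosen_logp) - (
        policy_rejected_logp - ref_rejected_logp)
    # DPO loss: -log sigmoid(beta * margin)
    return -math.log(1.0 / (1.0 + math.exp(-beta * margin)))

# With no preference margin the loss is log(2) ~= 0.693; it shrinks as the
# policy comes to favour the chosen response more than the reference does.
print(dpo_loss(0.0, 0.0, 0.0, 0.0))
print(dpo_loss(-10.0, -14.0, -12.0, -12.0))
```

The fine-tune described above would minimize this quantity averaged over the Intel/orca_dpo_pairs preference pairs, with the original Excalibur-7b weights serving as the frozen reference model.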