Train after merging?
#1
by
adi-kmt
- opened
Other than adding positive prompts, is it necessary to further fine-tune after merging into an MoE?
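For context, "adding a positive prompt" refers to the routing prompts in a mergekit-moe config, roughly like the sketch below. The base model, expert names, and prompts here are placeholders for illustration, not the actual models from this thread:

```yaml
# Hypothetical mergekit-moe config; all model names and prompts are placeholders.
base_model: mistralai/Mistral-7B-v0.1
gate_mode: hidden          # route tokens by hidden-state similarity to the positive prompts
dtype: bfloat16
experts:
  - source_model: some-math-expert     # placeholder
    positive_prompts:
      - "solve this math problem step by step"
  - source_model: some-code-expert     # placeholder
    positive_prompts:
      - "write a Python function"
```

With mergekit installed, `mergekit-moe config.yml ./merged-moe` produces the merged model; whether it then needs further fine-tuning is the question here.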
Fine-tuning can improve the score again if the dataset is good.
https://huggingface.co./cloudyu/Pluto_24B_DPO_200/blob/main/dpo-metrics.jpg
Thanks for sharing. I would like to know: did you do SFT before the DPO stage, and if so, on which dataset? If not, could you tell me which preference (comparison) dataset you used during DPO training? Thanks again for sharing.