--- datasets: - HuggingFaceH4/ultrafeedback_binarized base_model: - OpenRLHF/Llama-3-8b-sft-mixture --- Base model: [OpenRLHF/Llama-3-8b-sft-mixture](https://huggingface.co./OpenRLHF/Llama-3-8b-sft-mixture) Preference dataset: [HuggingFaceH4/ultrafeedback_binarized](https://huggingface.co./datasets/HuggingFaceH4/ultrafeedback_binarized)