--- language: - en license: apache-2.0 tags: - text-generation-inference - transformers - unsloth - mistral - trl base_model: unsloth/zephyr-sft-bnb-4bit datasets: - unalignment/toxic-dpo-v0.2 --- # Uploaded model - **Developed by:** akaistormherald - **License:** apache-2.0 - **Finetuned from model :** unsloth/zephyr-sft-bnb-4bit - 4bit DPO training from unalignment/toxic-dpo-v0.2, using unsloth