am_llama3_dpo / README.md
simonbutt's picture
Update README.md
2b0ac10 verified
|
raw
history blame contribute delete
No virus
800 Bytes
---
language:
- en
- am
license: apache-2.0
tags:
- text-generation-inference
- transformers
- unsloth
- llama
- trl
- sft
base_model: unsloth/llama-3-8b-bnb-4bit
datasets:
- iocuydi/amharic-alpaca
- iocuydi/amharic-dolly-15k
---
# Llama3 Amharic DPO
[Amharic Llama3 8B Alpaca](simonbutt/am_llama3_alpaca) further DPO tuned on an amharic translated dolly-15k [dataset](https://huggingface.co./datasets/iocuydi/amharic-dolly-15k) to always respond in Amharic.
Very token inefficient.
- **Developed by:** simonbutt
- **License:** apache-2.0
- **Finetuned from model:**
- unsloth/llama-3-8b-bnb-4bit
- simonbutt/am_llama3_alpaca
[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)