Running on Zero 11 π¦Ύπͺπ½ Human Feedback Collector | Meta-Llama-3.1-8B-Instruct | (DPO) LLM, chatbot, human-feedback
Running on Zero 5 π¦Ύπͺπ½ Human Feedback Collector | Meta-Llama-3.1-8B-Instruct | (KTO) LLM, chatbot, human-feedback