metadata

library_name: peft
pipeline_tag: conversational
base_model: meta-llama/Llama-2-7b-hf

Model

Llama-2-7b-qlora-msagent-react is a QLoRA adapter fine-tuned from Llama-2-7b on the MSAgent-Bench dataset with XTuner.

Install XTuner:

pip install xtuner

Then start a chat session with the adapter applied to the base model:

xtuner chat meta-llama/Llama-2-7b-hf --adapter xtuner/Llama-2-7b-qlora-msagent-react --lagent
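Because the card lists `library_name: peft`, the adapter can presumably also be loaded outside the XTuner CLI. The following is a minimal sketch, not a documented workflow of this repository: it assumes `transformers`, `peft`, `torch`, and `accelerate` are installed, and the helper name `load_model` and the half-precision setting are illustrative choices.

```python
# Sketch only: attaches the adapter with PEFT instead of using `xtuner chat`.
BASE_MODEL = "meta-llama/Llama-2-7b-hf"  # taken from the card's base_model field
ADAPTER = "xtuner/Llama-2-7b-qlora-msagent-react"  # this repository


def load_model(base_model: str = BASE_MODEL, adapter: str = ADAPTER):
    """Return (tokenizer, model) with the adapter attached to the base model.

    `load_model` is a hypothetical helper written for this example.
    """
    # Imported lazily so this file can be imported without the heavy dependencies.
    import torch
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(base_model)
    model = AutoModelForCausalLM.from_pretrained(
        base_model,
        torch_dtype=torch.float16,  # assumption: half precision to reduce memory use
        device_map="auto",          # relies on the accelerate package
    )
    # Load the adapter weights on top of the base model.
    model = PeftModel.from_pretrained(model, adapter)
    model.eval()
    return tokenizer, model
```

Calling `load_model()` downloads both the base checkpoint and the adapter. The base checkpoint is gated on the Hugging Face Hub, so access has to be granted to your account first. This card does not document the prompt template used during fine-tuning, so outputs from hand-written prompts may differ from what `xtuner chat` produces.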

To reproduce the fine-tuning run, launch training with 8 processes on one node:

NPROC_PER_NODE=8 xtuner train llama2_7b_qlora_msagent_react_e3_gpu8