README.md · Someshfengde/llama-3-instruction-tuned-AIMO at main

Instruction Tuning LLAMA3

This repo uses the torchtune for instruction tuning the llama3 pretrained model on mathematical tasks using LORA.

> pip install poetry 
> poetry install

Further commands over shell terminal

tune download meta-llama/Meta-Llama-3-8B \
--output-dir llama3-8b-hf \
--hf-token <HF_TOKEN>

To start instruction tuning with lora and torchtune

tune run lora_finetune_single_device --config ./lora_finetune_single_device.yaml

tune run quantize --config ./quantization_config.yaml

tune run generate --config ./generation_config.yaml \
prompt="what is 2 + 2."

To run evaluations

tune run eleuther_eval --config ./eval_config.yaml