Edit model card

Llama-3.1-8B-Instruct-EI1-2ep-sft

This model is a fine-tuned version of meta-llama/Llama-3.1-8B-Instruct on an unknown dataset. It achieves the following results on the evaluation set:

  • Loss: 0.3970

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 6e-06
  • train_batch_size: 1
  • eval_batch_size: 8
  • seed: 42
  • distributed_type: multi-GPU
  • num_devices: 32
  • gradient_accumulation_steps: 2
  • total_train_batch_size: 64
  • total_eval_batch_size: 256
  • optimizer: Adam with betas=(0.9,0.95) and epsilon=1e-08
  • lr_scheduler_type: cosine
  • num_epochs: 2.0

Training results

Training Loss Epoch Step Validation Loss
No log 0.0562 100 0.5980
No log 0.1124 200 0.5609
No log 0.1685 300 0.5369
No log 0.2247 400 0.5156
0.5582 0.2809 500 0.4955
0.5582 0.3371 600 0.4795
0.5582 0.3933 700 0.4655
0.5582 0.4494 800 0.4522
0.5582 0.5056 900 0.4433
0.448 0.5618 1000 0.4355
0.448 0.6180 1100 0.4295
0.448 0.6742 1200 0.4252
0.448 0.7303 1300 0.4200
0.448 0.7865 1400 0.4159
0.4123 0.8427 1500 0.4124
0.4123 0.8989 1600 0.4098
0.4123 0.9551 1700 0.4075
0.4123 1.0112 1800 0.4086
0.4123 1.0674 1900 0.4075
0.3815 1.1236 2000 0.4069
0.3815 1.1798 2100 0.4054
0.3815 1.2360 2200 0.4043
0.3815 1.2921 2300 0.4029
0.3815 1.3483 2400 0.4022
0.3532 1.4045 2500 0.4012
0.3532 1.4607 2600 0.4002
0.3532 1.5169 2700 0.3996
0.3532 1.5730 2800 0.3986
0.3532 1.6292 2900 0.3982
0.35 1.6854 3000 0.3978
0.35 1.7416 3100 0.3975
0.35 1.7978 3200 0.3971
0.35 1.8539 3300 0.3971
0.35 1.9101 3400 0.3970
0.3468 1.9663 3500 0.3970

Framework versions

  • Transformers 4.43.4
  • Pytorch 2.4.0+cu121
  • Datasets 3.0.1
  • Tokenizers 0.19.1
Downloads last month
430
Safetensors
Model size
8.03B params
Tensor type
F32
·
Inference API
Unable to determine this model's library. Check the docs .

Model tree for qfq/Llama-3.1-8B-Instruct-EI1-2ep-sft

Finetuned
(420)
this model
Finetunes
3 models