dvs's picture
End of training
f22cf1c
raw
history blame
210 Bytes
{
"epoch": 20.0,
"total_flos": 1.1677328181960376e+18,
"train_loss": 0.13230072516534064,
"train_runtime": 1894.9319,
"train_samples_per_second": 24.149,
"train_steps_per_second": 0.19
}