Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
hansh
/
hansken_human_hql
like
0
PEFT
TensorBoard
Safetensors
hansh/hansken_hql
llama
alignment-handbook
trl
sft
Generated from Trainer
4-bit precision
bitsandbytes
License:
llama3.1
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Use this model
72bcd4c
hansken_human_hql
/
runs
/
Aug30_16-06-25_938RL43
/
events.out.tfevents.1725027260.938RL43.754039.0
Commit History
Training in progress, epoch 8
72bcd4c
verified
hansh
commited on
Aug 30
Training in progress, epoch 6
f9d074b
verified
hansh
commited on
Aug 30
Training in progress, epoch 5
4de1dca
verified
hansh
commited on
Aug 30
Training in progress, epoch 4
13ae892
verified
hansh
commited on
Aug 30
Training in progress, epoch 4
5cbba6e
verified
hansh
commited on
Aug 30
Training in progress, epoch 2
dbd547c
verified
hansh
commited on
Aug 30
Training in progress, epoch 1
9a49130
verified
hansh
commited on
Aug 30
Training in progress, epoch 0
798f58f
verified
hansh
commited on
Aug 30