This model has 1 file scanned as unsafe.
- copy_teacher_modules=_(_lm_head___True)_, hs_layer_mapper=last, hs_loss_fn=mse, hs_weight=1.0
- dataset_subset=default, dataset_uri=distily_c4_multilingual_1M, learning_rate=0.0001, per_device_train_batch_size=4
- dataset_subset=default, dataset_uri=distily_c4_multilingual_1M
- hs_layer_mapper=last, hs_loss_fn=mse, hs_weight=1.0
- learning_rate=0.0001, per_device_train_batch_size=4
-
0 Bytes
-
5.85 MB
LFS
-
578 Bytes
LFS