Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
distily
/
distily_test_attn_mlp
like
0
Follow
Distily Project
2
TensorBoard
Safetensors
wikimedia/wikipedia
Distily
gpt2
bitnet
1.58b
Generated from Trainer
License:
mit
Model card
Files
Files and versions
Metrics
Training metrics
Community
55e1a50
distily_test_attn_mlp
/
logs
1 contributor
History:
4 commits
lapp0
Training in progress, step 61875
55e1a50
verified
5 months ago
attn_layer_mapper=all, attn_loss_fn=raw_mse, attn_projector=mlp
Training in progress, step 61875
5 months ago
attn_layer_mapper=last, attn_loss_fn=raw_mse, attn_projector=mlp
Training in progress, step 61875
5 months ago
attn_layer_mapper=last_k_2, attn_loss_fn=raw_mse, attn_projector=mlp
Training in progress, step 61875
5 months ago
attn_layer_mapper=layer-2, attn_loss_fn=raw_mse, attn_projector=mlp
Training in progress, step 61875
5 months ago