Adding modes, graphs and metadata.
README.md
CHANGED
@@ -34,11 +34,11 @@ This model was fine-tuned from the HuggingFace [BERT](https://www.aclweb.org/ant
 This model is case-insensitive: it does not make a difference between english and English.
 
 ## Pruning details
-A side-effect of the block pruning is that some of the attention heads are completely removed:
+A side-effect of the block pruning is that some of the attention heads are completely removed: 48 heads were removed on a total of 144 (33.3%).
 
 Here is a detailed view on how the remaining heads are distributed in the network after pruning.
 
-<div class="graph"><script src="/madlag/bert-base-uncased-squadv1-x1.84-f88.7-d36-hybrid-filled-v1/raw/main/model_card/pruning_info.js" id="
+<div class="graph"><script src="/madlag/bert-base-uncased-squadv1-x1.84-f88.7-d36-hybrid-filled-v1/raw/main/model_card/pruning_info.js" id="f7197e6f-ccea-4a42-8fb9-7a8fe7136cf0"></script></div>
 
 ## Density plot
 
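
The head count added to the README in this change can be checked with a few lines of arithmetic. The sketch below is not part of the commit. It assumes the standard BERT-base layout of 12 layers with 12 attention heads each, which matches the total of 144 quoted in the README. The function name `pruned_head_stats` is hypothetical.

```python
# Assumption: BERT-base layout (12 layers x 12 heads = 144 heads),
# consistent with the total quoted in the README change.
NUM_LAYERS = 12
HEADS_PER_LAYER = 12
REMOVED_HEADS = 48  # figure quoted in the README change


def pruned_head_stats(removed, layers=NUM_LAYERS, heads_per_layer=HEADS_PER_LAYER):
    """Return (total heads, remaining heads, percent removed)."""
    total = layers * heads_per_layer
    if not 0 <= removed <= total:
        raise ValueError(f"removed must be between 0 and {total}, got {removed}")
    remaining = total - removed
    percent_removed = 100.0 * removed / total
    return total, remaining, percent_removed


total, remaining, percent_removed = pruned_head_stats(REMOVED_HEADS)
print(f"{REMOVED_HEADS} of {total} heads removed ({percent_removed:.1f}%), {remaining} remaining")
# prints: 48 of 144 heads removed (33.3%), 96 remaining
```

Under that assumption, 96 heads remain after pruning, and the 33.3% in the README is the removed share, not the remaining one.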