DeBERTaV3-small-ST-AdaptiveLayer-Norm-ep2 / sentence_bert_config.json
bobox's picture
n_layers_per_step = 1, last_layer_weight = 1.5 * model_layers,, prior_layers_weight= 1, kl_div_weight = 2, kl_temperature= 1,
554d487 verified
raw
history blame contribute delete
53 Bytes
{
"max_seq_length": 512,
"do_lower_case": false
}