Sparse BERT base model (uncased)

A pretrained BERT base (uncased) model pruned to 70% sparsity. Going by the repository name, Intel/bert-base-uncased-sparse-70-unstructured, the pruning is unstructured.

Intended Use

The model can be fine-tuned on downstream tasks with the sparsity already embedded in its weights. To preserve the sparsity during fine-tuning, apply a mask to each sparse weight so that the optimizer cannot update the zeroed entries.
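The masking step can be sketched in PyTorch as follows. This is a minimal illustration under stated assumptions, not the authors' training recipe: the helper names `build_sparsity_masks` and `apply_sparsity_masks` are hypothetical, a small `nn.Linear` layer stands in for the real model so that the snippet runs without a download, and the mask is re-applied after every optimizer step (one common way to keep pruned entries at zero; masking the gradients is another).

```python
import torch
from torch import nn


def build_sparsity_masks(model):
    """Record a 0/1 mask per weight matrix marking the entries that are non-zero now.

    Hypothetical helper. Assumption: only parameters with two or more
    dimensions (weight matrices) were pruned; biases are left dense.
    """
    masks = {}
    for name, param in model.named_parameters():
        if param.dim() >= 2:
            masks[name] = (param.detach() != 0).to(param.dtype)
    return masks


def apply_sparsity_masks(model, masks):
    """Zero out every entry that was zero when the masks were built (hypothetical helper)."""
    with torch.no_grad():
        for name, param in model.named_parameters():
            if name in masks:
                param.mul_(masks[name])


# Toy stand-in for the sparse model: a linear layer with roughly 70% of its
# weights set to zero. A real run would load the pretrained checkpoint instead.
torch.manual_seed(0)
model = nn.Linear(8, 4)
with torch.no_grad():
    keep = (torch.rand_like(model.weight) > 0.7).to(model.weight.dtype)
    model.weight.mul_(keep)

masks = build_sparsity_masks(model)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-2)

for _ in range(3):
    inputs = torch.randn(16, 8)
    loss = model(inputs).pow(2).mean()  # placeholder loss for illustration
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    # The optimizer may have moved pruned entries away from zero; reset them.
    apply_sparsity_masks(model, masks)

pruned = masks["weight"] == 0
print("pruned entries still zero:", bool(torch.all(model.weight[pruned] == 0)))
```

To apply the same idea to this model, one would presumably build the masks once, right after loading the checkpoint (for example with `transformers.AutoModelForMaskedLM.from_pretrained`) and before the first optimizer step, then call the mask-application helper after each step of the fine-tuning loop.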
