Papers
arxiv:2106.16163

The MultiBERTs: BERT Reproductions for Robustness Analysis

Published on Jun 30, 2021
Authors:
,
,
,
,
,
,
,
,
,
,
,

Abstract

Experiments with pre-trained models such as BERT are often based on a single checkpoint. While the conclusions drawn apply to the artifact tested in the experiment (i.e., the particular instance of the model), it is not always clear whether they hold for the more general procedure which includes the architecture, <PRE_TAG>training data</POST_TAG>, initialization scheme, and loss function. Recent work has shown that repeating the pre-training process can lead to substantially different performance, suggesting that an alternate strategy is needed to make principled statements about procedures. To enable researchers to draw more robust conclusions, we introduce the Multi<PRE_TAG>BERTs</POST_TAG>, a set of 25 <PRE_TAG>BERT-Base</POST_TAG> checkpoints, trained with similar hyper-parameters as the original BERT model but differing in random weight initialization and shuffling of <PRE_TAG>training data</POST_TAG>. We also define the Multi-Bootstrap, a non-parametric bootstrap method for statistical inference designed for settings where there are multiple pre-trained models and limited test data. To illustrate our approach, we present a case study of gender bias in coreference resolution, in which the Multi-Bootstrap lets us measure effects that may not be detected with a single checkpoint. We release our models and statistical library along with an additional set of 140 intermediate checkpoints captured during pre-training to facilitate research on learning dynamics.

Community

Sign up or log in to comment

Models citing this paper 340

Browse 340 models citing this paper

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2106.16163 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2106.16163 in a Space README.md to link it from this page.

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.