sgugger
/

bert-finetuned-mrpc

Text Classification

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

bert-finetuned-mrpc

This model is a fine-tuned version of bert-base-cased on the GLUE MRPC dataset. It achieves the following results on the evaluation set:

Loss: 0.5152
Accuracy: 0.8603
F1: 0.9032
Combined Score: 0.8818

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 5e-05
train_batch_size: 8
eval_batch_size: 8
seed: 42
distributed_type: multi-GPU
num_devices: 2
total_train_batch_size: 16
total_eval_batch_size: 16
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: linear
num_epochs: 3.0

Training results

Training Loss	Epoch	Step	Validation Loss	Accuracy	F1	Combined Score
No log	1.0	230	0.3668	0.8431	0.8881	0.8656
No log	2.0	460	0.3751	0.8578	0.9017	0.8798
0.4264	3.0	690	0.5152	0.8603	0.9032	0.8818

Framework versions

Transformers 4.11.0.dev0
Pytorch 1.8.1+cu111
Datasets 1.10.3.dev0
Tokenizers 0.10.3

Downloads last month: 16

Safetensors

Model size

108M params

Tensor type

I64

·

F32

·

Inference Providers NEW

Text Classification

This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for sgugger/bert-finetuned-mrpc

Merges

1 model

Dataset used to train sgugger/bert-finetuned-mrpc

Space using sgugger/bert-finetuned-mrpc 1

Evaluation results

Accuracy on GLUE MRPC
self-reported

0.860
F1 on GLUE MRPC
self-reported

0.903
Accuracy on glue
validation set self-reported

0.860
Precision on glue
validation set self-reported

0.858
Recall on glue
validation set self-reported

0.953
AUC on glue
validation set self-reported

0.926
F1 on glue
validation set self-reported

0.903
loss on glue
validation set self-reported

0.515

View on Papers With Code