distilbert-fa-zwnj-base-finetuned-pquad-pquad
This model is a fine-tuned version of Gholamreza/distilbert-fa-zwnj-base-finetuned-pquad on the pquad dataset.
Model description
More information needed
Intended uses & limitations
More information needed
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 2e-05
- train_batch_size: 8
- eval_batch_size: 8
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 3
- mixed_precision_training: Native AMP
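As an illustration of what the listed `lr_scheduler_type: linear` setting implies, the following is a minimal pure-Python sketch of a linear decay schedule starting from the reported `learning_rate` of 2e-05. The helper name `linear_lr`, the zero warmup, and the total step count are assumptions for illustration only; the card does not state warmup steps or the number of optimizer steps.

```python
def linear_lr(step, total_steps, base_lr=2e-05, warmup_steps=0):
    """Learning rate at a given optimizer step under linear warmup then linear decay.

    base_lr matches the learning_rate listed above; warmup_steps=0 is an
    assumption, since the card does not report any warmup.
    """
    if step < warmup_steps:
        # Linear ramp from 0 up to base_lr during warmup.
        return base_lr * step / max(1, warmup_steps)
    # Linear decay from base_lr down to 0 at total_steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Hypothetical step count, chosen only to make the example concrete.
total_steps = 3000
for step in (0, 1500, 3000):
    print(step, linear_lr(step, total_steps))
```

Under these assumptions the rate starts at the configured value, is halved at the midpoint of training, and reaches zero at the final step.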
Training results
Framework versions
- Transformers 4.44.2
- Pytorch 2.4.1+cu121
- Datasets 3.0.1
- Tokenizers 0.19.1
Computed Metrics
{'exact': 43.91925777331996,
'f1': 59.06087423686695,
'total': 7976,
'HasAns_exact': 58.41534612176814,
'HasAns_f1': 78.5603891431611,
'HasAns_total': 5995,
'NoAns_exact': 0.05047955577990914,
'NoAns_f1': 0.05047955577990914,
'NoAns_total': 1981,
'best_exact': 43.91925777331996,
'best_exact_thresh': 0.0,
'best_f1': 59.0608742368668,
'best_f1_thresh': 0.0}
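The overall scores above can be cross-checked against the per-subset scores, since the overall `exact` and `f1` values should be the question-count-weighted averages of the `HasAns` and `NoAns` subsets. The sketch below uses only the numbers reported in this card; the helper name `weighted_overall` is hypothetical.

```python
# Values copied from the computed metrics reported in this card.
metrics = {
    "exact": 43.91925777331996,
    "f1": 59.06087423686695,
    "total": 7976,
    "HasAns_exact": 58.41534612176814,
    "HasAns_f1": 78.5603891431611,
    "HasAns_total": 5995,
    "NoAns_exact": 0.05047955577990914,
    "NoAns_f1": 0.05047955577990914,
    "NoAns_total": 1981,
}


def weighted_overall(metrics, key):
    """Recombine per-subset percentages into the overall percentage."""
    has_ans = metrics[f"HasAns_{key}"] * metrics["HasAns_total"]
    no_ans = metrics[f"NoAns_{key}"] * metrics["NoAns_total"]
    return (has_ans + no_ans) / metrics["total"]


overall_exact = weighted_overall(metrics, "exact")
overall_f1 = weighted_overall(metrics, "f1")

# Number of unanswerable questions for which the prediction was scored correct.
no_ans_correct = round(metrics["NoAns_exact"] / 100 * metrics["NoAns_total"])

print(f"exact={overall_exact:.4f} f1={overall_f1:.4f} NoAns correct={no_ans_correct}")
```

The recombined values agree with the reported overall scores. The `NoAns_exact` figure corresponds to 1 of the 1981 unanswerable questions, so the gap between the overall scores and the `HasAns` scores comes almost entirely from the `NoAns` subset.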