Edit model card

farsi_lastname_classifier_bert

This model is trained to classify Iranian last names. To use it, type a last name in the space provided on the right and then click on "compute". The model computes probability of the last name being Persian. The compute takes a few seconds to load for the first try (because it needs to load the model first). Subsequent attempt should take only milliseconds. In practice the model can compute the results for an entire batch of data (last names) in a fraction of a second.

It achieves the following results on the evaluation set:

  • Loss: 0.0863
  • Accuracy: 0.976

Model description

Model is based on Bert ("bert-base-cased")

The following hyperparameters were used during training:

  • learning_rate: 1e-05
  • train_batch_size: 128
  • eval_batch_size: 256
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: cosine
  • lr_scheduler_warmup_ratio: 0.1
  • num_epochs: 10
  • mixed_precision_training: Native AMP

Training results

Training Loss Epoch Step Validation Loss Accuracy
No log 1.0 12 0.6325 0.588
No log 2.0 24 0.3414 0.952
No log 3.0 36 0.2496 0.97
No log 4.0 48 0.1674 0.976
No log 5.0 60 0.1160 0.976
No log 6.0 72 0.0917 0.972
No log 7.0 84 0.0896 0.974
No log 8.0 96 0.0874 0.974
No log 9.0 108 0.0869 0.974
No log 10.0 120 0.0863 0.976

Framework versions

  • Transformers 4.24.0
  • Pytorch 1.12.1+cu113
  • Datasets 2.6.1
  • Tokenizers 0.13.2
Downloads last month
14
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.