sercetexam9
/

PRE-xlnet-large-cased-finetuned-augmentation

+---
+library_name: transformers
+license: mit
+base_model: xlnet-large-cased
+tags:
+- generated_from_trainer
+metrics:
+- f1
+- accuracy
+model-index:
+- name: PRE-xlnet-large-cased-finetuned-augmentation
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# PRE-xlnet-large-cased-finetuned-augmentation
+This model is a fine-tuned version of [xlnet-large-cased](https://huggingface.co/xlnet-large-cased) on the None dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.2115
+- F1: 0.5464
+- Roc Auc: 0.7436
+- Accuracy: 0.7278
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 2e-05
+- train_batch_size: 16
+- eval_batch_size: 16
+- seed: 42
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: cosine
+- lr_scheduler_warmup_steps: 100
+- num_epochs: 20
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | F1     | Roc Auc | Accuracy |
+|:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|:--------:|
+| 0.3168        | 1.0   | 389  | 0.3357          | 0.1362 | 0.5687  | 0.5380   |
+| 0.3382        | 2.0   | 778  | 0.3096          | 0.1411 | 0.5806  | 0.5457   |
+| 0.3417        | 3.0   | 1167 | 0.3022          | 0.1456 | 0.5805  | 0.5470   |
+| 0.3357        | 4.0   | 1556 | 0.2927          | 0.1565 | 0.5737  | 0.5489   |
+| 0.2993        | 5.0   | 1945 | 0.2770          | 0.2407 | 0.6166  | 0.5888   |
+| 0.282         | 6.0   | 2334 | 0.2660          | 0.3136 | 0.6391  | 0.6274   |
+| 0.2473        | 7.0   | 2723 | 0.2469          | 0.3656 | 0.6638  | 0.6564   |
+| 0.2566        | 8.0   | 3112 | 0.2268          | 0.4480 | 0.6943  | 0.6885   |
+| 0.2228        | 9.0   | 3501 | 0.2193          | 0.4335 | 0.6773  | 0.6853   |
+| 0.208         | 10.0  | 3890 | 0.2187          | 0.5049 | 0.7302  | 0.6860   |
+| 0.2031        | 11.0  | 4279 | 0.2065          | 0.5010 | 0.7127  | 0.7040   |
+| 0.1798        | 12.0  | 4668 | 0.2100          | 0.5404 | 0.7428  | 0.7143   |
+| 0.185         | 13.0  | 5057 | 0.2034          | 0.5558 | 0.7553  | 0.7201   |
+| 0.1523        | 14.0  | 5446 | 0.2039          | 0.5487 | 0.7436  | 0.7239   |
+| 0.1447        | 15.0  | 5835 | 0.2093          | 0.5382 | 0.7375  | 0.7284   |
+| 0.1181        | 16.0  | 6224 | 0.2115          | 0.5464 | 0.7436  | 0.7278   |
+### Framework versions
+- Transformers 4.45.1
+- Pytorch 2.4.0
+- Datasets 3.0.1
+- Tokenizers 0.20.0

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8a75b2fb4ce3998fb0b49ddbea02bb1f670744a2d15986d5b8424394c7ee035a
 size 1445339812

 version https://git-lfs.github.com/spec/v1
+oid sha256:58c78d90acff59938bd5977923023b7fb79b4f1f144ae71e85251e433e4288f1
 size 1445339812