mcanoglu
/

bigcode-starcoderbase-1b-finetuned-defect-detection

@@ -4,7 +4,6 @@ tags:
 - generated_from_trainer
 metrics:
 - accuracy
-- f1
 - precision
 - recall
 model-index:
@@ -19,11 +18,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [bigcode/starcoderbase-1b](https://huggingface.co/bigcode/starcoderbase-1b) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.5149
-- Accuracy: 0.7523
-- F1: 0.7482
-- Precision: 0.7430
-- Recall: 0.7533
 ## Model description
@@ -43,28 +42,30 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 2
-- eval_batch_size: 2
 - seed: 4711
-- gradient_accumulation_steps: 16
 - total_train_batch_size: 32
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 3
 - mixed_precision_training: Native AMP
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Accuracy | F1     | Precision | Recall |
-|:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
-| 0.7604        | 1.0   | 996  | 0.5379          | 0.7144   | 0.6627 | 0.7829    | 0.5745 |
-| 0.4649        | 2.0   | 1992 | 0.4524          | 0.7480   | 0.7585 | 0.7129    | 0.8104 |
-| 0.318         | 3.0   | 2988 | 0.5149          | 0.7523   | 0.7482 | 0.7430    | 0.7533 |
 ### Framework versions
-- Transformers 4.36.2
-- Pytorch 2.1.2+cu121
-- Datasets 2.16.1
-- Tokenizers 0.15.0

 - generated_from_trainer
 metrics:
 - accuracy
 - precision
 - recall
 model-index:
 This model is a fine-tuned version of [bigcode/starcoderbase-1b](https://huggingface.co/bigcode/starcoderbase-1b) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.9591
+- Accuracy: 0.7666
+- Roc Auc: 0.7662
+- Precision: 0.7657
+- Recall: 0.7523
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 8
+- eval_batch_size: 8
 - seed: 4711
+- gradient_accumulation_steps: 4
 - total_train_batch_size: 32
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 5
 - mixed_precision_training: Native AMP
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | Accuracy | Roc Auc | Precision | Recall |
+|:-------------:|:-----:|:----:|:---------------:|:--------:|:-------:|:---------:|:------:|
+| 0.7596        | 1.0   | 996  | 0.5406          | 0.6852   | 0.6897  | 0.6264    | 0.8813 |
+| 0.4855        | 2.0   | 1993 | 0.4691          | 0.7377   | 0.7396  | 0.6954    | 0.8237 |
+| 0.3547        | 3.0   | 2989 | 0.4832          | 0.7480   | 0.7479  | 0.7410    | 0.7441 |
+| 0.2463        | 4.0   | 3986 | 0.5966          | 0.7628   | 0.7646  | 0.7196    | 0.8428 |
+| 0.1633        | 5.0   | 4980 | 0.9591          | 0.7666   | 0.7662  | 0.7657    | 0.7523 |
 ### Framework versions
+- Transformers 4.37.2
+- Pytorch 2.2.0+cu121
+- Datasets 2.17.1
+- Tokenizers 0.15.2

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e11dcea9a5c14ef2b08a21b7b13adc4cac323b81829b3216840bcac2a158fe38
 size 4548876216

 version https://git-lfs.github.com/spec/v1
+oid sha256:2127f62c21723d8676fa2b820ff6f99a031af22d0564351b7e6ac1b760951084
 size 4548876216

runs/Feb21_06-14-19_nglczrkt3t/events.out.tfevents.1708496059.nglczrkt3t.174.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ade619510e9d9bd0b3e3e7870a6a474b667ebea39b87b66de94b470f69c64474
-size 8690

 version https://git-lfs.github.com/spec/v1
+oid sha256:c28a99e931174c5a3c3d3ab70bc0dc0014b9a441d93d5cf4d87abdf02f0c6b35
+size 9044