mcanoglu/deepseek-ai-deepseek-coder-1.3b-base-finetuned-defect-detection

@@ -20,11 +20,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [deepseek-ai/deepseek-coder-1.3b-base](https://huggingface.co/deepseek-ai/deepseek-coder-1.3b-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.5530
-- Accuracy: 0.7686
-- F1: 0.7679
-- Precision: 0.7527
-- Recall: 0.7837
 ## Model description
@@ -44,28 +44,30 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 2
-- eval_batch_size: 2
 - seed: 4711
-- gradient_accumulation_steps: 16
 - total_train_batch_size: 32
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 3
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1     | Precision | Recall |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
-| 0.5818        | 1.0   | 996  | 0.4514          | 0.7510   | 0.7353 | 0.7647    | 0.7081 |
-| 0.3357        | 2.0   | 1992 | 0.4397          | 0.7615   | 0.7695 | 0.7289    | 0.8150 |
-| 0.2098        | 3.0   | 2988 | 0.5530          | 0.7686   | 0.7679 | 0.7527    | 0.7837 |
 ### Framework versions
-- Transformers 4.36.2
-- Pytorch 2.1.2+cu121
-- Datasets 2.16.1
-- Tokenizers 0.15.0

 This model is a fine-tuned version of [deepseek-ai/deepseek-coder-1.3b-base](https://huggingface.co/deepseek-ai/deepseek-coder-1.3b-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.8154
+- Accuracy: 0.7877
+- F1: 0.7861
+- Precision: 0.7736
+- Recall: 0.7991
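The reported F1 values appear to be the plain harmonic mean of the reported precision and recall. The following is a minimal sketch that checks this for the final-epoch numbers of both runs. The helper name `f1_from_precision_recall` is hypothetical, and the assumption that the card uses binary (not macro-averaged) F1 is inferred from the numbers, not stated in the card.

```python
def f1_from_precision_recall(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (hypothetical helper)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (precision, recall, reported F1), copied from the card; values are rounded to 4 decimals.
reported = {
    "previous run, epoch 3": (0.7527, 0.7837, 0.7679),
    "updated run, epoch 5": (0.7736, 0.7991, 0.7861),
}

for name, (precision, recall, f1) in reported.items():
    computed = f1_from_precision_recall(precision, recall)
    print(f"{name}: computed F1 = {computed:.4f}, reported F1 = {f1:.4f}")
```

Because the inputs are already rounded, agreement can only be expected to within about one unit in the fourth decimal place.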
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 8
+- eval_batch_size: 8
 - seed: 4711
+- gradient_accumulation_steps: 4
 - total_train_batch_size: 32
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 5
 - mixed_precision_training: Native AMP
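The commit changes the per-device batch size and the gradient accumulation steps but leaves `total_train_batch_size` at 32. A short sketch of that arithmetic follows. The function name `effective_batch_size` is hypothetical, and a single training device is assumed because the card lists no device count.

```python
def effective_batch_size(per_device_batch: int,
                         accumulation_steps: int,
                         num_devices: int = 1) -> int:
    """Samples contributing to one optimizer step (hypothetical helper).

    num_devices=1 is an assumption; the card does not state the device count.
    """
    return per_device_batch * accumulation_steps * num_devices


previous = effective_batch_size(per_device_batch=2, accumulation_steps=16)
updated = effective_batch_size(per_device_batch=8, accumulation_steps=4)

print(f"previous run: {previous}, updated run: {updated}")
```

Under that assumption both configurations take optimizer steps over 32 samples, so the learning-rate schedule per step is comparable. The updated run simply fits more samples into each forward pass.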
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1     | Precision | Recall |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
+| 0.5701        | 1.0   | 996  | 0.4446          | 0.7417   | 0.7633 | 0.6910    | 0.8525 |
+| 0.3448        | 2.0   | 1993 | 0.4246          | 0.7681   | 0.7490 | 0.7944    | 0.7086 |
+| 0.2305        | 3.0   | 2989 | 0.4693          | 0.7912   | 0.7924 | 0.7701    | 0.8160 |
+| 0.1564        | 4.0   | 3986 | 0.5977          | 0.7836   | 0.7790 | 0.7774    | 0.7806 |
+| 0.1102        | 5.0   | 4980 | 0.8154          | 0.7877   | 0.7861 | 0.7736    | 0.7991 |
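In the updated table the training loss falls every epoch while the validation loss rises after epoch 2, and the headline metrics in the card match the epoch 5 row. The sketch below picks the best epoch under different criteria from the values in the table. The `EpochResult` type and variable names are hypothetical, introduced only for illustration.

```python
from typing import NamedTuple


class EpochResult(NamedTuple):
    """One row of the updated training-results table (hypothetical container)."""
    epoch: int
    train_loss: float
    val_loss: float
    accuracy: float
    f1: float


# Values copied from the updated run's table.
results = [
    EpochResult(1, 0.5701, 0.4446, 0.7417, 0.7633),
    EpochResult(2, 0.3448, 0.4246, 0.7681, 0.7490),
    EpochResult(3, 0.2305, 0.4693, 0.7912, 0.7924),
    EpochResult(4, 0.1564, 0.5977, 0.7836, 0.7790),
    EpochResult(5, 0.1102, 0.8154, 0.7877, 0.7861),
]

best_by_val_loss = min(results, key=lambda r: r.val_loss)
best_by_accuracy = max(results, key=lambda r: r.accuracy)
best_by_f1 = max(results, key=lambda r: r.f1)

print(f"lowest validation loss: epoch {best_by_val_loss.epoch}")
print(f"highest accuracy:       epoch {best_by_accuracy.epoch}")
print(f"highest F1:             epoch {best_by_f1.epoch}")
```

The criteria disagree with each other and with the final checkpoint, which is a common sign of overfitting in later epochs. If the run used the Transformers `Trainer`, the `load_best_model_at_end` and `metric_for_best_model` training arguments are one way to keep an earlier checkpoint. Whether the author used them is not stated in the card.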
 ### Framework versions
+- Transformers 4.38.0
+- Pytorch 2.2.0+cu121
+- Datasets 2.17.1
+- Tokenizers 0.15.2

model-00001-of-00002.safetensors

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b12235b63ac44a8329b7d3a0eb9037cfdc5a58506dab027b79f483b225b35a2c
 size 4986380064

 version https://git-lfs.github.com/spec/v1
+oid sha256:8bed09cde4d68e6bc535c04f10b23a3e26e69a8ff339f713cadb9fc4e0fcec74
 size 4986380064

model-00002-of-00002.safetensors

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:22b5df95184f9bedaacebadeca53ff08038f19e702baf08966b04ac4dcec103e
 size 135308016

 version https://git-lfs.github.com/spec/v1
+oid sha256:5c14f821faea4426fa1d849bdbbb12d5a0247877919215386cae53efef6c2219
 size 135308016