gokulsrinivasagan commited on
Commit
2cd2f71
·
verified ·
1 Parent(s): f9b8a64

Model save

Browse files
README.md ADDED
@@ -0,0 +1,96 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: transformers
3
+ license: apache-2.0
4
+ base_model: distilbert-base-uncased
5
+ tags:
6
+ - generated_from_trainer
7
+ metrics:
8
+ - accuracy
9
+ model-index:
10
+ - name: distilbert_base_train_book_v2
11
+ results: []
12
+ ---
13
+
14
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
15
+ should probably proofread and complete it, then remove this comment. -->
16
+
17
+ # distilbert_base_train_book_v2
18
+
19
+ This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on an unknown dataset.
20
+ It achieves the following results on the evaluation set:
21
+ - Loss: 1.2005
22
+ - Accuracy: 0.7300
23
+
24
+ ## Model description
25
+
26
+ More information needed
27
+
28
+ ## Intended uses & limitations
29
+
30
+ More information needed
31
+
32
+ ## Training and evaluation data
33
+
34
+ More information needed
35
+
36
+ ## Training procedure
37
+
38
+ ### Training hyperparameters
39
+
40
+ The following hyperparameters were used during training:
41
+ - learning_rate: 0.0001
42
+ - train_batch_size: 160
43
+ - eval_batch_size: 160
44
+ - seed: 10
45
+ - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
46
+ - lr_scheduler_type: linear
47
+ - lr_scheduler_warmup_steps: 10000
48
+ - num_epochs: 25
49
+
50
+ ### Training results
51
+
52
+ | Training Loss | Epoch | Step | Validation Loss | Accuracy |
53
+ |:-------------:|:-------:|:------:|:---------------:|:--------:|
54
+ | 5.604 | 0.7025 | 10000 | 5.4504 | 0.1650 |
55
+ | 4.6179 | 1.4051 | 20000 | 3.8277 | 0.3758 |
56
+ | 2.353 | 2.1076 | 30000 | 2.0408 | 0.5933 |
57
+ | 2.0739 | 2.8102 | 40000 | 1.7932 | 0.6316 |
58
+ | 1.9345 | 3.5127 | 50000 | 1.6618 | 0.6527 |
59
+ | 1.844 | 4.2153 | 60000 | 1.5829 | 0.6653 |
60
+ | 1.7874 | 4.9178 | 70000 | 1.5248 | 0.6750 |
61
+ | 1.737 | 5.6203 | 80000 | 1.4824 | 0.6819 |
62
+ | 1.7017 | 6.3229 | 90000 | 1.4506 | 0.6876 |
63
+ | 1.6703 | 7.0254 | 100000 | 1.4204 | 0.6921 |
64
+ | 1.6497 | 7.7280 | 110000 | 1.3988 | 0.6961 |
65
+ | 1.6245 | 8.4305 | 120000 | 1.3766 | 0.6996 |
66
+ | 1.6015 | 9.1331 | 130000 | 1.3628 | 0.7019 |
67
+ | 1.5882 | 9.8356 | 140000 | 1.3451 | 0.7052 |
68
+ | 1.5738 | 10.5381 | 150000 | 1.3310 | 0.7076 |
69
+ | 1.563 | 11.2407 | 160000 | 1.3214 | 0.7091 |
70
+ | 1.5473 | 11.9432 | 170000 | 1.3087 | 0.7113 |
71
+ | 1.5364 | 12.6458 | 180000 | 1.2944 | 0.7135 |
72
+ | 1.5257 | 13.3483 | 190000 | 1.2905 | 0.7146 |
73
+ | 1.5164 | 14.0509 | 200000 | 1.2789 | 0.7161 |
74
+ | 1.5071 | 14.7534 | 210000 | 1.2702 | 0.7176 |
75
+ | 1.4972 | 15.4560 | 220000 | 1.2618 | 0.7193 |
76
+ | 1.4915 | 16.1585 | 230000 | 1.2573 | 0.7201 |
77
+ | 1.4824 | 16.8610 | 240000 | 1.2515 | 0.7211 |
78
+ | 1.4748 | 17.5636 | 250000 | 1.2450 | 0.7223 |
79
+ | 1.4686 | 18.2661 | 260000 | 1.2389 | 0.7234 |
80
+ | 1.4649 | 18.9687 | 270000 | 1.2333 | 0.7243 |
81
+ | 1.4566 | 19.6712 | 280000 | 1.2285 | 0.7253 |
82
+ | 1.4529 | 20.3738 | 290000 | 1.2230 | 0.7261 |
83
+ | 1.4451 | 21.0763 | 300000 | 1.2189 | 0.7269 |
84
+ | 1.443 | 21.7788 | 310000 | 1.2136 | 0.7278 |
85
+ | 1.4357 | 22.4814 | 320000 | 1.2100 | 0.7284 |
86
+ | 1.4327 | 23.1839 | 330000 | 1.2068 | 0.7290 |
87
+ | 1.4309 | 23.8865 | 340000 | 1.2040 | 0.7295 |
88
+ | 1.4281 | 24.5890 | 350000 | 1.2005 | 0.7300 |
89
+
90
+
91
+ ### Framework versions
92
+
93
+ - Transformers 4.46.1
94
+ - Pytorch 2.2.0+cu121
95
+ - Datasets 3.1.0
96
+ - Tokenizers 0.20.1
logs/events.out.tfevents.1738095365.ki-g0009.121324.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:ff2fc2a7fe214efee9a7f569f789508bf59f9369c19158a8bf335319e5043c03
3
- size 166821
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4e885cea1d7ee30aaf81a31773becd0946cdecd8b65b0822ab12a7b5ca687f1e
3
+ size 169546