rajora commited on
Commit
0f3f7de
1 Parent(s): 1074690

End of training

Browse files
README.md CHANGED
@@ -3,8 +3,6 @@ license: apache-2.0
3
  base_model: distilbert-base-multilingual-cased
4
  tags:
5
  - generated_from_trainer
6
- metrics:
7
- - accuracy
8
  model-index:
9
  - name: distilbert-multilingual-sentiment
10
  results: []
@@ -17,8 +15,13 @@ should probably proofread and complete it, then remove this comment. -->
17
 
18
  This model is a fine-tuned version of [distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: 1.3550
21
- - Accuracy: 0.7609
 
 
 
 
 
22
 
23
  ## Model description
24
 
@@ -43,18 +46,7 @@ The following hyperparameters were used during training:
43
  - seed: 42
44
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
45
  - lr_scheduler_type: linear
46
- - num_epochs: 5
47
-
48
- ### Training results
49
-
50
- | Training Loss | Epoch | Step | Validation Loss | Accuracy |
51
- |:-------------:|:-----:|:----:|:---------------:|:--------:|
52
- | 0.7301 | 1.0 | 1065 | 0.7257 | 0.6952 |
53
- | 0.4373 | 2.0 | 2130 | 0.6549 | 0.7450 |
54
- | 0.3726 | 3.0 | 3195 | 0.7610 | 0.7473 |
55
- | 0.1781 | 4.0 | 4260 | 1.0566 | 0.7581 |
56
- | 0.0609 | 5.0 | 5325 | 1.3550 | 0.7609 |
57
-
58
 
59
  ### Framework versions
60
 
 
3
  base_model: distilbert-base-multilingual-cased
4
  tags:
5
  - generated_from_trainer
 
 
6
  model-index:
7
  - name: distilbert-multilingual-sentiment
8
  results: []
 
15
 
16
  This model is a fine-tuned version of [distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
+ - eval_loss: 1.3619
19
+ - eval_accuracy: 0.7435
20
+ - eval_runtime: 25.7389
21
+ - eval_samples_per_second: 82.715
22
+ - eval_steps_per_second: 5.206
23
+ - epoch: 6.0
24
+ - step: 6390
25
 
26
  ## Model description
27
 
 
46
  - seed: 42
47
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
48
  - lr_scheduler_type: linear
49
+ - num_epochs: 8
 
 
 
 
 
 
 
 
 
 
 
50
 
51
  ### Framework versions
52
 
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:5b6e84cbc602d4cf6deb96117f6d271ecb35a420be3e5b05924c6dcbbe0c035e
3
  size 541320452
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f4d83f472f6f700f2f457e7a6d076530689105883b45b4ec9e494349e0565b26
3
  size 541320452
tmp-checkpoint-6500/config.json ADDED
@@ -0,0 +1,36 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_name_or_path": "distilbert-base-multilingual-cased",
3
+ "activation": "gelu",
4
+ "architectures": [
5
+ "DistilBertForSequenceClassification"
6
+ ],
7
+ "attention_dropout": 0.1,
8
+ "dim": 768,
9
+ "dropout": 0.2,
10
+ "hidden_dim": 3072,
11
+ "id2label": {
12
+ "0": "Negative",
13
+ "1": "Neutral",
14
+ "2": "Positive"
15
+ },
16
+ "initializer_range": 0.02,
17
+ "label2id": {
18
+ "Negative": 0,
19
+ "Neutral": 1,
20
+ "Positive": 2
21
+ },
22
+ "max_position_embeddings": 512,
23
+ "model_type": "distilbert",
24
+ "n_heads": 12,
25
+ "n_layers": 6,
26
+ "output_past": true,
27
+ "pad_token_id": 0,
28
+ "problem_type": "single_label_classification",
29
+ "qa_dropout": 0.1,
30
+ "seq_classif_dropout": 0.2,
31
+ "sinusoidal_pos_embds": false,
32
+ "tie_weights_": true,
33
+ "torch_dtype": "float32",
34
+ "transformers_version": "4.36.2",
35
+ "vocab_size": 119547
36
+ }
tmp-checkpoint-6500/model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f4d83f472f6f700f2f457e7a6d076530689105883b45b4ec9e494349e0565b26
3
+ size 541320452
tmp-checkpoint-6500/optimizer.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c09aa06362d3f294bafdb75aba0cf291fb991d864934bbb3055758069561a2cc
3
+ size 368717824
tmp-checkpoint-6500/training_args.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f63087a3112377d66ce3d82b41fc66701944b06d4a0637eac91ef5752e0006b2
3
+ size 4219