New version with explicit predicate marking

Browse files

Files changed (6) hide show

README.md +72 -73
config.json +4 -2
model.safetensors +3 -0
tokenizer.json +6 -1
tokenizer_config.json +42 -0
training_args.bin +2 -2

README.md CHANGED Viewed

@@ -13,69 +13,69 @@ should probably proofread and complete it, then remove this comment. -->
 # rubert-electra-srl
-This model is a fine-tuned version of [ai-forever/ruElectra-medium](https://huggingface.co/ai-forever/ruElectra-medium) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1471
-- Addressee Precision: 0.9583
-- Addressee Recall: 0.9020
-- Addressee F1: 0.9293
-- Addressee Number: 51
-- Benefactive Precision: 0.8
-- Benefactive Recall: 0.25
-- Benefactive F1: 0.3810
-- Benefactive Number: 16
-- Causator Precision: 0.8971
-- Causator Recall: 0.8714
-- Causator F1: 0.8841
-- Causator Number: 70
-- Cause Precision: 0.6466
-- Cause Recall: 0.7353
-- Cause F1: 0.6881
-- Cause Number: 102
-- Contrsubject Precision: 0.832
-- Contrsubject Recall: 0.7879
-- Contrsubject F1: 0.8093
-- Contrsubject Number: 132
-- Deliberative Precision: 0.6269
-- Deliberative Recall: 0.84
-- Deliberative F1: 0.7179
-- Deliberative Number: 50
 - Destinative Precision: 1.0
-- Destinative Recall: 0.3871
-- Destinative F1: 0.5581
-- Destinative Number: 31
-- Directivefinal Precision: 0.5455
-- Directivefinal Recall: 0.6667
-- Directivefinal F1: 0.6
-- Directivefinal Number: 9
-- Experiencer Precision: 0.8669
-- Experiencer Recall: 0.8609
-- Experiencer F1: 0.8639
-- Experiencer Number: 726
-- Instrument Precision: 0.5
-- Instrument Recall: 0.3333
-- Instrument F1: 0.4
-- Instrument Number: 9
 - Limitative Precision: 0.0
 - Limitative Recall: 0.0
 - Limitative F1: 0.0
-- Limitative Number: 4
-- Object Precision: 0.8676
-- Object Recall: 0.8703
-- Object F1: 0.8689
-- Object Number: 1611
-- Overall Precision: 0.8515
-- Overall Recall: 0.8467
-- Overall F1: 0.8491
-- Overall Accuracy: 0.9687
-- Directiveinitial Recall: 0.0
-- Directiveinitial Number: 0.0
-- Directiveinitial Precision: 0.0
-- Directiveinitial F1: 0.0
-- Mediative Recall: 0.0
 - Mediative Number: 0.0
-- Mediative Precision: 0.0
 - Mediative F1: 0.0
 ## Model description
@@ -94,31 +94,30 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.000261433658985083
 - train_batch_size: 1
 - eval_batch_size: 1
-- seed: 510754
-- gradient_accumulation_steps: 8
-- total_train_batch_size: 8
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- lr_scheduler_warmup_ratio: 0.3
-- num_epochs: 5
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Addressee Precision | Addressee Recall | Addressee F1 | Addressee Number | Benefactive Precision | Benefactive Recall | Benefactive F1 | Benefactive Number | Causator Precision | Causator Recall | Causator F1 | Causator Number | Cause Precision | Cause Recall | Cause F1 | Cause Number | Contrsubject Precision | Contrsubject Recall | Contrsubject F1 | Contrsubject Number | Deliberative Precision | Deliberative Recall | Deliberative F1 | Deliberative Number | Destinative Precision | Destinative Recall | Destinative F1 | Destinative Number | Directivefinal Precision | Directivefinal Recall | Directivefinal F1 | Directivefinal Number | Experiencer Precision | Experiencer Recall | Experiencer F1 | Experiencer Number | Instrument Precision | Instrument Recall | Instrument F1 | Instrument Number | Limitative Precision | Limitative Recall | Limitative F1 | Limitative Number | Object Precision | Object Recall | Object F1 | Object Number | Overall Precision | Overall Recall | Overall F1 | Overall Accuracy | Directiveinitial Recall | Directiveinitial Number | Directiveinitial Precision | Directiveinitial F1 | Mediative Recall | Mediative Number | Mediative Precision | Mediative F1 |
-|:-------------:|:-----:|:----:|:---------------:|:-------------------:|:----------------:|:------------:|:----------------:|:---------------------:|:------------------:|:--------------:|:------------------:|:------------------:|:---------------:|:-----------:|:---------------:|:---------------:|:------------:|:--------:|:------------:|:----------------------:|:-------------------:|:---------------:|:-------------------:|:----------------------:|:-------------------:|:---------------:|:-------------------:|:---------------------:|:------------------:|:--------------:|:------------------:|:------------------------:|:---------------------:|:-----------------:|:---------------------:|:---------------------:|:------------------:|:--------------:|:------------------:|:--------------------:|:-----------------:|:-------------:|:-----------------:|:--------------------:|:-----------------:|:-------------:|:-----------------:|:----------------:|:-------------:|:---------:|:-------------:|:-----------------:|:--------------:|:----------:|:----------------:|:-----------------------:|:-----------------------:|:--------------------------:|:-------------------:|:----------------:|:----------------:|:-------------------:|:------------:|
-| 0.2154        | 1.0   | 763  | 0.2074          | 0.6842              | 0.5098           | 0.5843       | 51               | 0.0                   | 0.0                | 0.0            | 16                 | 0.1946             | 0.8286          | 0.3152      | 70              | 1.0             | 0.0098       | 0.0194   | 102          | 0.2                    | 0.0076              | 0.0146          | 132                 | 0.0                    | 0.0                 | 0.0             | 50                  | 0.0                   | 0.0                | 0.0            | 31                 | 0.0                      | 0.0                   | 0.0               | 9                     | 0.6747                | 0.7713             | 0.7198         | 726                | 0.0                  | 0.0               | 0.0           | 9                 | 0.0                  | 0.0               | 0.0           | 4                 | 0.8199           | 0.7263        | 0.7702    | 1611          | 0.6987            | 0.6460         | 0.6713     | 0.9433           | 0.0                     | 0.0                     | 0.0                        | 0.0                 | 0.0              | 0.0              | 0.0                 | 0.0          |
-| 0.2294        | 2.0   | 1526 | 0.2028          | 0.7460              | 0.9216           | 0.8246       | 51               | 0.0                   | 0.0                | 0.0            | 16                 | 0.0                | 0.0             | 0.0         | 70              | 0.3333          | 0.0098       | 0.0190   | 102          | 0.7791                 | 0.5076              | 0.6147          | 132                 | 0.22                   | 0.88                | 0.352           | 50                  | 0.0                   | 0.0                | 0.0            | 31                 | 0.6667                   | 0.6667                | 0.6667            | 9                     | 0.8822                | 0.6708             | 0.7621         | 726                | 0.0                  | 0.0               | 0.0           | 9                 | 0.0                  | 0.0               | 0.0           | 4                 | 0.7332           | 0.7914        | 0.7612    | 1611          | 0.7255            | 0.6855         | 0.7050     | 0.9417           | 0.0                     | 0.0                     | 0.0                        | 0.0                 | 0.0              | 0.0              | 0.0                 | 0.0          |
-| 0.132         | 3.0   | 2290 | 0.1485          | 0.7188              | 0.9020           | 0.8          | 51               | 0.0                   | 0.0                | 0.0            | 16                 | 0.6854             | 0.8714          | 0.7673      | 70              | 0.4079          | 0.3039       | 0.3483   | 102          | 0.6562                 | 0.7955              | 0.7192          | 132                 | 0.5263                 | 0.4                 | 0.4545          | 50                  | 0.0                   | 0.0                | 0.0            | 31                 | 0.6                      | 0.6667                | 0.6316            | 9                     | 0.8289                | 0.8609             | 0.8446         | 726                | 0.0                  | 0.0               | 0.0           | 9                 | 0.0                  | 0.0               | 0.0           | 4                 | 0.8013           | 0.8610        | 0.8300    | 1611          | 0.7806            | 0.8115         | 0.7957     | 0.9574           | 0.0                     | 0.0                     | 0.0                        | 0.0                 | 0.0              | 0.0              | 0.0                 | 0.0          |
-| 0.0748        | 4.0   | 3053 | 0.1382          | 0.9038              | 0.9216           | 0.9126       | 51               | 0.1905                | 0.25               | 0.2162         | 16                 | 0.9104             | 0.8714          | 0.8905      | 70              | 0.5859          | 0.7353       | 0.6522   | 102          | 0.825                  | 0.75                | 0.7857          | 132                 | 0.4875                 | 0.78                | 0.6             | 50                  | 0.0                   | 0.0                | 0.0            | 31                 | 0.4615                   | 0.6667                | 0.5455            | 9                     | 0.9033                | 0.8237             | 0.8617         | 726                | 0.4                  | 0.2222            | 0.2857        | 9                 | 0.0                  | 0.0               | 0.0           | 4                 | 0.8468           | 0.8678        | 0.8571    | 1611          | 0.8321            | 0.8285         | 0.8303     | 0.9659           | 0.0                     | 0.0                     | 0.0                        | 0.0                 | 0.0              | 0.0              | 0.0                 | 0.0          |
-| 0.0504        | 5.0   | 3815 | 0.1471          | 0.9583              | 0.9020           | 0.9293       | 51               | 0.8                   | 0.25               | 0.3810         | 16                 | 0.8971             | 0.8714          | 0.8841      | 70              | 0.6466          | 0.7353       | 0.6881   | 102          | 0.832                  | 0.7879              | 0.8093          | 132                 | 0.6269                 | 0.84                | 0.7179          | 50                  | 1.0                   | 0.3871             | 0.5581         | 31                 | 0.5455                   | 0.6667                | 0.6               | 9                     | 0.8669                | 0.8609             | 0.8639         | 726                | 0.5                  | 0.3333            | 0.4           | 9                 | 0.0                  | 0.0               | 0.0           | 4                 | 0.8676           | 0.8703        | 0.8689    | 1611          | 0.8515            | 0.8467         | 0.8491     | 0.9687           | 0.0                     | 0.0                     | 0.0                        | 0.0                 | 0.0              | 0.0              | 0.0                 | 0.0          |
 ### Framework versions
-- Transformers 4.33.2
-- Pytorch 2.0.1+cu117
-- Datasets 2.14.5
-- Tokenizers 0.13.3

 # rubert-electra-srl
+This model is a fine-tuned version of [ai-forever/ruElectra-medium](https://huggingface.co/ai-forever/ruElectra-medium) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0564
+- Addressee Precision: 0.8710
+- Addressee Recall: 0.9153
+- Addressee F1: 0.8926
+- Addressee Number: 59
+- Benefactive Precision: 0.0
+- Benefactive Recall: 0.0
+- Benefactive F1: 0.0
+- Benefactive Number: 8
+- Causator Precision: 0.9007
+- Causator Recall: 0.9379
+- Causator F1: 0.9189
+- Causator Number: 145
+- Cause Precision: 0.8491
+- Cause Recall: 0.7895
+- Cause F1: 0.8182
+- Cause Number: 114
+- Contrsubject Precision: 0.872
+- Contrsubject Recall: 0.9008
+- Contrsubject F1: 0.8862
+- Contrsubject Number: 121
+- Deliberative Precision: 0.7439
+- Deliberative Recall: 0.9385
+- Deliberative F1: 0.8299
+- Deliberative Number: 65
 - Destinative Precision: 1.0
+- Destinative Recall: 0.5238
+- Destinative F1: 0.6875
+- Destinative Number: 21
+- Directivefinal Precision: 1.0
+- Directivefinal Recall: 0.7
+- Directivefinal F1: 0.8235
+- Directivefinal Number: 10
+- Experiencer Precision: 0.9132
+- Experiencer Recall: 0.9374
+- Experiencer F1: 0.9252
+- Experiencer Number: 1055
+- Instrument Precision: 0.8409
+- Instrument Recall: 0.7255
+- Instrument F1: 0.7789
+- Instrument Number: 51
 - Limitative Precision: 0.0
 - Limitative Recall: 0.0
 - Limitative F1: 0.0
+- Limitative Number: 3
+- Object Precision: 0.9449
+- Object Recall: 0.9389
+- Object F1: 0.9419
+- Object Number: 1898
+- Overall Precision: 0.9210
+- Overall Recall: 0.9228
+- Overall F1: 0.9219
+- Overall Accuracy: 0.9855
 - Mediative Number: 0.0
 - Mediative F1: 0.0
+- Mediative Precision: 0.0
+- Mediative Recall: 0.0
+- Directiveinitial Number: 0.0
+- Directiveinitial F1: 0.0
+- Directiveinitial Precision: 0.0
+- Directiveinitial Recall: 0.0
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 0.00016666401556632117
 - train_batch_size: 1
 - eval_batch_size: 1
+- seed: 708526
+- gradient_accumulation_steps: 4
+- total_train_batch_size: 4
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- lr_scheduler_warmup_ratio: 0.21
+- num_epochs: 3
+- mixed_precision_training: Native AMP
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | Addressee Precision | Addressee Recall | Addressee F1 | Addressee Number | Benefactive Precision | Benefactive Recall | Benefactive F1 | Benefactive Number | Causator Precision | Causator Recall | Causator F1 | Causator Number | Cause Precision | Cause Recall | Cause F1 | Cause Number | Contrsubject Precision | Contrsubject Recall | Contrsubject F1 | Contrsubject Number | Deliberative Precision | Deliberative Recall | Deliberative F1 | Deliberative Number | Destinative Precision | Destinative Recall | Destinative F1 | Destinative Number | Directivefinal Precision | Directivefinal Recall | Directivefinal F1 | Directivefinal Number | Experiencer Precision | Experiencer Recall | Experiencer F1 | Experiencer Number | Instrument Precision | Instrument Recall | Instrument F1 | Instrument Number | Limitative Precision | Limitative Recall | Limitative F1 | Limitative Number | Object Precision | Object Recall | Object F1 | Object Number | Overall Precision | Overall Recall | Overall F1 | Overall Accuracy | Mediative Number | Mediative F1 | Mediative Precision | Mediative Recall | Directiveinitial Number | Directiveinitial F1 | Directiveinitial Precision | Directiveinitial Recall |
+|:-------------:|:-----:|:----:|:---------------:|:-------------------:|:----------------:|:------------:|:----------------:|:---------------------:|:------------------:|:--------------:|:------------------:|:------------------:|:---------------:|:-----------:|:---------------:|:---------------:|:------------:|:--------:|:------------:|:----------------------:|:-------------------:|:---------------:|:-------------------:|:----------------------:|:-------------------:|:---------------:|:-------------------:|:---------------------:|:------------------:|:--------------:|:------------------:|:------------------------:|:---------------------:|:-----------------:|:---------------------:|:---------------------:|:------------------:|:--------------:|:------------------:|:--------------------:|:-----------------:|:-------------:|:-----------------:|:--------------------:|:-----------------:|:-------------:|:-----------------:|:----------------:|:-------------:|:---------:|:-------------:|:-----------------:|:--------------:|:----------:|:----------------:|:----------------:|:------------:|:-------------------:|:----------------:|:-----------------------:|:-------------------:|:--------------------------:|:-----------------------:|
+| 0.574         | 1.0   | 2942 | 0.5853          | 0.0                 | 0.0              | 0.0          | 59               | 0.0                   | 0.0                | 0.0            | 8                  | 0.0                | 0.0             | 0.0         | 145             | 0.0             | 0.0          | 0.0      | 114          | 0.0                    | 0.0                 | 0.0             | 121                 | 0.0                    | 0.0                 | 0.0             | 65                  | 0.0                   | 0.0                | 0.0            | 21                 | 0.0                      | 0.0                   | 0.0               | 10                    | 0.0                   | 0.0                | 0.0            | 1055               | 0.0                  | 0.0               | 0.0           | 51                | 0.0                  | 0.0               | 0.0           | 3                 | 0.0              | 0.0           | 0.0       | 1898          | 0.0               | 0.0            | 0.0        | 0.8893           | 0.0              | 0.0          | 0.0                 | 0.0              | 0.0                     | 0.0                 | 0.0                        | 0.0                     |
+| 0.1625        | 2.0   | 5884 | 0.1573          | 0.5714              | 0.8136           | 0.6713       | 59               | 0.0                   | 0.0                | 0.0            | 8                  | 0.6966             | 0.8552          | 0.7678      | 145             | 0.3186          | 0.6316       | 0.4235   | 114          | 0.6875                 | 0.4545              | 0.5473          | 121                 | 0.0                    | 0.0                 | 0.0             | 65                  | 0.0                   | 0.0                | 0.0            | 21                 | 0.0                      | 0.0                   | 0.0               | 10                    | 0.8504                | 0.8246             | 0.8373         | 1055               | 0.4769               | 0.6078            | 0.5345        | 51                | 0.0                  | 0.0               | 0.0           | 3                 | 0.8923           | 0.8161        | 0.8525    | 1898          | 0.8104            | 0.7744         | 0.7920     | 0.9634           | 0.0              | 0.0          | 0.0                 | 0.0              | 0.0                     | 0.0                 | 0.0                        | 0.0                     |
+| 0.0838        | 3.0   | 8826 | 0.0564          | 0.8710              | 0.9153           | 0.8926       | 59               | 0.0                   | 0.0                | 0.0            | 8                  | 0.9007             | 0.9379          | 0.9189      | 145             | 0.8491          | 0.7895       | 0.8182   | 114          | 0.872                  | 0.9008              | 0.8862          | 121                 | 0.7439                 | 0.9385              | 0.8299          | 65                  | 1.0                   | 0.5238             | 0.6875         | 21                 | 1.0                      | 0.7                   | 0.8235            | 10                    | 0.9132                | 0.9374             | 0.9252         | 1055               | 0.8409               | 0.7255            | 0.7789        | 51                | 0.0                  | 0.0               | 0.0           | 3                 | 0.9449           | 0.9389        | 0.9419    | 1898          | 0.9210            | 0.9228         | 0.9219     | 0.9855           | 0.0              | 0.0          | 0.0                 | 0.0              | 0.0                     | 0.0                 | 0.0                        | 0.0                     |
 ### Framework versions
+- Transformers 4.42.4
+- Pytorch 2.3.1+cu121
+- Datasets 2.20.0
+- Tokenizers 0.19.1

config.json CHANGED Viewed

@@ -32,7 +32,8 @@
     "18": "B-DirectiveInitial",
     "19": "I-DirectiveInitial",
     "20": "I-Experiencer",
-    "21": "I-Cause"
   },
   "initializer_range": 0.02,
   "intermediate_size": 2304,
@@ -51,6 +52,7 @@
     "B-Limitative": 14,
     "B-Mediative": 16,
     "B-Object": 1,
     "I-Cause": 21,
     "I-ContrSubject": 11,
     "I-Deliberative": 13,
@@ -72,7 +74,7 @@
   "summary_type": "first",
   "summary_use_proj": true,
   "torch_dtype": "float32",
-  "transformers_version": "4.33.2",
   "type_vocab_size": 2,
   "use_cache": true,
   "vocab_size": 64000

     "18": "B-DirectiveInitial",
     "19": "I-DirectiveInitial",
     "20": "I-Experiencer",
+    "21": "I-Cause",
+    "22": "I-Causator"
   },
   "initializer_range": 0.02,
   "intermediate_size": 2304,
     "B-Limitative": 14,
     "B-Mediative": 16,
     "B-Object": 1,
+    "I-Causator": 22,
     "I-Cause": 21,
     "I-ContrSubject": 11,
     "I-Deliberative": 13,
   "summary_type": "first",
   "summary_use_proj": true,
   "torch_dtype": "float32",
+  "transformers_version": "4.42.4",
   "type_vocab_size": 2,
   "use_cache": true,
   "vocab_size": 64000

model.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:8dc983e23ebf1a46a93b9ffb6bb8dfcc4f96b632c2282ab78edd817e53106b5c
+size 340184276

tokenizer.json CHANGED Viewed

@@ -1,6 +1,11 @@
 {
   "version": "1.0",
-  "truncation": null,
   "padding": null,
   "added_tokens": [
     {

 {
   "version": "1.0",
+  "truncation": {
+    "direction": "Right",
+    "max_length": 2048,
+    "strategy": "LongestFirst",
+    "stride": 0
+  },
   "padding": null,
   "added_tokens": [
     {

tokenizer_config.json CHANGED Viewed

@@ -1,4 +1,46 @@
 {
   "clean_up_tokenization_spaces": true,
   "cls_token": "[CLS]",
   "do_basic_tokenize": true,

 {
+  "added_tokens_decoder": {
+    "0": {
+      "content": "[PAD]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "1": {
+      "content": "[UNK]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "2": {
+      "content": "[CLS]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "3": {
+      "content": "[SEP]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "4": {
+      "content": "[MASK]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    }
+  },
   "clean_up_tokenization_spaces": true,
   "cls_token": "[CLS]",
   "do_basic_tokenize": true,

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d6b76fd56499805942ca588f2f290c4c0a3e7c80b80ef2c2b659e065090c0acb
-size 4091

 version https://git-lfs.github.com/spec/v1
+oid sha256:6e293be6da178e0dad7a21799fcc806409ec27abe94b264bcf829fab4f996651
+size 5240