gokulsrinivasagan committed on
Commit d13dda2 · verified · 1 Parent(s): 2cd2f71

End of training

README.md CHANGED
@@ -4,11 +4,23 @@ license: apache-2.0
  base_model: distilbert-base-uncased
  tags:
  - generated_from_trainer
+ datasets:
+ - gokulsrinivasagan/processed_book_corpus-ld
  metrics:
  - accuracy
  model-index:
  - name: distilbert_base_train_book_v2
-   results: []
+   results:
+   - task:
+       name: Masked Language Modeling
+       type: fill-mask
+     dataset:
+       name: gokulsrinivasagan/processed_book_corpus-ld
+       type: gokulsrinivasagan/processed_book_corpus-ld
+     metrics:
+     - name: Accuracy
+       type: accuracy
+       value: 0.7300233078924688
  ---
  
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -16,9 +28,9 @@ should probably proofread and complete it, then remove this comment. -->
  
  # distilbert_base_train_book_v2
  
- This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on an unknown dataset.
+ This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the gokulsrinivasagan/processed_book_corpus-ld dataset.
  It achieves the following results on the evaluation set:
- - Loss: 1.2005
+ - Loss: 1.2024
  - Accuracy: 0.7300
  
  ## Model description
all_results.json ADDED
@@ -0,0 +1,16 @@
+ {
+   "epoch": 25.0,
+   "eval_accuracy": 0.7300233078924688,
+   "eval_loss": 1.202447533607483,
+   "eval_runtime": 143.1897,
+   "eval_samples": 120706,
+   "eval_samples_per_second": 842.98,
+   "eval_steps_per_second": 5.273,
+   "perplexity": 3.328252973052478,
+   "total_flos": 7.547177208649421e+18,
+   "train_loss": 1.849292092869351,
+   "train_runtime": 122993.5623,
+   "train_samples": 2277342,
+   "train_samples_per_second": 462.899,
+   "train_steps_per_second": 2.893
+ }
eval_results.json ADDED
@@ -0,0 +1,10 @@
+ {
+   "epoch": 25.0,
+   "eval_accuracy": 0.7300233078924688,
+   "eval_loss": 1.202447533607483,
+   "eval_runtime": 143.1897,
+   "eval_samples": 120706,
+   "eval_samples_per_second": 842.98,
+   "eval_steps_per_second": 5.273,
+   "perplexity": 3.328252973052478
+ }
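
The `perplexity` field above appears to be derived from `eval_loss`. As a minimal sketch, assuming perplexity is the exponential of the mean cross-entropy loss in nats (a common convention for masked language modeling, not something this commit states), the reported value can be checked like this:

```python
import math

# Values copied from eval_results.json in this commit.
eval_loss = 1.202447533607483
reported_perplexity = 3.328252973052478

# Assumption: perplexity = exp(mean cross-entropy loss in nats).
perplexity = math.exp(eval_loss)

print(f"computed perplexity: {perplexity:.6f}")
print("consistent with report:",
      math.isclose(perplexity, reported_perplexity, rel_tol=1e-6))
```

Under that assumption the two numbers agree, which also explains why the README loss changes from 1.2005 to 1.2024: the card now reflects the final evaluation recorded in this file.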
logs/events.out.tfevents.1738218533.ki-g0009.121324.1 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:8489158806c095b7ea29528202a5b058b0bb8aae32ca47a806d2edcdc6e30834
+ size 417
train_results.json ADDED
@@ -0,0 +1,9 @@
+ {
+   "epoch": 25.0,
+   "total_flos": 7.547177208649421e+18,
+   "train_loss": 1.849292092869351,
+   "train_runtime": 122993.5623,
+   "train_samples": 2277342,
+   "train_samples_per_second": 462.899,
+   "train_steps_per_second": 2.893
+ }
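
The throughput figures in `train_results.json` can be sanity-checked against each other. The sketch below assumes `train_samples_per_second` is roughly `train_samples * epoch / train_runtime`, with `train_runtime` in seconds; that relationship is an assumption about how the numbers were produced, not something the commit documents.

```python
# Values copied from train_results.json in this commit.
train_samples = 2277342
epochs = 25.0
train_runtime_s = 122993.5623
reported_samples_per_second = 462.899

# Assumption: throughput = total samples seen across all epochs / wall-clock seconds.
samples_per_second = train_samples * epochs / train_runtime_s
runtime_hours = train_runtime_s / 3600

print(f"computed samples/s: {samples_per_second:.3f}")
print(f"reported samples/s: {reported_samples_per_second:.3f}")
print(f"training wall-clock: {runtime_hours:.1f} hours")
```

The computed rate lands within rounding distance of the reported one, and the runtime works out to a little over 34 hours for the 25 epochs.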
trainer_state.json ADDED
The diff for this file is too large to render. See raw diff