Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
bertin-project
/
bertin-base-stepwise
like
0
Follow
BERTIN Project
19
Fill-Mask
Transformers
PyTorch
JAX
TensorBoard
Joblib
Spanish
roberta
spanish
Inference Endpoints
License:
cc-by-4.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Deploy
Use this model
0f5bde8
bertin-base-stepwise
/
outputs
3 contributors
History:
35 commits
versae
Step... (247000/250000 | Loss: 1.7176971435546875, Acc: 0.6535604000091553): 99%|βββββββββββββββββββββββ| 247327/250000 [34:04:54<1:23:09, 1.87s/it]
0f5bde8
over 3 years ago
checkpoints
Step... (247000/250000 | Loss: 1.7176971435546875, Acc: 0.6535604000091553): 99%|βββββββββββββββββββββββ| 247327/250000 [34:04:54<1:23:09, 1.87s/it]
over 3 years ago
config.json
Safe
618 Bytes
Training dump
over 3 years ago
data_collator.joblib
pickle
Detected Pickle imports (5)
"tokenizers.AddedToken"
,
"__main__.FlaxDataCollatorForLanguageModeling"
,
"tokenizers.models.Model"
,
"transformers.models.roberta.tokenization_roberta_fast.RobertaTokenizerFast"
,
"tokenizers.Tokenizer"
How to fix it?
1.47 MB
LFS
Step... (186000/250000 | Loss: 1.7381113767623901, Acc: 0.6502522826194763): 74%|βββββββββββββββββ | 186090/250000 [2:39:06<32:58:04, 1.86s/it]
over 3 years ago
events.out.tfevents.1626172316.underestimate.4022703.3.v2
Safe
27.7 MB
LFS
Dataset stats
over 3 years ago
events.out.tfevents.1627122688.tablespoon.2185269.3.v2
Safe
40 Bytes
LFS
Step... (186000/250000 | Loss: 1.7381113767623901, Acc: 0.6502522826194763): 74%|βββββββββββββββββ | 186090/250000 [2:39:06<32:58:04, 1.86s/it]
over 3 years ago
events.out.tfevents.1627122817.tablespoon.2191003.3.v2
Safe
149 kB
LFS
Step... (186000/250000 | Loss: 1.7381113767623901, Acc: 0.6502522826194763): 74%|βββββββββββββββββ | 186090/250000 [2:39:06<32:58:04, 1.86s/it]
over 3 years ago
events.out.tfevents.1627125745.tablespoon.2266135.3.v2
Safe
149 kB
LFS
Step... (186000/250000 | Loss: 1.7381113767623901, Acc: 0.6502522826194763): 74%|βββββββββββββββββ | 186090/250000 [2:39:06<32:58:04, 1.86s/it]
over 3 years ago
events.out.tfevents.1627128247.tablespoon.2330108.3.v2
Safe
9.85 MB
LFS
Step... (247000/250000 | Loss: 1.7176971435546875, Acc: 0.6535604000091553): 99%|βββββββββββββββββββββββ| 247327/250000 [34:04:54<1:23:09, 1.87s/it]
over 3 years ago
flax_model.msgpack
Safe
250 MB
LFS
Step... (247000/250000 | Loss: 1.7176971435546875, Acc: 0.6535604000091553): 99%|βββββββββββββββββββββββ| 247327/250000 [34:04:54<1:23:09, 1.87s/it]
over 3 years ago
optimizer_state.msgpack
Safe
500 MB
LFS
Step... (247000/250000 | Loss: 1.7176971435546875, Acc: 0.6535604000091553): 99%|βββββββββββββββββββββββ| 247327/250000 [34:04:54<1:23:09, 1.87s/it]
over 3 years ago
training_args.joblib
pickle
Detected Pickle imports (4)
"transformers.trainer_utils.SchedulerType"
,
"torch.device"
,
"transformers.trainer_utils.IntervalStrategy"
,
"transformers.training_args.TrainingArguments"
How to fix it?
1.87 kB
LFS
Step... (186000/250000 | Loss: 1.7381113767623901, Acc: 0.6502522826194763): 74%|βββββββββββββββββ | 186090/250000 [2:39:06<32:58:04, 1.86s/it]
over 3 years ago
training_state.json
Safe
16 Bytes
Step... (247000/250000 | Loss: 1.7176971435546875, Acc: 0.6535604000091553): 99%|βββββββββββββββββββββββ| 247327/250000 [34:04:54<1:23:09, 1.87s/it]
over 3 years ago