3 contributors

History: 41 commits

elishowk, 83a53bb, about 3 years ago: Automatic correction of README.md metadata. Contact [email protected] for any question

configs
Training dump over 3 years ago
mc4
Step... (186000/250000 | Loss: 1.7381113767623901, Acc: 0.6502522826194763): 74%|█████████████████ | 186090/250000 [2:39:06<32:58:04, 1.86s/it] over 3 years ago
outputs
Step... (249000/250000 | Loss: 1.714625358581543, Acc: 0.6543225646018982): 100%|██████████████████████████| 250000/250000 [35:27:20<00:00, 1.68s/it] over 3 years ago
.gitattributes

823 Bytes

Training dump over 3 years ago
.gitignore

38 Bytes

Training dump over 3 years ago
README.md

1.43 kB

Automatic correction of README.md metadata. Contact [email protected] for any question about 3 years ago
config.json

618 Bytes

PyTorch version 180k steps acc 0.6487 over 3 years ago
convert.py

876 Bytes

PyTorch version 180k steps acc 0.6487 over 3 years ago
flax_model.msgpack

250 MB
LFS

Step... (249000/250000 | Loss: 1.714625358581543, Acc: 0.6543225646018982): 100%|██████████████████████████| 250000/250000 [35:27:20<00:00, 1.68s/it] over 3 years ago
mc4-es-train-50M-steps-stats.csv.tar.gz

544 MB
LFS

Dataset stats over 3 years ago
merges.txt

514 kB

PyTorch version 180k steps acc 0.6487 over 3 years ago
push_to_hub.sh

84 Bytes

Step... (188000/250000 | Loss: 1.7402708530426025, Acc: 0.6501697897911072): 75%|█████████████████▎ | 188213/250000 [3:43:59<31:25:35, 1.83s/it] over 3 years ago
pytorch_model.bin
Detected Pickle imports (4)
- "torch._utils._rebuild_tensor_v2",
- "torch.LongStorage",
- "torch.FloatStorage",
- "collections.OrderedDict"
499 MB
LFS

Step... (249000/250000 | Loss: 1.714625358581543, Acc: 0.6543225646018982): 100%|██████████████████████████| 250000/250000 [35:27:20<00:00, 1.68s/it] over 3 years ago
run_mlm_flax_stream.py

35.2 kB

Step... (186000/250000 | Loss: 1.7381113767623901, Acc: 0.6502522826194763): 74%|█████████████████ | 186090/250000 [2:39:06<32:58:04, 1.86s/it] over 3 years ago
run_stream.128.sh

1 kB

Step... (186000/250000 | Loss: 1.7381113767623901, Acc: 0.6502522826194763): 74%|█████████████████ | 186090/250000 [2:39:06<32:58:04, 1.86s/it] over 3 years ago
run_stream.512.sh

978 Bytes

Step... (186000/250000 | Loss: 1.7381113767623901, Acc: 0.6502522826194763): 74%|█████████████████ | 186090/250000 [2:39:06<32:58:04, 1.86s/it] over 3 years ago
run_stream.sh

929 Bytes

Training dump over 3 years ago
special_tokens_map.json

239 Bytes

PyTorch version 180k steps acc 0.6487 over 3 years ago
tokenizer.json

1.47 MB

PyTorch version 180k steps acc 0.6487 over 3 years ago
tokenizer_config.json

292 Bytes

PyTorch version 180k steps acc 0.6487 over 3 years ago
vocab.json

855 kB

PyTorch version 180k steps acc 0.6487 over 3 years ago
