KBLab/bert-base-swedish-cased-new


robinq committed on Mar 17, 2022

Commit

d21f40f

•

1 Parent(s): 30c6cbb

Create README.md

Files changed (1)

README.md +14 -0


	@@ -0,0 +1,14 @@

+---
+language:
+- sv
+---
+# 🤗 BERT Swedish
+This BERT model was trained using the 🤗 transformers library.
+The model is a regular BERT-base with 110M parameters.
+The model was trained on about 70GB of data, consisting mostly of OSCAR (25GB) and Swedish newspaper text curated by the National Library of Sweden.
+To avoid excessive padding, documents shorter than 512 tokens were concatenated into sequences of 512 tokens, and longer documents were split into multiple 512-token sequences, following https://github.com/huggingface/transformers/blob/master/examples/pytorch/language-modeling/run_mlm.py
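The packing step described above can be sketched roughly as follows. This is a simplified illustration and not the code used for this model: the function name `group_texts` is borrowed from the linked script, but this version works on plain lists of token ids, and dropping the final incomplete block is an assumption about how the remainder was handled.

```python
from itertools import chain


def group_texts(tokenized_docs, max_seq_length=512):
    """Concatenate tokenized documents and cut them into fixed-length blocks.

    tokenized_docs: list of documents, each a list of token ids.
    Returns a list of blocks, each exactly max_seq_length tokens long.
    """
    # Join all documents into one long stream of token ids.
    concatenated = list(chain.from_iterable(tokenized_docs))
    # Assumption: the tail that does not fill a whole block is dropped.
    total_length = (len(concatenated) // max_seq_length) * max_seq_length
    # Slice the stream into consecutive, non-overlapping blocks.
    return [
        concatenated[i : i + max_seq_length]
        for i in range(0, total_length, max_seq_length)
    ]


# Toy example with a block size of 4 instead of 512.
docs = [[1, 2, 3], [4, 5], [6, 7, 8, 9, 10]]
blocks = group_texts(docs, max_seq_length=4)
print(blocks)
```

With this scheme every training sequence is full length, so no padding tokens are needed, at the cost of blocks that may span document boundaries.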
+Training ran for slightly more than 8 epochs with a batch size of 2048, for a total of just under 125k training steps.
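As a rough sanity check on these figures, the implied size of one epoch can be estimated from the step count. The calculation below uses the rounded values from the text (125k steps, 8 epochs) and assumes that the batch size counts sequences and that every sequence holds the full 512 tokens. The function name is illustrative, and the results are approximations, not numbers reported by the authors.

```python
def epoch_size_estimate(steps=125_000, batch_size=2048, epochs=8, seq_len=512):
    """Estimate sequences and tokens per epoch from the training schedule."""
    # Total sequences processed over the whole run.
    sequences_seen = steps * batch_size
    # Rounded inputs: the text gives "a bit more than 8" epochs and
    # "a little less than 125k" steps, so the true value is somewhat lower.
    sequences_per_epoch = sequences_seen // epochs
    # Assumes every sequence is packed to exactly seq_len tokens.
    tokens_per_epoch = sequences_per_epoch * seq_len
    return sequences_per_epoch, tokens_per_epoch


seqs, toks = epoch_size_estimate()
print(f"{seqs:,} sequences per epoch, about {toks / 1e9:.1f}B tokens per epoch")
```

Under these assumptions one pass over the data is on the order of 32 million sequences, or roughly 16 billion tokens.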