Update README.md
README.md CHANGED
@@ -181,6 +181,7 @@ For datasets with predefined `train`, `validation`, and `test` sets, we simply t
 The evaluation results are shown in the table.
 `#Param.` represents the number of parameters in both the input embedding layer and the Transformer layers, while `#Param. w/o Emb.` indicates the number of parameters in the Transformer layers only.
 
+According to our evaluation results, our ModernBERT-Ja-310M achieves **state-of-the-art** performance across the evaluation tasks, even when compared with much larger models.
 Despite being a long-context model capable of processing sequences of up to 8,192 tokens, our ModernBERT-Ja-310M also exhibited strong performance in short-sequence evaluations.
 
 ## Ethical Considerations
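For reference, here is a minimal sketch of how the `#Param.` accounting described above could be approximated, and of basic masked-token prediction with the model. It assumes the model is hosted on Hugging Face under `sbintuitions/modernbert-ja-310m` (an id inferred from the model name, not stated in this diff); the parameter split is an approximation, since the README's exact accounting (e.g., tied or head weights) may differ.

```python
# Hedged sketch, not the authors' evaluation code.
# Assumption: the model is hosted as "sbintuitions/modernbert-ja-310m";
# adjust the id if the actual repository differs.
import torch
from transformers import AutoModelForMaskedLM, AutoTokenizer

model_id = "sbintuitions/modernbert-ja-310m"  # assumed repository id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForMaskedLM.from_pretrained(model_id)
model.eval()

# Approximate the README's two counts: "#Param." (embeddings + Transformer
# layers) vs. "#Param. w/o Emb." (Transformer layers only). The exact
# bookkeeping in the README may differ slightly.
total = sum(p.numel() for p in model.parameters())
emb = sum(p.numel() for p in model.get_input_embeddings().parameters())
print(f"#Param.: {total:,}  #Param. w/o Emb.: {total - emb:,}")

# Masked-token prediction. Using tokenizer.mask_token avoids hard-coding
# the mask token's literal form.
text = f"日本の首都は{tokenizer.mask_token}です。"  # "The capital of Japan is <mask>."
inputs = tokenizer(text, return_tensors="pt")

with torch.no_grad():
    logits = model(**inputs).logits

# Decode the highest-scoring token at the mask position.
mask_pos = (inputs["input_ids"] == tokenizer.mask_token_id).nonzero(as_tuple=True)[1]
predicted_id = logits[0, mask_pos].argmax(dim=-1)
print(tokenizer.decode(predicted_id))
```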