magistermilitum
commited on
Commit
•
6271bb8
1
Parent(s):
bdbf95b
Update README.md
Browse files
README.md
CHANGED
@@ -63,4 +63,14 @@ TRIDIS was trained using a encode-decoder architecture based on a fine-tuned ver
|
|
63 |
This final model operates in a multilingual environment (Latin, Old French, and Old Spanish) and is capable of recognizing several Latin script families (mostly Textualis and Cursiva) in documents produced circa 11th - 16th centuries.
|
64 |
|
65 |
During evaluation, the model showed an accuracy of 94.3% on the validation set and a CER (Character Error Ratio) of about 0.06 to 0.12 on three external unseen datasets
|
66 |
-
and a WER of about 0.14 to 0.26 respectively, which is about 30% lower compared to CRNN+CTC solutions trained on the same corpora.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
63 |
This final model operates in a multilingual environment (Latin, Old French, and Old Spanish) and is capable of recognizing several Latin script families (mostly Textualis and Cursiva) in documents produced circa 11th - 16th centuries.
|
64 |
|
65 |
During evaluation, the model showed an accuracy of 94.3% on the validation set and a CER (Character Error Ratio) of about 0.06 to 0.12 on three external unseen datasets
|
66 |
+
and a WER of about 0.14 to 0.26 respectively, which is about 30% lower compared to CRNN+CTC solutions trained on the same corpora.
|
67 |
+
|
68 |
+
### Other formats
|
69 |
+
A CRNN+CTC version of this model trained on Kraken 4.0 using the same gold-standard annotation is available in Zenodo:
|
70 |
+
|
71 |
+
Torres Aguilar, S., & Jolivet, V. (2024). TRIDIS: HTR model for Multilingual Medieval and Early Modern Documentary Manuscripts (11th-16th) [Data set]. Zenodo. https://doi.org/10.5281/zenodo.10800223
|
72 |
+
|
73 |
+
### Paper
|
74 |
+
A journal paper presenting the scientific basis of this models is also available:
|
75 |
+
|
76 |
+
Torres Aguilar, Sergio, Jolivet, Vincent . La reconnaissance de l'écriture pour les manuscrits documentaires du Moyen Âge, Journal of Data Mining & Digital Humanities, 22 décembre 2023 - https://hal.science/hal-03892163/document
|