mboillet commited on
Commit
9c22c3e
1 Parent(s): c4fbaf9

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +15 -11
README.md CHANGED
@@ -25,12 +25,12 @@ This model performs Handwritten Text Recognition in English on modern documents.
25
 
26
  ## Model description
27
 
28
- The model was trained using the PyLaia library on the RWTH split of the [IAM database](https://fki.tic.heia-fr.ch/databases/iam-handwriting-database).
29
 
30
- For training, text-lines were resized with a fixed height of 128 pixels, keeping the original aspect ratio.
31
 
32
- | split | N lines |
33
- | ----- | ------: |
34
  | train | 6,482 |
35
  | val | 976 |
36
  | test | 2,915 |
@@ -41,22 +41,26 @@ An external 6-gram character language model can be used to improve recognition.
41
 
42
  The model achieves the following results:
43
 
44
- | set | Language model | CER (%) | WER (%) | N lines |
45
  |:------|:---------------| ----------:| -------:|----------:|
46
  | test | no | 8.44 | 24.51 | 2,915 |
47
  | test | yes | 7.50 | 20.98 | 2,915 |
48
 
49
  ## How to use?
50
 
51
- Please refer to the [documentation](https://atr.pages.teklia.com/pylaia/).
52
 
53
  ## Cite us!
54
 
55
  ```bibtex
56
- @inproceedings{pylaia-lib,
57
- author = "Tarride, Solène and Schneider, Yoann and Generali, Marie and Boillet, Melodie and Abadie, Bastien and Kermorvant, Christopher",
58
- title = "Improving Automatic Text Recognition with Language Models in the PyLaia Open-Source Library",
59
- booktitle = "Submitted at ICDAR2024",
60
- year = "2024"
 
 
 
 
61
  }
62
  ```
 
25
 
26
  ## Model description
27
 
28
+ The model was trained using the PyLaia library on the RWTH split of the [IAM](https://fki.tic.heia-fr.ch/databases/iam-handwriting-database) dataset.
29
 
30
+ Training images were resized with a fixed height of 128 pixels, keeping the original aspect ratio.
31
 
32
+ | set | lines |
33
+ | :--- | ------: |
34
  | train | 6,482 |
35
  | val | 976 |
36
  | test | 2,915 |
 
41
 
42
  The model achieves the following results:
43
 
44
+ | set | Language model | CER (%) | WER (%) | lines |
45
  |:------|:---------------| ----------:| -------:|----------:|
46
  | test | no | 8.44 | 24.51 | 2,915 |
47
  | test | yes | 7.50 | 20.98 | 2,915 |
48
 
49
  ## How to use?
50
 
51
+ Please refer to the [PyLaia documentation](https://atr.pages.teklia.com/pylaia/usage/prediction/) to use this model.
52
 
53
  ## Cite us!
54
 
55
  ```bibtex
56
+ @inproceedings{pylaia2024,
57
+ author = {Tarride, Solène and Schneider, Yoann and Generali-Lince, Marie and Boillet, Mélodie and Abadie, Bastien and Kermorvant, Christopher},
58
+ title = {{Improving Automatic Text Recognition with Language Models in the PyLaia Open-Source Library}},
59
+ booktitle = {Document Analysis and Recognition - ICDAR 2024},
60
+ year = {2024},
61
+ publisher = {Springer Nature Switzerland},
62
+ address = {Cham},
63
+ pages = {387--404},
64
+ isbn = {978-3-031-70549-6}
65
  }
66
  ```