File size: 890 Bytes
f201eb3 4e06f3f f201eb3 e979a21 f201eb3 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 |
---
license: mit
datasets:
- OleehyO/latex-formulas
metrics:
- bleu
pipeline_tag: image-to-text
---
# About TexTeller
> [中文版本](./README_zh.md)
TexTeller is a ViT-based model designed for end-to-end formula recognition. It can recognize formulas in natural images and convert them into LaTeX-style formulas.
TexTeller is trained on a larger dataset of image-formula pairs (a 550K dataset available [here](https://huggingface.co./datasets/OleehyO/latex-formulas)), **exhibits superior generalization ability and higher accuracy compared to [LaTeX-OCR](https://github.com/lukas-blecher/LaTeX-OCR)**, which uses approximately 100K data points. This larger dataset enables TexTeller to cover most usage scenarios more effectively.
> For more details, please refer to the [𝐓𝐞𝐱𝐓𝐞𝐥𝐥𝐞𝐫 GitHub repository](https://github.com/OleehyO/TexTeller?tab=readme-ov-file). |