OleehyO
/

TexTeller

vision-encoder-decoder

image-text-to-text

Inference Endpoints

Model card Files Files and versions Community

TexTeller / README.md

OleehyO's picture

Update README.md

4e06f3f verified 9 months ago

|

890 Bytes

	---
	license: mit
	datasets:
	- OleehyO/latex-formulas
	metrics:
	- bleu
	pipeline_tag: image-to-text
	---
	# About TexTeller
	> [中文版本](./README_zh.md)

	TexTeller is a ViT-based model designed for end-to-end formula recognition. It can recognize formulas in natural images and convert them into LaTeX-style formulas.

	TexTeller is trained on a larger dataset of image-formula pairs (a 550K dataset available [here](https://huggingface.co./datasets/OleehyO/latex-formulas)), exhibits superior generalization ability and higher accuracy compared to [LaTeX-OCR](https://github.com/lukas-blecher/LaTeX-OCR), which uses approximately 100K data points. This larger dataset enables TexTeller to cover most usage scenarios more effectively.

	> For more details, please refer to the [𝐓𝐞𝐱𝐓𝐞𝐥𝐥𝐞𝐫 GitHub repository](https://github.com/OleehyO/TexTeller?tab=readme-ov-file).